Starred repositories
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"
A curated list of papers on LLMs and agents for scientific research and development
BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments
Automated Hypothesis Testing with Agentic Sequential Falsifications
Discovering Data-driven Hypotheses in the Wild
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
RewardBench: the first evaluation tool for reward models.
Fine-tune LLM agents with online reinforcement learning
A collection of learning resources for curious software engineers
High-speed Large Language Model Serving for Local Deployment
Solve puzzles. Improve your pytorch.
LLMs as Copilots for Theorem Proving in Lean
Turn (almost) any Python command line program into a full GUI application with one line
Machine Learning Engineering Open Book
21 Lessons, Get Started Building with Generative AI
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams
Sandbox repo for building JS PoCs for OpenLocus
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by ClickHouse and OpenTelemetry.
A chrome extension that helps you by keeping you productive when you are learning from YouTube.
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allow…