Skip to content
View conglu1997's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report conglu1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of formalized statements of conjectures in Lean.

Lean 772 191 Updated Jan 16, 2026

Further computation of R(N) in #321, see https://github.com/teorth/erdosproblems/issues/161.

Python 1 Updated Dec 31, 2025

Official code for StochasTok: Improving Fine-Grained Subword Understanding in LLMs

Python 13 3 Updated Jun 19, 2025

Darwin GΓΆdel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,794 386 Updated Aug 13, 2025

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Python 2,003 379 Updated Dec 19, 2025

Automated Capability Discovery via Foundation Model Self-Exploration

Python 66 4 Updated Feb 12, 2025

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

Python 227 9 Updated Jan 16, 2026

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Dec 8, 2025

πŸ™Œ OpenHands: AI-Driven Development

Python 66,695 8,286 Updated Jan 17, 2026

[ICLR 2025] Automated Design of Agentic Systems

Python 1,492 226 Updated Jan 28, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery πŸ§‘β€πŸ”¬

Jupyter Notebook 11,947 1,743 Updated Dec 19, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,031 293 Updated Jan 14, 2025

aider is AI pair programming in your terminal

Python 39,839 3,820 Updated Jan 4, 2026

METR Task Standard

TypeScript 170 36 Updated Feb 3, 2025

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

518 37 Updated Jan 14, 2026

Must-read Papers on LLM Agents.

2,849 169 Updated Jan 15, 2026

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 65 6 Updated Feb 25, 2025

Code for Stable Control Representations

Python 26 1 Updated Apr 5, 2025

Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"

Python 149 8 Updated Jul 19, 2024

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 202 19 Updated Apr 17, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,089 222 Updated Nov 17, 2025

Official implementation of Reach-Aware Value Estimation (RAVL) from the paper: "The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning."

Python 7 Updated Apr 27, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,364 4,772 Updated Jun 2, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 967 144 Updated Nov 14, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,767 2,923 Updated Sep 2, 2024

Inference code for Llama models

Python 59,066 9,808 Updated Jan 26, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Python 1 Updated Jan 24, 2024

A JAX-based simulator for autonomous driving research.

Python 1,025 124 Updated Oct 23, 2025
Next