conglu1997
🎯
Focusing

A collection of formalized statements of conjectures in Lean.

Lean 761 176 Updated Jan 11, 2026

Further computation of R(N) in #321, see https://github.com/teorth/erdosproblems/issues/161.

Python 1 Updated Dec 31, 2025

Official code for StochasTok: Improving Fine-Grained Subword Understanding in LLMs

Python 13 3 Updated Jun 19, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,789 385 Updated Aug 13, 2025

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Python 1,974 373 Updated Dec 19, 2025

Automated Capability Discovery via Foundation Model Self-Exploration

Python 66 4 Updated Feb 12, 2025

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

Python 226 9 Updated Jan 5, 2026

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Dec 8, 2025

🙌 OpenHands: AI-Driven Development

Python 66,495 8,235 Updated Jan 11, 2026

[ICLR 2025] Automated Design of Agentic Systems

Python 1,485 225 Updated Jan 28, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,925 1,741 Updated Dec 19, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,023 293 Updated Jan 14, 2025

aider is AI pair programming in your terminal

Python 39,696 3,815 Updated Jan 4, 2026

METR Task Standard

TypeScript 169 36 Updated Feb 3, 2025

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

515 36 Updated Jan 11, 2026

Must-read Papers on LLM Agents.

2,838 167 Updated Jan 7, 2026

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 65 6 Updated Feb 25, 2025

Code for Stable Control Representations

Python 26 1 Updated Apr 5, 2025

Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"

Python 149 8 Updated Jul 19, 2024

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 201 19 Updated Apr 17, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,068 221 Updated Nov 17, 2025

Official implementation of Reach-Aware Value Estimation (RAVL) from the paper: "The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning."

Python 7 Updated Apr 27, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,350 4,777 Updated Jun 2, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 965 143 Updated Nov 14, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,766 2,925 Updated Sep 2, 2024

Inference code for Llama models

Python 59,042 9,813 Updated Jan 26, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Python 1 Updated Jan 24, 2024

A JAX-based simulator for autonomous driving research.

Python 1,021 124 Updated Oct 23, 2025