conglu1997

Follow

🎯

Focusing

Cong Lu conglu1997

🎯

Focusing

Follow

Research Scientist @ Google DeepMind

325 followers · 29 following

London, UK
06:15 (UTC)
https://www.conglu.co.uk
@cong_ml

Achievements

Achievements

Highlights

Pro

Stars

google-deepmind / formal-conjectures

A collection of formalized statements of conjectures in Lean.

Lean 761 176 Updated Jan 11, 2026

conglu1997 / erdos_321_computation

Further computation of R(N) in #321, see https://github.com/teorth/erdosproblems/issues/161.

Python 1 Updated Dec 31, 2025

anyasims / stochastok

Official code for StochasTok: Improving Fine-Grained Subword Understanding in LLMs

Python 13 3 Updated Jun 19, 2025

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,789 385 Updated Aug 13, 2025

SakanaAI / AI-Scientist-v2

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Python 1,974 373 Updated Dec 19, 2025

SakanaAI / AI-Scientist-ICLR2025-Workshop-Experiment

Python 277 21 Updated Apr 18, 2025

conglu1997 / ACD

Automated Capability Discovery via Foundation Model Self-Exploration

Python 66 4 Updated Feb 12, 2025

FLAIROx / Kinetix

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

Python 226 9 Updated Jan 5, 2026

clemgris / IGDrivSim

Benchmark for studying the imitation gap when training autonomous driving policies from human demonstrations

Jupyter Notebook 20 Updated Dec 8, 2025

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 66,495 8,235 Updated Jan 11, 2026

ShengranHu / ADAS

[ICLR 2025] Automated Design of Agentic Systems

Python 1,485 225 Updated Jan 28, 2025

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,925 1,741 Updated Dec 19, 2025

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,023 293 Updated Jan 14, 2025

ndrwmlnk / Awesome-Video-Diffusion-Models

55 Updated Feb 11, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 39,696 3,815 Updated Jan 4, 2026

METR / task-standard

METR Task Standard

TypeScript 169 36 Updated Feb 3, 2025

yingchengyang / Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

515 36 Updated Jan 11, 2026

zjunlp / LLMAgentPapers

Must-read Papers on LLM Agents.

2,838 167 Updated Jan 7, 2026

conglu1997 / intelligent-go-explore

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 65 6 Updated Feb 25, 2025

ykarmesh / stable-control-representations

Code for Stable Control Representations

Python 26 1 Updated Apr 5, 2025

EmptyJackson / policy-guided-diffusion

Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"

Python 149 8 Updated Jul 19, 2024

YifeiZhou02 / ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 201 19 Updated Apr 17, 2025

THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,068 221 Updated Nov 17, 2025

anyasims / edge-of-reach

Official implementation of Reach-Aware Value Estimation (RAVL) from the paper: "The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning."

Python 7 Updated Apr 27, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,350 4,777 Updated Jun 2, 2025

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 965 143 Updated Nov 14, 2025

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,766 2,925 Updated Sep 2, 2024

meta-llama / llama

Inference code for Llama models

Python 59,042 9,813 Updated Jan 26, 2025

floatingsun / gpt-neox

Forked from EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Python 1 Updated Jan 24, 2024

waymo-research / waymax

A JAX-based simulator for autonomous driving research.

Python 1,021 124 Updated Oct 23, 2025