Zhenting Qi zhentingqi

Carpe Diem.

43 followers · 0 following

Stars

subconscious-systems / TIMRUN

60 8 Updated Nov 7, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,254 1,315 Updated Nov 19, 2025

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,127 171 Updated Nov 18, 2025

zhentingqi / evolm

Python 15 3 Updated Jun 23, 2025

ericjiang18 / EnergyORM

Python 11 1 Updated Jun 5, 2025

fannie1208 / Awesome-Agentic-RL

6 Updated Jun 26, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 95,220 25,954 Updated Nov 20, 2025

tulerfeng / Awesome-Embodied-Multimodal-LLMs

Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models).

121 6 Updated Jul 4, 2024

HuandongChang / ElaLoRA

Python 3 Updated Jun 8, 2025

facebookresearch / swe-rl

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 620 51 Updated Mar 16, 2025

satori-reasoning / Satori

[ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Python 108 6 Updated Jun 3, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,065 119 Updated Jun 2, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,301 458 Updated Nov 20, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,144 2,597 Updated Nov 19, 2025

OpenAutoCoder / Agentless

Agentless🐱: an agentless approach to automatically solve software development problems

Python 1,965 213 Updated Dec 22, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 30,124 2,416 Updated Nov 18, 2025

sail-sg / oat-zero

A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

Python 248 10 Updated Apr 15, 2025

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 2,326 257 Updated Sep 3, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,321 331 Updated Nov 19, 2025

OpenHands / OpenHands

🙌 OpenHands: Code Less, Make More

Python 65,107 7,934 Updated Nov 20, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 568 47 Updated Oct 31, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,125 384 Updated Nov 18, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,656 2,404 Updated Sep 8, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,423 816 Updated Nov 9, 2025

yale-nlp / MMVU

Data and Code for CVPR 2025 paper "MMVU: Measuring Expert-Level Multi-Discipline Video Understanding"

Python 75 1 Updated Feb 28, 2025

facebookresearch / cruxeval

CRUXEval: Code Reasoning, Understanding, and Execution Evaluation

Python 158 26 Updated Oct 11, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,652 2,549 Updated Nov 19, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,918 6,920 Updated Nov 20, 2025

ise-uiuc / magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

Python 2,059 170 Updated Nov 1, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,424 324 Updated Nov 13, 2024