Starred repositories

slime is an LLM post-training framework for RL Scaling.

Python 2,615 288 Updated Nov 28, 2025

The official repository of The Road Less Traveled: Enhancing Exploration in LLMs via Sequential Sampling.

Python 1 Updated Oct 17, 2025

[ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning

Python 68 11 Updated Jun 27, 2025

Production-ready platform for agentic workflow development.

TypeScript 120,063 18,640 Updated Nov 28, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,874 1,896 Updated Nov 24, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,466 2,054 Updated Jul 29, 2025

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 3,233 328 Updated Feb 6, 2024

Train transformer language models with reinforcement learning.

Python 16,456 2,322 Updated Nov 28, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

68,322 7,754 Updated Jun 4, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,562 714 Updated Nov 28, 2025

An AI Hedge Fund Team

Python 42,453 7,532 Updated Nov 13, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 59,722 7,316 Updated Oct 4, 2025

Build resilient language agents as graphs.

Python 21,533 3,798 Updated Nov 28, 2025

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,292 9,487 Updated Apr 29, 2025
Jupyter Notebook 241 75 Updated Nov 24, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,056 8,710 Updated Nov 28, 2025

A live stream development of RL tuning for LLM agents

Python 3,632 506 Updated Oct 8, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,038 8,905 Updated Nov 17, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,003 46,183 Updated Nov 28, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,781 4,015 Updated Nov 28, 2025

Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.

Jupyter Notebook 671 61 Updated Mar 22, 2025

Model Context Protocol Servers

TypeScript 73,500 8,894 Updated Nov 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,175 11,616 Updated Nov 28, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,568 2,918 Updated Nov 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,245 7,649 Updated Nov 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,851 4,651 Updated Nov 26, 2025

My learning notes/codes for ML SYS.

Python 4,294 259 Updated Nov 25, 2025

minimal-cost for training 0.5B R1-Zero

Python 787 102 Updated May 14, 2025