Skip to content
View ddy-ddy's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Bytedance
  • Beijing

Block or report ddy-ddy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HuggingFace conversion and training library for Megatron-based models

Python 226 70 Updated Nov 28, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 162 34 Updated Nov 27, 2025

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 14,105 1,162 Updated Nov 27, 2025

The best ChatGPT that $100 can buy.

Python 37,676 4,621 Updated Nov 17, 2025

Fully open data curation for reasoning models

Python 2,149 180 Updated Sep 3, 2025

Build AI Agents, Visually

TypeScript 46,948 23,196 Updated Nov 27, 2025
Python 23 5 Updated Sep 22, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,505 59 Updated Jun 14, 2025
Python 1,116 97 Updated Oct 22, 2025

Memory Augmented Neural Networks (Pytorch)

Python 14 Updated Sep 2, 2018

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates. - Professor Yu Liu

Jupyter Notebook 1,652 99 Updated Nov 24, 2025

Concatenate a directory full of files into a single prompt for use with LLMs

Python 2,520 151 Updated Feb 19, 2025

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

MDX 35,711 3,044 Updated Oct 24, 2025

Data validation using Python type hints

Python 25,951 2,338 Updated Nov 27, 2025

🧠 Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Incuding: ChatGPT, Claude, Perplexity, Manus, Claude-Code, Loveable, v0, Grok, same new,…

TypeScript 4,455 699 Updated Aug 26, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 68,470 7,390 Updated Nov 28, 2025

The absolute trainer to light up AI agents.

Python 8,984 716 Updated Nov 28, 2025

GPT powered sorting using structured output

Python 493 23 Updated Jul 12, 2025

A course on aligning smol models.

Jupyter Notebook 6,525 2,297 Updated Nov 10, 2025

[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Python 142 1 Updated Oct 10, 2025

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

Python 1,913 207 Updated Nov 13, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 13,121 2,162 Updated Sep 6, 2025

An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.

Python 868 119 Updated Oct 1, 2025

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 20,916 3,022 Updated Nov 27, 2025

[ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"

Python 14 Updated May 24, 2025

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 9,905 444 Updated Nov 10, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 799 58 Updated Jul 31, 2025

The official code of ARPO & AEPO

Python 807 36 Updated Nov 15, 2025

Train transformer language models with reinforcement learning.

Python 16,452 2,321 Updated Nov 27, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

4,216 249 Updated Sep 19, 2025
Next