An Autonomous LLM Agent for Complex Task Solving

Python 8,484 891 Updated Aug 12, 2024

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Python 59 9 Updated Feb 29, 2024

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

377 4 Updated Oct 10, 2025

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…

Python 8,899 781 Updated Jan 1, 2026

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 751 105 Updated Oct 27, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 845 58 Updated Jul 31, 2025

DQN with target network for spaceshooter

Python 2 Updated Sep 14, 2018

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,620 3,389 Updated Jan 1, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 1,942 190 Updated Dec 31, 2025

Flow Matching (Rectified Flow) built by hand from scratch

Python 564 33 Updated Dec 10, 2025

Production-ready platform for agentic workflow development.

Python 124,355 19,338 Updated Jan 2, 2026

The world's first open-source multimodal creative assistant. This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 5,636 512 Updated Nov 10, 2025
Python 30 5 Updated Dec 4, 2024

(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators

Jupyter Notebook 268 28 Updated Sep 25, 2025

Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall

HTML 8 Updated Sep 29, 2025

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,367 1,346 Updated Jul 9, 2025

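"AI-Compass" guides the community in navigating the ocean of AI technology. Whether you are a beginner or an advanced developer, you can find a path here to each major area of AI. It aims to help developers systematically understand AI's core concepts, mainstream technologies, and frontier trends, and to master the full process from theory to deployment through practice.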

480 58 Updated Dec 11, 2025

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Python 487 36 Updated Oct 2, 2025

Chat Haruhi Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,048 182 Updated Aug 13, 2024

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 16,106 1,294 Updated Dec 1, 2025

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 20 Updated Jul 26, 2025

⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents

TypeScript 30,613 3,976 Updated Jan 1, 2026

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,910 2,371 Updated Jan 1, 2026

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,279 696 Updated Nov 19, 2025

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 5,039 459 Updated Dec 13, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,964 2,939 Updated Jan 2, 2026

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,779 383 Updated Aug 13, 2025