Hongming Zhang initial-h

🎯

Focusing

Shape the way you think.

41 followers · 33 following

www.cnblogs.com/initial-h/

Stars

OpenBMB / XAgent

An Autonomous LLM Agent for Complex Task Solving

Python 8,484 891 Updated Aug 12, 2024

gydpku / PPTC

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Python 59 9 Updated Feb 29, 2024

OpenHelix-Team / Awesome-VLA-RL

This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.

377 4 Updated Oct 10, 2025

xorbitsai / inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…

Python 8,899 781 Updated Jan 1, 2026

rail-berkeley / serl

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 751 105 Updated Oct 27, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 845 58 Updated Jul 31, 2025

initial-h / spaceShooter_DQN

DQN with target network for spaceshooter

Python 2 Updated Sep 14, 2018

initial-h / in-sample-deep-reinforcement-learning

Python 3 1 Updated Feb 25, 2025

LejuRobotics / kuavo_data_challenge

Python 76 9 Updated Dec 31, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,620 3,389 Updated Jan 1, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 1,942 190 Updated Dec 31, 2025

TongTong313 / rectified-flow

Building Flow Matching (Rectified Flow) by hand from scratch

Python 564 33 Updated Dec 10, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

Python 124,355 19,338 Updated Jan 2, 2026

Physical-Intelligence / openpi

Python 9,614 1,294 Updated Dec 27, 2025

11cafe / jaaz

The world's first open-source multimodal creative assistant. This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 5,636 512 Updated Nov 10, 2025

BunsenFeng / model_swarm

Python 30 5 Updated Dec 4, 2024

Wuyxin / collabllm

(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators

Jupyter Notebook 268 28 Updated Sep 25, 2025

AI-Research-TeamX / SEER

Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall

HTML 8 Updated Sep 29, 2025

HW-whistleblower / True-Story-of-Pangu

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,367 1,346 Updated Jul 9, 2025

tingaicompass / AI-Compass

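"AI-Compass" guides the community in navigating the ocean of AI technology. Whether you are a beginner or an advanced developer, you can find a path here into each major area of AI. It aims to help developers systematically understand AI's core concepts, mainstream technologies, and frontier trends, and to master the full process from theory to deployment through practice.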

480 58 Updated Dec 11, 2025

thu-coai / CharacterGLM-6B

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Python 487 36 Updated Oct 2, 2025

LC1332 / Chat-Haruhi-Suzumiya

Chat-Haruhi-Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,048 182 Updated Aug 13, 2024

xming521 / WeClone

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 16,106 1,294 Updated Dec 1, 2025

ChangWinde / PiCor

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 20 Updated Jul 26, 2025

continuedev / continue

⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents

TypeScript 30,613 3,976 Updated Jan 1, 2026

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,910 2,371 Updated Jan 1, 2026

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,279 696 Updated Nov 19, 2025

jina-ai / node-DeepResearch

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 5,039 459 Updated Dec 13, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,964 2,939 Updated Jan 2, 2026

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,779 383 Updated Aug 13, 2025
