Ziniu Hu acbull

🦉

goo-goo-goo　

https://acbull.github.io/

266 followers · 19 following

Stars

THUDM / TreeRL

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 82 6 Updated Jun 16, 2025

Rafa-zy / QLASS

Python 52 3 Updated Aug 24, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,878 305 Updated Mar 10, 2025

ziminz19 / AutoMolCo

[COLING 2025] Automated Molecular Concept Generation and Labeling with Large Language Models

Python 3 Updated Dec 29, 2024

ZongyueQin / MTAD

Source code of Multi-Token Assisted Decoding

Python 7 1 Updated Apr 11, 2025

codespace-optimization / sfs

Official codebase for the Scattered Forest Search: Smarter Code Space Exploration and Inference Scaling with LLMs

Jupyter Notebook 9 1 Updated Feb 20, 2025

THUDM / DataSciBench

DataSciBench: An LLM Agent Benchmark for Data Science

Python 40 3 Updated Sep 1, 2025

THUDM / T1

RL Scaling and Test-Time Scaling (ICML'25)

112 1 Updated Jan 23, 2025

llm-strategist / llm-strategist.github.io

The website of paper "Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search"

JavaScript 3 1 Updated Apr 10, 2025

ggflow123 / DDRL

Repository for Data Distillation for Offline Reinforcement Learning

Python 8 Updated Aug 2, 2024

wzsmith / cs145-pst

Sci-BeRT model for paper reference source tracing. Submission for 2024 PST-KDD Cup.

Jupyter Notebook 3 1 Updated Jun 15, 2024

yecchen / MIRAI

Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"

Python 81 15 Updated Jul 2, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

35,579 1,937 Updated Aug 1, 2024

the-catalyst / KDD_AQA

Course project for CS 145 - KDD 2024 AQA Challenge

Python 2 1 Updated Jun 13, 2024

rizvi-ha / team2_gcn

Python 1 Updated Jun 12, 2024

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 678 50 Updated Jan 20, 2025

HenryCai11 / LLM-Self-Control

The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"

Jupyter Notebook 18 2 Updated Aug 13, 2024

yihedeng9 / STIC

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 70 4 Updated May 31, 2024

embedded-robotics / path-rag

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering

Jupyter Notebook 53 9 Updated Nov 13, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,097 3,483 Updated Jan 26, 2025

camel-ai / agent-trust

🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"

Python 102 15 Updated Apr 6, 2025

ZongyueQin / HLSyn

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 3 2 Updated Oct 26, 2023

yjhuangcd / rule-guided-music

Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).

Python 84 9 Updated Aug 12, 2024

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,218 102 Updated May 8, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,782 231 Updated Aug 11, 2024

THUDM / SciGLM

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)

Python 83 10 Updated Feb 25, 2024

UCLA-DM / HLSyn

Forked from ZongyueQin/HLSyn

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 29 1 Updated Dec 13, 2023

THUDM / AgentBench

jonathanmli / Avalon-LLM

Graph-and-Geometric-Learning / MolGroup