🦉
goo-goo-goo 


TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 82 6 Updated Jun 16, 2025
Python 52 3 Updated Aug 24, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,878 305 Updated Mar 10, 2025

[COLING 2025] Automated Molecular Concept Generation and Labeling with Large Language Models

Python 3 Updated Dec 29, 2024

Source code of Multi-Token Assisted Decoding

Python 7 1 Updated Apr 11, 2025

Official codebase for Scattered Forest Search: Smarter Code Space Exploration and Inference Scaling with LLMs

Jupyter Notebook 9 1 Updated Feb 20, 2025

DataSciBench: An LLM Agent Benchmark for Data Science

Python 40 3 Updated Sep 1, 2025

RL Scaling and Test-Time Scaling (ICML'25)

112 1 Updated Jan 23, 2025

The website for the paper "Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search"

JavaScript 3 1 Updated Apr 10, 2025

Repository for Data Distillation for Offline Reinforcement Learning

Python 8 Updated Aug 2, 2024

Sci-BeRT model for paper reference source tracing. Submission for 2024 PST-KDD Cup.

Jupyter Notebook 3 1 Updated Jun 15, 2024

Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"

Python 81 15 Updated Jul 2, 2024

LLM101n: Let's build a Storyteller

35,579 1,937 Updated Aug 1, 2024

Course project for CS 145 - KDD 2024 AQA Challenge

Python 2 1 Updated Jun 13, 2024
Python 1 Updated Jun 12, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 678 50 Updated Jan 20, 2025

The official repo for the paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"

Jupyter Notebook 18 2 Updated Aug 13, 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 70 4 Updated May 31, 2024

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering

Jupyter Notebook 53 9 Updated Nov 13, 2024

The official Meta Llama 3 GitHub site

Python 29,097 3,483 Updated Jan 26, 2025

🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"

Python 102 15 Updated Apr 6, 2025

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 3 2 Updated Oct 26, 2023

Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).

Python 84 9 Updated Aug 12, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,218 102 Updated May 8, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,782 231 Updated Aug 11, 2024

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)

Python 83 10 Updated Feb 25, 2024

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 29 1 Updated Dec 13, 2023

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,941 209 Updated Nov 17, 2025

This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"

Python 128 13 Updated May 30, 2025

Learning to Group Auxiliary Datasets for Molecule (NeurIPS 2023)

Python 18 Updated Dec 19, 2023