MonicaHuang monica0325

☺️

2 followers · 2 following

Stars

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,018 1,856 Updated Oct 25, 2025

stepfun-ai / Step-Audio2

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,240 91 Updated Sep 22, 2025

aakaran / reasoning-with-sampling

Python 333 43 Updated Nov 7, 2025

zhenyu-02 / LogitLens4LLMs

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

Jupyter Notebook 132 11 Updated Aug 14, 2025

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python 2,811 474 Updated Nov 27, 2025

zai-org / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 3,088 266 Updated Dec 5, 2024

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,556 344 Updated Nov 23, 2025

IAAR-Shanghai / SurveyX

Academic Survey Paper Generation.

TeX 936 88 Updated Jun 22, 2025

emo-box / EmoBox

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Python 295 15 Updated Mar 31, 2025

MattYoon / reasoning-models-confidence

[NeurIPS 2025] Reasoning Models Better Express Their Confidence

Python 21 Updated Nov 19, 2025

saccharomycetes / mllms_know

[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'

Python 298 15 Updated Apr 20, 2025

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 120,752 19,912 Updated Nov 28, 2025

tim-learn / SHOT

Code released for our ICML 2020 paper "Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation"

Python 473 83 Updated Feb 22, 2024

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,118 128 Updated May 22, 2025

tim-learn / awesome-test-time-adaptation

Collection of awesome test-time (domain/batch/instance) adaptation methods

1,135 72 Updated Nov 14, 2025

madaan / self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Python 758 64 Updated Oct 4, 2024

cmu-l3 / l1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Jupyter Notebook 257 29 Updated May 14, 2025

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 1,260 110 Updated Sep 6, 2025

google-deepmind / mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Python 1,920 262 Updated Dec 23, 2024

google-deepmind / AQuA

An algebraic word problem dataset, with multiple-choice questions annotated with rationales.

328 49 Updated Nov 2, 2017

openai / simple-evals

Python 4,195 454 Updated Jul 31, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,783 2,880 Updated Nov 27, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,276 7,651 Updated Nov 30, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,926 1,643 Updated Nov 30, 2025

Libr-AI / do-not-answer

Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs

Jupyter Notebook 297 28 Updated Jun 7, 2024

nlpdata / c3

Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension

Python 168 23 Updated Apr 20, 2022

DRCKnowledgeTeam / DRCD

A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset

313 51 Updated Apr 21, 2020

mlfoundations / task_vectors

Editing Models with Task Arithmetic

Python 515 47 Updated Jan 11, 2024

allenai / natural-instructions

Expanding natural instructions

Python 1,023 197 Updated Dec 11, 2023

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 73,890 2,272 Updated Nov 29, 2025