Skip to content
View cameron-chen's full-sized avatar

Highlights

  • Pro

Block or report cameron-chen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 60,652 10,700 Updated Oct 21, 2025

Post-training with Tinker

Python 1,074 75 Updated Oct 21, 2025

A Gym for Agentic LLMs

Python 329 13 Updated Oct 13, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,126 53 Updated Aug 27, 2025

🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

68 3 Updated Mar 21, 2025

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 247 10 Updated Apr 15, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,284 1,513 Updated Apr 24, 2025

Critique-out-Loud Reward Models

Python 70 7 Updated Oct 18, 2024

Official implementation of On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression

Python 5 Updated May 29, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,751 99 Updated Mar 18, 2025

LeetCode 101:力扣刷题指南

9,774 1,243 Updated Dec 8, 2024

The HELMET Benchmark

Jupyter Notebook 178 31 Updated Aug 15, 2025

Friends of OLMo and their links.

348 30 Updated Sep 15, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,477 1,074 Updated Apr 30, 2025

AnchorAttention: Improved attention for LLMs long-context training

Python 213 6 Updated Jan 15, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 539 45 Updated Oct 21, 2025

Benchmarking LLMs with Challenging Tasks from Real Users

Python 243 50 Updated Nov 3, 2024

[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)

Python 131 5 Updated Jul 8, 2025

[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)

Jupyter Notebook 84 1 Updated Oct 23, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 747 52 Updated Sep 27, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,900 592 Updated Jul 4, 2025

neural-cognitive-models-for-human-decision-making

Python 4 1 Updated Feb 7, 2025

[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.

Python 130 10 Updated Mar 21, 2025

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

Python 44 3 Updated Apr 15, 2025

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)

Jupyter Notebook 65 11 Updated Jan 11, 2025

Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)

Python 131 11 Updated Apr 7, 2025

This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.

Python 130 11 Updated Nov 16, 2024

Dromedary: towards helpful, ethical and reliable LLMs.

Python 1,143 89 Updated Sep 18, 2025

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,399 70 Updated Apr 11, 2024
Next