Whether you're compiling kernels, training models, or just waiting for sleep 600 to finish, JobDone makes sure you never miss the moment your job ends—successfully, tragically, or somewhere in betw…

Python 15 Updated Oct 26, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 40,028 5,129 Updated Jan 8, 2026

microsoft / AI-For-Beginners

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 44,830 9,042 Updated Jan 5, 2026

S3IC-Lab / RobustJudge

An easy-to-use Python framework for testing the robustness of models as evaluators

Python 4 1 Updated Sep 21, 2025

zjunlp / steer-target-atoms

[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms

Python 35 2 Updated Jun 4, 2025

nelvko / clash-for-linux-install

😼 优雅地使用基于 clash/mihomo 的代理环境

Shell 7,781 950 Updated Jan 9, 2026

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

389 26 Updated Nov 1, 2024

EasyJailbreak / EasyJailbreak

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Python 803 77 Updated Mar 27, 2025

aaronmueller / MIB

Landing page for MIB: A Mechanistic Interpretability Benchmark

Python 23 3 Updated Aug 15, 2025

karminski / one-small-step

这是一个简单的技术科普教程项目，主要聚焦于解释一些有趣的，前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。

6,458 586 Updated Nov 10, 2025

dajale423 / refusal_direction

Forked from andyrdt/refusal_direction

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 2 Updated Jan 14, 2025

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,825 590 Updated Sep 15, 2025

saprmarks / dictionary_learning

Python 382 91 Updated Aug 21, 2025

OpenMOSS / Language-Model-SAEs

Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

Python 170 22 Updated Jan 9, 2026

jacobdunefsky / transcoder_circuits

Jupyter Notebook 193 32 Updated Nov 17, 2024

NISPLab / JBShield

Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"

Python 215 29 Updated May 10, 2025

zepingyu0512 / neuron-attribution

code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models

Jupyter Notebook 48 8 Updated Nov 17, 2024

lucyknada / baukit-modified

Python 3 Updated Jul 24, 2024

Chuokun Xu gebilxs

Organizations

Lists (5)

Agent

Learn Md

PhotoChange

Reasoning model

VLM

Starred repositories

Chrome

Bitcoin