
Code for the Overhearing Agents project (AI Agents Workshop @ COLM 2025).

Python 3 Updated May 18, 2025

Improved techniques for optimization-based jailbreaking on large language models (ICLR 2025)

Python 131 12 Updated Apr 7, 2025

Code for the paper "Defeating Prompt Injections by Design"

Jupyter Notebook 133 22 Updated Jun 20, 2025

Flow Integrity Deterministic Enforcement System. Mechanisms for securing AI agents with information-flow control.

Jupyter Notebook 57 6 Updated May 30, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

990 86 Updated Oct 22, 2025

A fast + lightweight implementation of the GCG algorithm in PyTorch

Python 292 60 Updated May 13, 2025

Universal and Transferable Attacks on Aligned Language Models

Python 4,274 572 Updated Aug 2, 2024

An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)

Python 44 9 Updated Feb 8, 2024

🚀 The fast, Pythonic way to build MCP servers and clients

Python 19,443 1,418 Updated Oct 23, 2025

mcptee - tool for MCP developers to log stdin transport

Go 8 Updated Apr 12, 2025

DoomArena is a Framework for Testing AI Agents Against Evolving Security Threats

Python 49 5 Updated Sep 12, 2025

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,486 102 Updated Sep 26, 2025

Autonomous Agents (LLMs) research papers. Updated Daily.

1,039 74 Updated Oct 22, 2025

SafeArena is a benchmark for assessing the harmful capabilities of web agents

Python 19 3 Updated Apr 23, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 941 131 Updated Oct 22, 2025

Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents

TypeScript 4,705 385 Updated Jul 28, 2025

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 429 90 Updated Oct 22, 2025

AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM

Python 74 8 Updated Nov 3, 2024

Papers and resources related to the security and privacy of LLMs 🤖

Python 536 43 Updated Jun 8, 2025

Python tool for converting files and office documents to Markdown.

Python 82,008 4,589 Updated Oct 20, 2025

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python 298 36 Updated Oct 15, 2025

Interactive Tables and Data Grids for JavaScript

JavaScript 7,364 867 Updated Aug 27, 2025

💩🚀 Windows 95 in Electron. Runs on macOS, Linux, and Windows.

TypeScript 22,941 1,314 Updated Sep 4, 2025

A curated list of trustworthy deep learning papers. Daily updating...

375 39 Updated Aug 20, 2025

📂 Web File Browser

Go 31,653 3,540 Updated Oct 22, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,203 183 Updated Mar 27, 2024

Locally run an Instruction-Tuned Chat-Style LLM

C 10,204 880 Updated Apr 19, 2023

Aligning pretrained language models with instruction data generated by themselves.

Python 4,504 521 Updated Mar 27, 2023

🦜🔗 Build context-aware reasoning applications

Python 117,825 19,400 Updated Oct 22, 2025

A framework for the evaluation of autoregressive code generation language models.

Python 986 251 Updated Jul 22, 2025