Stars
dInfer: An Efficient Inference Framework for Diffusion Language Models
Gemma open-weight LLM library, from Google DeepMind
[EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
Curated resources, research, and tools for securing AI systems
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Renderer for the harmony response format to be used with gpt-oss
Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".
Code for the paper "Defeating Prompt Injections by Design"
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
Open-source implementation of AlphaEvolve
Official PyTorch implementation for "Large Language Diffusion Models"
Dataset and code for "JailbreaksOverTime: Detecting Jailbreak Attacks Under Distribution Shift"
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs; empirical tricks for LLM jailbreaking (NeurIPS 2024)
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models
A data augmentation library for audio, image, text, and video.
Fast near-duplicate matching: quickly finds near-duplicate spans in a document using the Rabin-Karp rolling-hash algorithm (see the sketch after this list).
The Security Toolkit for LLM Interactions
Every practical and proposed defense against prompt injection.
Official code for "Measuring Non-Adversarial Reproduction of Training Data in Large Language Models" (https://arxiv.org/abs/2411.10242)
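A minimal sketch of the rolling-hash idea behind the fast near-duplicate matching entry above, assuming whitespace tokenization and a fixed token window. The names (window_hashes, near_duplicate_spans, BASE, MOD, NGRAM) are illustrative, not the repository's actual API.

```python
# Hypothetical sketch of Rabin-Karp near-duplicate span matching,
# not the repo's actual implementation.
BASE = 257
MOD = (1 << 61) - 1   # large Mersenne prime keeps hash collisions rare
NGRAM = 8             # window size in tokens (assumed, tunable)

def window_hashes(tokens):
    """Yield (start_offset, rolling_hash) for every NGRAM-token window."""
    ids = [hash(t) % MOD for t in tokens]  # stable within one process
    top = pow(BASE, NGRAM - 1, MOD)
    h = 0
    for i, tid in enumerate(ids):
        if i >= NGRAM:
            h = (h - ids[i - NGRAM] * top) % MOD  # slide: drop the oldest token
        h = (h * BASE + tid) % MOD                # push the newest token
        if i >= NGRAM - 1:
            yield i - NGRAM + 1, h

def near_duplicate_spans(doc, reference):
    """Return token offsets in doc whose NGRAM window also occurs in reference."""
    ref_tokens = reference.split()
    ref_index = {}  # hash -> set of real n-grams, to reject hash collisions
    for i, h in window_hashes(ref_tokens):
        ref_index.setdefault(h, set()).add(tuple(ref_tokens[i:i + NGRAM]))
    doc_tokens = doc.split()
    return [i for i, h in window_hashes(doc_tokens)
            if tuple(doc_tokens[i:i + NGRAM]) in ref_index.get(h, ())]
```

Hits at consecutive offsets can then be merged into maximal duplicate spans; the repository's actual tokenization, window size, and collision handling may differ.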