manishshettym

Manish Shetty manishshettym

PhD student at UC Berkeley | AI4Code

118 followers · 3 following

UC Berkeley
Berkeley, CA
manishs.org
@slimshetty_

Achievements

x2 x3

Highlights

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

gso-bench / gso-experiments

Open-sourced execution logs, trajectories, and results from evaluation runs on GSO

Python 1 Updated Dec 25, 2025

psanford / wormhole-william

End-to-end encrypted file transfer. A magic wormhole CLI and API in Go (golang).

Go 1,212 67 Updated Aug 5, 2025

OpenHands / ToM-SWE

The theory of mind module for the SWE agent

Python 67 9 Updated Jan 13, 2026

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 830 209 Updated Jan 20, 2026

UCB-ADRS / ADRS

AI-Driven Research Systems (ADRS)

Jupyter Notebook 117 16 Updated Dec 17, 2025

facebookresearch / cwm

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 799 65 Updated Dec 26, 2025

kenkoooo / AtCoderProblems

Extend your AtCoder

TypeScript 1,561 153 Updated Jul 27, 2024

math-inc / strongpnt

Lean 288 19 Updated Sep 11, 2025

thinking-machines-lab / batch_invariant_ops

Python 950 71 Updated Nov 4, 2025

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,753 472 Updated Jan 20, 2026

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 2,561 335 Updated Jan 20, 2026

SevenTV / chatterino7

Forked from Chatterino/chatterino2

Chat client for https://twitch.tv

C++ 464 87 Updated Jan 19, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 10,888 1,411 Updated Nov 3, 2025

gso-bench / gso

[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

Python 62 3 Updated Jan 14, 2026

HazyResearch / Megakernels

kernels, of the mega variety

Python 650 40 Updated Sep 28, 2025

google / licenseclassifier

A License Classifier

Go 343 78 Updated Oct 14, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,208 1,846 Updated Jan 9, 2026

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 226 43 Updated Jul 13, 2025

openai / frontier-evals

OpenAI Frontier Evals

Python 984 115 Updated Dec 6, 2025

facebookresearch / swe-rl

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 665 57 Updated Mar 16, 2025

facebookresearch / MLGym

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Python 583 57 Updated Aug 10, 2025

nautechsystems / nautilus_trader

A high-performance algorithmic trading platform and event-driven backtester

Rust 18,123 2,128 Updated Jan 20, 2026

yamadashy / repomix

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…

TypeScript 21,341 993 Updated Jan 18, 2026