Skip to content
View manishshettym's full-sized avatar

Highlights

  • Pro

Block or report manishshettym

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open sourced execution logs, trajectories, and results from evaluation runs on GSO

Python 1 Updated Dec 25, 2025

End-to-end encrypted file transfer. A magic wormhole CLI and API in Go (golang).

Go 1,212 67 Updated Aug 5, 2025

The theory of mind module for the SWE agent

Python 67 9 Updated Jan 13, 2026

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 830 209 Updated Jan 20, 2026

AI-Driven Research Systems (ADRS)

Jupyter Notebook 117 16 Updated Dec 17, 2025

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 799 65 Updated Dec 26, 2025

Extend your AtCoder

TypeScript 1,561 153 Updated Jul 27, 2024
Lean 288 19 Updated Sep 11, 2025

Our library for RL environments + evals

Python 3,753 472 Updated Jan 20, 2026

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 2,561 335 Updated Jan 20, 2026

Chat client for https://twitch.tv

C++ 464 87 Updated Jan 19, 2026

Nano vLLM

Python 10,888 1,411 Updated Nov 3, 2025

[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

Python 62 3 Updated Jan 14, 2026

kernels, of the mega variety

Python 650 40 Updated Sep 28, 2025

A License Classifier

Go 343 78 Updated Oct 14, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,208 1,846 Updated Jan 9, 2026

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 226 43 Updated Jul 13, 2025

OpenAI Frontier Evals

Python 984 115 Updated Dec 6, 2025

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 665 57 Updated Mar 16, 2025

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 583 57 Updated Aug 10, 2025

A high-performance algorithmic trading platform and event-driven backtester

Rust 18,123 2,128 Updated Jan 20, 2026

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…

TypeScript 21,341 993 Updated Jan 18, 2026

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

1,437 140 Updated Jul 18, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,136 738 Updated Jan 4, 2026

[FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods

Dafny 54 1 Updated Jun 9, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 12,609 1,543 Updated Apr 24, 2025

A new version of Soot with a completely overhauled architecture

Java 765 111 Updated Jan 19, 2026

A GitHub :octocat: app to automatically review Python code style over Pull Requests

Python 617 87 Updated Dec 3, 2025
Next