Skip to content
View andyl98's full-sized avatar
🫥
🫥
  • Roblox
  • United States
  • 23:34 (UTC -08:00)
  • LinkedIn in/andyl98

Block or report andyl98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A PyTorch native platform for training generative AI models

Python 4,951 662 Updated Jan 11, 2026

Post-training with Tinker

Python 2,712 290 Updated Jan 8, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,224 1,954 Updated Dec 29, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 510 90 Updated Jan 9, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,286 410 Updated Jan 12, 2026

Democratizing Reinforcement Learning for LLMs

Python 4,965 477 Updated Jan 10, 2026

Toolchain manager for Roblox projects

Rust 227 32 Updated Nov 13, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,278 4,021 Updated Jan 12, 2026

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 23,641 5,148 Updated Jan 12, 2026

Our library for RL environments + evals

Python 3,730 470 Updated Jan 11, 2026

Lightweight coding agent that runs in your terminal

Rust 55,905 7,188 Updated Jan 12, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,131 244 Updated Dec 15, 2025

Copilot Chat extension for VS Code

TypeScript 9,259 1,560 Updated Jan 12, 2026

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 73,071 6,554 Updated Jan 12, 2026

✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork

Python 301 18 Updated Sep 6, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

1,437 140 Updated Jul 18, 2025

Model Context Protocol Servers

TypeScript 76,000 9,206 Updated Jan 11, 2026

Train transformer language models with reinforcement learning.

Python 6 2 Updated Jul 28, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 791 50 Updated Aug 15, 2025

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 423 25 Updated Sep 23, 2025

s1: Simple test-time scaling

Python 6,625 765 Updated Jun 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,242 3,004 Updated Jan 12, 2026

Fully open reproduction of DeepSeek-R1

Python 25,808 2,409 Updated Nov 24, 2025

⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents

TypeScript 30,830 4,024 Updated Jan 12, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,768 848 Updated Jan 8, 2026

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,472 790 Updated Jan 9, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,538 2,863 Updated Nov 3, 2025

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,985 115 Updated Dec 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,206 4,680 Updated Jan 12, 2026
Next