zia1138

Zia Khan zia1138

San Francisco, CA
19:58 (UTC -08:00)
https://scholar.google.com/citations?user=gJkhFkgAAAAJ&hl=en

Stars

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,792 599 Updated Nov 6, 2025

pydantic / pydantic-ai

GenAI Agent Framework, the Pydantic way

Python 13,262 1,368 Updated Nov 8, 2025

evalstate / fast-agent

Define, Prompt and Test MCP enabled Agents and Workflows

Python 3,417 360 Updated Nov 8, 2025

daytonaio / daytona

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 29,528 2,497 Updated Nov 8, 2025

BoundaryML / baml

The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

Rust 6,662 325 Updated Nov 9, 2025

shangshang-wang / Tina

Tina: Tiny Reasoning Models via LoRA

Python 303 36 Updated Sep 23, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,341 809 Updated Oct 31, 2025

dstackai / dstack

dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.

Python 1,952 202 Updated Nov 7, 2025

knoveleng / open-rs

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 269 24 Updated Oct 16, 2025

pilancilab / LLM-Lasso

Implements LLM-Lasso

Python 36 6 Updated Jul 28, 2025

trycua / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,501 105 Updated Sep 26, 2025

philschmid / deep-learning-pytorch-huggingface

Jupyter Notebook 1,308 258 Updated Feb 27, 2025

A-Alpha-Bio / alphabind

AlphaBind code + model accompanying pre-print

Jupyter Notebook 84 11 Updated Jul 24, 2025

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,300 202 Updated Jun 10, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,116 125 Updated May 22, 2025

jwohlwend / boltz

Official repository for the Boltz biomolecular interaction models

Python 3,436 675 Updated Oct 3, 2025

LambdaLabsML / distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

Python 531 53 Updated Oct 22, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,674 250 Updated Sep 25, 2025