Skip to content
View zia1138's full-sized avatar

Block or report zia1138

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,792 599 Updated Nov 6, 2025

GenAI Agent Framework, the Pydantic way

Python 13,262 1,368 Updated Nov 8, 2025

Define, Prompt and Test MCP enabled Agents and Workflows

Python 3,417 360 Updated Nov 8, 2025

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 29,528 2,497 Updated Nov 8, 2025

The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

Rust 6,662 325 Updated Nov 9, 2025

Tina: Tiny Reasoning Models via LoRA

Python 303 36 Updated Sep 23, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,341 809 Updated Oct 31, 2025

dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.

Python 1,952 202 Updated Nov 7, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 269 24 Updated Oct 16, 2025

Implements LLM-Lasso

Python 36 6 Updated Jul 28, 2025

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,501 105 Updated Sep 26, 2025

AlphaBind code + model accompanying pre-print

Jupyter Notebook 84 11 Updated Jul 24, 2025

A library for advanced large language model reasoning

Python 2,300 202 Updated Jun 10, 2025

Recipes to scale inference-time compute of open models

Python 1,116 125 Updated May 22, 2025

Official repository for the Boltz biomolecular interaction models

Python 3,436 675 Updated Oct 3, 2025

Best practices & guides on how to write distributed pytorch training code

Python 531 53 Updated Oct 22, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,674 250 Updated Sep 25, 2025

A programming framework for agentic AI

Python 51,497 7,836 Updated Oct 8, 2025

Enhancing Gene Set Overrepresentation Analysis with Large Language Models

Jupyter Notebook 10 2 Updated Mar 17, 2025

Improved antibody structure-based design using inverse folding

Python 143 26 Updated Aug 21, 2025

Convenience Python APIs for antibody numbering using ANARCI

Python 107 14 Updated May 19, 2025

Package management made easy

Rust 5,616 373 Updated Nov 9, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,410 199 Updated May 7, 2025

Chat With All Kinds of AI Models Through a Common Interface

R 137 10 Updated Feb 17, 2025

A python module to repair invalid JSON from LLMs

Python 3,893 153 Updated Nov 8, 2025

Use the OpenAI Batch tool to make async batch requests to the OpenAI API.

Python 100 3 Updated Mar 2, 2024

A guidance language for controlling large language models.

Jupyter Notebook 20,896 1,122 Updated Oct 14, 2025

Sleeping Mac = Bluetooth off

Swift 2,494 69 Updated Feb 19, 2024
Jupyter Notebook 293 42 Updated Mar 18, 2024
Next