
Highlights

  • Pro

Organizations

@Azure

Starred repositories


SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 977 48 Updated Oct 13, 2025

Scalable toolkit for efficient model reinforcement

Python 1,022 166 Updated Nov 13, 2025

Standard Open Arm 100

4,454 373 Updated Oct 15, 2025

Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, Speedrunning)

Python 62 27 Updated Nov 6, 2025

Beads - A memory upgrade for your coding agent

Go 2,726 173 Updated Nov 12, 2025

[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking

Python 53 Updated Nov 19, 2024

Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"

Python 29 4 Updated Apr 15, 2025

FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

Python 34 3 Updated Nov 7, 2025

PyTorch-native post-training at scale

Python 520 54 Updated Nov 13, 2025

Interactive Markov-chain Monte Carlo Javascript demos

JavaScript 893 124 Updated Jun 4, 2024

An interface library for RL post training with environments.

Python 687 97 Updated Nov 13, 2025

A Multi-Task Dataset for Simulated Humanoid Control

Python 198 22 Updated Mar 27, 2025

🔥 Real-time NVIDIA GPU dashboard

JavaScript 862 44 Updated Nov 2, 2025

Post-training with Tinker

Python 1,872 149 Updated Nov 12, 2025

Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"

Python 310 25 Updated Nov 13, 2025

Suite of motion imitation methods for training controllers.

Python 920 91 Updated Nov 9, 2025

👀 A modern watch command. Time machine and pager etc.

Rust 5,187 97 Updated Aug 29, 2025

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 299 38 Updated Nov 11, 2025

The best ChatGPT that $100 can buy.

Python 36,570 4,394 Updated Nov 5, 2025

A local-first LaTeX & Typst web editor with real-time collaboration & offline support

TypeScript 478 21 Updated Nov 11, 2025

Catch MCP server issues before your agents do.

Python 128 15 Updated Oct 27, 2025

Platform for evaluating reinforcement learning (RL) algorithms on a physical Atari system.

Python 128 2 Updated Aug 28, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,029 230 Updated Nov 11, 2025

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 2,181 258 Updated Nov 13, 2025

A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.

Python 5,005 721 Updated Nov 13, 2025

JAX implementation of WSRL and RL baselines | ICLR 2025

Python 116 13 Updated Jul 11, 2025

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,078 66 Updated Nov 7, 2025

Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left…

Python 105 2 Updated Oct 27, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,817 602 Updated Nov 12, 2025

LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL!

11,807 2,379 Updated Nov 6, 2025