Skip to content
View nickvdw's full-sized avatar
πŸ˜–
πŸ˜–

Block or report nickvdw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Demystifying Reinforcement Learning in Agentic Reasoning

Python 123 21 Updated Oct 14, 2025

An interface library for RL post training with environments.

Python 753 116 Updated Nov 25, 2025

Training Model Behavior in Agentic Systems

Python 662 45 Updated Nov 27, 2025

The absolute trainer to light up AI agents.

Python 8,974 717 Updated Nov 28, 2025

Awesome List for Agentic RL

HTML 559 18 Updated Nov 27, 2025

PyTorch-native post-training at scale

Python 549 67 Updated Nov 27, 2025

🧠 Make your agents learn from experience. Based on the Agentic Context Engineering (ACE) framework.

Python 982 120 Updated Nov 27, 2025

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,653 126 Updated Nov 16, 2025

Curated collection of papers in MoE model inference

308 11 Updated Oct 20, 2025

Super Productivity is an advanced todo list app with integrated Timeboxing and time tracking capabilities. It also comes with integrations for Jira, GitLab, GitHub and Open Project.

TypeScript 16,152 1,317 Updated Nov 27, 2025

A project to improve skills of large language models

Python 626 115 Updated Nov 27, 2025

A simple yet powerful agent framework that delivers with open-source models

Python 3,879 378 Updated Nov 27, 2025

πŸ‹οΈ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

Python 320 61 Updated Sep 25, 2025

OpenAI Guardrails - Python

Python 144 18 Updated Nov 25, 2025

dParallel: Learnable Parallel Decoding for dLLMs

Python 42 1 Updated Oct 14, 2025

πŸ“œ Paper list on decoding methods for LLMs and LVLMs

65 1 Updated Nov 7, 2025

Super-fast Structured Outputs

Rust 619 41 Updated Nov 24, 2025

A framework for optimizing DSPy programs with RL

Python 285 22 Updated Nov 18, 2025

Universal CPU profiler designed for humans and AI agents

TypeScript 382 10 Updated Sep 13, 2025

Test your prompts, models, RAGs. Evaluate and compare LLM outputs, catch regressions, and improve prompt quality. LLM evals for OpenAI/Azure GPT, Anthropic Claude, VertexAI Gemini, Ollama, Local & …

TypeScript 2 Updated Jul 15, 2025

A Curated Collection of LLM resources (work in progress).

Python 372 60 Updated Nov 13, 2025

Train your Agent model via our easy and efficient framework

Python 1,632 155 Updated Nov 17, 2025

the LLM vulnerability scanner

Python 6,448 697 Updated Nov 26, 2025

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 14,102 1,162 Updated Nov 27, 2025

A lightweight, local-first, and πŸ†“ experiment tracking library from Hugging Face πŸ€—

Python 1,101 70 Updated Nov 26, 2025

The official code of ARPO & AEPO

Python 806 36 Updated Nov 15, 2025

MCP-based Agent Deep Evaluation System

Python 138 16 Updated Sep 26, 2025

Leetcode for Pytorch

Jupyter Notebook 1,686 193 Updated Jul 26, 2025

πŸ€– Just a command runner

Rust 28,863 616 Updated Nov 26, 2025
Next