Skip to content
View Murgio's full-sized avatar

Block or report Murgio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenTelemetry Instrumentation for AI Observability

Python 700 155 Updated Nov 7, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,085 373 Updated Nov 7, 2025

OpenAI Guardrails - Python

Python 123 17 Updated Nov 7, 2025

Pretraining data reconstruction scripts for Apertus

Python 100 8 Updated Oct 27, 2025

Tech Report of the Apertus LLM Suite

121 4 Updated Sep 18, 2025

Response format to be used with apertus

Python 6 1 Updated Sep 1, 2025

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Python 3,814 218 Updated Oct 30, 2025

Opensource benchmark evaluating web operators/agents performance

Python 44 7 Updated Apr 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,127 1,909 Updated Nov 1, 2025

A library for making RepE control vectors

Jupyter Notebook 656 49 Updated Sep 24, 2025

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 143 6 Updated Sep 20, 2024

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,685 440 Updated Nov 4, 2025

Temporal Python SDK

Python 855 140 Updated Nov 7, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,791 599 Updated Nov 6, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,164 162 Updated Nov 7, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,464 295 Updated Oct 29, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,370 1,522 Updated Apr 24, 2025
Python 374 30 Updated Oct 16, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,765 99 Updated Mar 18, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,229 617 Updated Nov 7, 2025

A lightweight LMM-based Document Parsing Model

Python 6,161 428 Updated Oct 25, 2025

Code and Data for Tau-Bench

Python 934 145 Updated Aug 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,038 7,499 Updated Nov 6, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,681 366 Updated Oct 21, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,248 1,121 Updated Sep 26, 2025

Environments for LLM Reinforcement Learning

Python 3,464 428 Updated Nov 7, 2025

Nano vLLM

Python 8,468 1,030 Updated Nov 3, 2025

A playbook for systematically maximizing the performance of deep learning models.

29,351 2,399 Updated Jun 18, 2024

🤗 smolagents: a barebones library for agents that think in code.

Python 23,824 2,099 Updated Nov 7, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 561 47 Updated Oct 31, 2025
Next