Organizations

@vertxai

The best ChatGPT that $100 can buy.

Python 37,582 4,604 Updated Nov 17, 2025

A bridge to use Langchain output as an OpenAI-compatible API

Python 83 16 Updated Jul 11, 2025

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 16,432 1,370 Updated Nov 25, 2025

Nano vLLM

Python 9,263 1,134 Updated Nov 3, 2025

Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling

Go 115 26 Updated Nov 21, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,082 237 Updated Nov 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,632 2,653 Updated Nov 26, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 237 37 Updated Nov 22, 2025

Tile primitives for speedy kernels

Cuda 2,950 202 Updated Nov 26, 2025

the LLM vulnerability scanner

Python 6,440 696 Updated Nov 24, 2025

Nix Packages collection & NixOS

Nix 22,545 17,317 Updated Nov 26, 2025

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 378 18 Updated Apr 13, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,547 709 Updated Nov 26, 2025

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 1,539 432 Updated Nov 26, 2025

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,256 891 Updated Nov 26, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 731 191 Updated Nov 26, 2025

s1: Simple test-time scaling

Python 6,605 763 Updated Jun 25, 2025

macOS packaging for ungoogled-chromium

Shell 568 91 Updated Nov 23, 2025

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,695 668 Updated Nov 26, 2025

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

Python 24,358 2,174 Updated Nov 14, 2025

High-Performance SGEMM on CUDA devices

Cuda 112 5 Updated Jan 21, 2025

Minimalist ML framework for Rust

Rust 18,660 1,314 Updated Nov 25, 2025

Community-maintained Kubernetes config and Helm chart for Langfuse

Smarty 190 117 Updated Nov 24, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 734 81 Updated Apr 6, 2025

Go microservice template for Kubernetes

Go 5,769 1,831 Updated Nov 26, 2025

Open source Loom alternative. Beautiful, shareable screen recordings.

TypeScript 15,027 1,036 Updated Nov 22, 2025

Blazingly fast LLM inference.

Rust 6,244 481 Updated Nov 25, 2025

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 18,665 1,819 Updated Nov 26, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 267 10 Updated Oct 11, 2024

Build Conversational AI in minutes ⚡️

Python 11,052 1,593 Updated Nov 25, 2025