Skip to content
View rootfs's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@openshift @ceph @coreos-inc @rook @fast-ml @redhat-et @os-climate @llm-d

Block or report rootfs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Bringing BERT into modernity via both architecture changes and scaling

Python 1,594 134 Updated Jun 30, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 12,516 1,218 Updated Nov 29, 2025

A framework for efficient model inference with omni-modality models

Python 998 136 Updated Dec 19, 2025

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

Python 54 5 Updated Dec 8, 2025

The Future of Data Engineering — A CLI SQL client for the modern data stack, enabling AI-native context engineering for data.

Python 772 113 Updated Dec 19, 2025

Intelligent Router for Mixture-of-Models

Go 2,495 333 Updated Dec 19, 2025

LLM Semantic Router: Intelligent Mixture-of-Models (MoM) System with Privacy Preservation and Prompt Guard. The semantic router intelligently directs OpenAI compliant API requests to the most suita…

Python 19 11 Updated Aug 30, 2025

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Swift 2,906 444 Updated Jan 4, 2025

CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on Apple's MobileCLIP-S0 architecture, it ensures optimal perfor…

Swift 89 11 Updated Jul 25, 2024

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,198 266 Updated Dec 16, 2025

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 467 55 Updated Apr 19, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,533 977 Updated Dec 13, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,888 310 Updated Mar 10, 2025
Python 4,573 370 Updated Dec 19, 2025

Cloud Native Observability and Policy Engine for LLM Applications

Python 8 1 Updated Feb 10, 2025

GitHub Action to Create an AWS EC2 Self-hosted Runner

Shell 3 1 Updated Oct 3, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

2,176 178 Updated Apr 30, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,314 12,140 Updated Dec 19, 2025

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…

Python 11,143 1,159 Updated Dec 12, 2024

Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget

Python 163 9 Updated Aug 11, 2025

Carbon Limiting Auto Tuning for Kubernetes

Go 37 8 Updated Nov 11, 2024

Grok open release

Python 50,572 8,373 Updated Aug 30, 2024

vLLM Router

Python 52 2 Updated Mar 11, 2024
Jupyter Notebook 8 1 Updated Apr 28, 2024

A reproduction of the Gemini demo using GPT-vision.

JavaScript 126 45 Updated Dec 20, 2023

Create an AWS EC2 Github Action Self hosted Runner

Shell 1 Updated Dec 12, 2023

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 9,047 931 Updated Dec 3, 2025

code samples for the goodreads datasets

Jupyter Notebook 294 62 Updated Feb 4, 2025

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 18,239 2,571 Updated Dec 6, 2025
Next