Starred repositories
Proxy that exposes Antigravity-provided Claude / Gemini models, so we can use them in Claude Code
PostgreSQL extension for supporting deep learning model inference within the database and vector storage
A C++ header-only HTTP/HTTPS server and client library
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
PegaFlow is a high-performance KV cache offloading solution for vLLM v1 on single-node multi-GPU setups.
The AI-Native Search Database. Unifies vector, text, structured and semi-structured data in a single engine, enabling hybrid search and in-database AI workflows.
VRAFT is a framework written in C++ that implements the RAFT protocol and the SEDA architecture. Distributed software, such as a vectordb or a distributed storage system, can be developed easily on top of VRAFT.
Democratizing large model inference and training on any device.
C++ implementation of a fast hash map and hash set using robin hood hashing
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
A transformer-based LLM, written completely in Rust
AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.
Embeddable Postgres with real-time, reactive bindings.
Integrates DuckDB with Google BigQuery, allowing direct querying and management of BigQuery datasets
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.
Supercharge Your LLM with the Fastest KV Cache Layer
Official Repository of "LLM × DATA" Survey Paper
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.