Stars
AI API server for common use cases — supports multiple models and providers. Run locally with Ollama or LM Studio, or in the cloud via OpenRouter, OpenAI, Anthropic, or Google.
Multi-Language Backend Framework that unifies APIs, background jobs, queues, workflows, streams, and AI agents with a single core primitive with built-in observability and state management.
xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
A modular, documentation-driven framework using Cursor custom modes (VAN, PLAN, CREATIVE, IMPLEMENT) to provide persistent memory and guide AI through a structured development workflow with visual …
Compare open-source local LLM inference projects by their metrics to assess popularity and activeness.
Convert PDF to markdown + JSON quickly with high accuracy
Implementation of Nougat Neural Optical Understanding for Academic Documents
A high-throughput and memory-efficient inference and serving engine for LLMs
End-to-end documentation to set up your own local & fully private LLM server on Debian. Equipped with chat, web search, RAG, model management, MCP servers, image generation, and TTS.
⚡Ship RAG Solutions Quickly and effortlessly
An LLM playground you can run on your laptop
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding mod…
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Integrate cutting-edge LLM technology quickly and easily into your apps
Large Language Models: In this repository Language models are introduced covering both theoretical and practical aspects.
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
C# implementation of LangChain. We try to be as close to the original as possible in terms of abstractions, but are open to new entities.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Run and train Transformer based Large Language Models (LLMS) natively in .NET using TorchSharp
An app that brings language models directly to your phone.
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.