sfpprxy

Stars

🧠 LLM

23 repositories

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,556 3,683 Updated Jul 9, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 50,477 5,315 Updated Oct 27, 2025

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,510 77 Updated Sep 25, 2025

🦜🔗 Build context-aware reasoning applications

Python 118,241 19,471 Updated Oct 27, 2025

🧠 AI-powered enterprise search engine 🔎

Python 2,805 180 Updated Dec 29, 2023

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 14,866 1,116 Updated Oct 28, 2025

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,820 3,078 Updated Oct 27, 2025

Neural search for websites, docs, articles - online!

Rust 142 7 Updated Jul 31, 2025

Automatic question answering over local knowledge bases, based on LangChain and LLMs such as the ChatGLM-6B series

Python 3,291 494 Updated Apr 15, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,914 7,366 Updated Oct 27, 2025

Scripts for fine-tuning Llama2 via SFT and DPO.

Python 204 38 Updated Aug 14, 2023

Inference Llama 2 in one file of pure C

C 18,883 2,394 Updated Aug 6, 2024

FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 26,126 6,714 Updated Oct 28, 2025

Humanable Chat Generative-model Fine-tuning | LLM fine-tuning

Python 206 21 Updated Sep 22, 2023

A natural language interface for computers

Python 60,717 5,207 Updated Oct 24, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,821 1,612 Updated Oct 28, 2025

Production-ready platform for agentic workflow development.

TypeScript 117,467 18,150 Updated Oct 28, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,668 754 Updated Oct 28, 2025

The UI design language and React library for Conversational UI

TypeScript 3,924 384 Updated Oct 28, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,285 2,260 Updated Sep 24, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,071 187 Updated Jun 30, 2025

Large Language Model Text Generation Inference

Python 10,600 1,236 Updated Sep 17, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,371 448 Updated Aug 2, 2025