ML Models
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, macOS and Windows.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
Distribute and run LLMs with a single file.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Pure JavaScript OCR for more than 100 Languages 📖🎉🖥
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Instant voice cloning by MIT and MyShell. Audio foundation model.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
OpenUI lets you describe UI using your imagination, then see it rendered live.
Question and Answer based on Anything.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.
Image inpainting tool powered by SOTA AI models. Remove any unwanted objects, defects, or people from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Open source, local, and self-hosted voice assistant, a competitive alternative to Amazon Echo/Google Home
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Convert PDF to markdown + JSON quickly with high accuracy
Reliable Multi-Agent Orchestration Framework (Extension of Agents SDK)
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using multiple LLMs, with no GPU needed. The user can ask a question and the system will make a mu…