Starred repositories
Fully autonomous AI hacker to find actual exploits in your web apps. Shannon has achieved a 96.15% success rate on the hint-free, source-aware XBOW Benchmark.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
All Algorithms implemented in Python
LLM agents built for control. Designed for real-world use. Deployed in minutes.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
An open-source RAG-based tool for chatting with your documents.
A scikit-learn compatible neural network library that wraps PyTorch
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Google Research
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
TripoSR: Fast 3D Object Reconstruction from a Single Image
A guideline for building practical production-level deep learning systems to be deployed in real-world applications.
A unified framework for 3D content generation.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Port of OpenAI's Whisper model in C/C++
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Sparsity-aware deep learning inference runtime for CPUs
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
FastAPI framework, high performance, easy to learn, fast to code, ready for production