Stars
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
A library for efficient similarity search and clustering of dense vectors.
🔥 1Panel provides an intuitive web interface and MCP Server to manage websites, files, containers, databases, and LLMs on a Linux server.
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.
Turn any computer or edge device into a command center for your computer vision projects.
The Gretel Python Client allows you to interact with the Gretel REST API.
Streamlit — A faster way to build and share data apps.
Official PyTorch implementation for "Large Language Diffusion Models"
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
The python library for real-time communication
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
A toolkit for making real world machine learning and data analysis applications in C++
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
a register machine written in c and python3
We write your reusable computer vision tools. 💜
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Efficient vision foundation models for high-resolution generation and perception.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Optimizing inference proxy for LLMs
A Dockerfile for LLM training with Unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
本项目采用Firefly模型训练框架,使用LLAMA-2模型对多项选择阅读理解任务(Multiple Choice MRC)进行微调,取得了显著的进步。
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
A voice to voice chabot for RAG on your documents using Qdrant