Stars
Python tool for converting files and office documents to Markdown.
The official Python SDK for Model Context Protocol servers and clients
Real-time webcam demo with SmolVLM and llama.cpp server
A Conversational Speech Generation Model
RobertAgee / dia
Forked from nari-labs/diaA TTS model capable of generating ultra-realistic dialogue in one pass.
anan235 / dia-multilingual
Forked from nari-labs/diaA TTS model capable of generating ultra-realistic dialogue in one pass.
Tool for generating high quality Synthetic datasets
A TTS model capable of generating ultra-realistic dialogue in one pass.
It's a python based Tetris Game - works well with Mac Apple Silicon (M Series Chips) built with the help of AI
MARS5 speech model (TTS) from CAMB.AI
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
LiveKit's Plugin for Uplift AI - An Urdu Text-to-Speech Model
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
itsitgroup / repo2txt
Forked from abinthomasonline/repo2txtWeb-based tool converts GitHub repository contents into a single formatted text file
Leap: AI-powered educational animation generator
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
An example LiveKit Voice Pipeline Agent with Qdrant VectorDB for RAG
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
Spark Stack is an tool for building web applications through an AI-powered chat interface. Create quick MVPs and prototypes using natural language prompts.
A modular graph-based Retrieval-Augmented Generation (RAG) system
This repository is designed to be a user-facing StreamLit based frontend for LLM-powered AI Call Analysis Demo app. it's hosted on streamlit.io. Contact us if you need an API Key to test it out. am…
This repository serves as a StreamLit frontend for Computer Vision based OCR that our AI/ML Engineers developed as a Proof of Concept (POC).
Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.
The evaluation pipeline for the 2024 BabyLM Challenge.
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Implicit Style-Content Separation using B-LoRA