Skip to content
View applegrew's full-sized avatar

Block or report applegrew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 582 70 Updated Oct 11, 2025

Fast and local neural text-to-speech engine

C++ 1,212 131 Updated Sep 10, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,422 2,041 Updated Jul 17, 2025

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,008 211 Updated Oct 13, 2025

A multi-agent framework written in Rust that enables you to build, deploy, and coordinate multiple intelligent agents

Rust 126 25 Updated Oct 13, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 2,968 409 Updated Oct 15, 2025

🦀 Low-level 3D Computer Vision library in Rust

Rust 446 77 Updated Oct 14, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,426 1,449 Updated Oct 10, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 1,757 180 Updated Oct 9, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,623 139 Updated Oct 9, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,093 1,054 Updated Oct 12, 2025

Official repository for LTX-Video

Python 8,281 745 Updated Jul 21, 2025

SoTA open-source TTS

Python 13,874 1,810 Updated Sep 25, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,769 465 Updated May 5, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,587 1,600 Updated Jul 6, 2025

Frontier Open-Source Text-to-Speech

9,595 1,190 Updated Sep 5, 2025

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,268 182 Updated Sep 24, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,067 1,163 Updated Oct 6, 2025

Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques

C# 549 85 Updated Feb 4, 2024

High-resolution models for human tasks.

Python 5,173 304 Updated Nov 18, 2024

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 6,944 1,398 Updated Aug 4, 2025

A Conversational Speech Generation Model

Python 14,168 1,405 Updated May 27, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,161 1,098 Updated Aug 27, 2025

📦 The official Nextcloud installation method. Provides easy deployment and maintenance with most features included in this one Nextcloud instance.

PHP 8,091 889 Updated Oct 14, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,596 130 Updated Jan 24, 2025

The best OSS video generation models, created by Genmo

Python 3,455 437 Updated Sep 5, 2025

Select a portrait, click to move the head around (please use your own space / GPU!)

JavaScript 898 97 Updated Aug 18, 2025

flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines.

C 1,775 250 Updated Sep 19, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,997 806 Updated Oct 9, 2025
Next