applegrew

Follow

AppleGrew applegrew

Follow

32 followers · 8 following

Achievements

Achievements

Stars

character-ai / Ovi

Python 582 70 Updated Oct 11, 2025

OHF-Voice / piper1-gpl

Fast and local neural text-to-speech engine

C++ 1,212 131 Updated Sep 10, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,422 2,041 Updated Jul 17, 2025

Michael-A-Kuykendall / shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,008 211 Updated Oct 13, 2025

liquidos-ai / AutoAgents

A multi-agent framework written in Rust that enables you to build, deploy, and coordinate multiple intelligent agents

Rust 126 25 Updated Oct 13, 2025

deepbeepmeep / Wan2GP

Forked from Wan-Video/Wan2.1

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 2,968 409 Updated Oct 15, 2025

kornia / kornia-rs

🦀 Low-level 3D Computer Vision library in Rust

Rust 446 77 Updated Oct 14, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 13,426 1,449 Updated Oct 10, 2025

OpenBMB / VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 1,757 180 Updated Oct 9, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,623 139 Updated Oct 9, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,093 1,054 Updated Oct 12, 2025

Lightricks / LTX-Video

Official repository for LTX-Video

Python 8,281 745 Updated Jul 21, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 13,874 1,810 Updated Sep 25, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,769 465 Updated May 5, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,587 1,600 Updated Jul 6, 2025

microsoft / VibeVoice

Frontier Open-Source Text-to-Speech

9,595 1,190 Updated Sep 5, 2025

Tencent-Hunyuan / HunyuanWorld-1.0

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,268 182 Updated Sep 24, 2025

Tencent-Hunyuan / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,067 1,163 Updated Oct 6, 2025

Danial-Kord / DigiHuman

Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques

C# 549 85 Updated Feb 4, 2024

facebookresearch / sapiens

High-resolution models for human tasks.

Python 5,173 304 Updated Nov 18, 2024

open-mmlab / mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 6,944 1,398 Updated Aug 4, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,168 1,405 Updated May 27, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,161 1,098 Updated Aug 27, 2025

kijai / ComfyUI-HunyuanVideoWrapper

Python 2,546 197 Updated Aug 20, 2025

nextcloud / all-in-one

📦 The official Nextcloud installation method. Provides easy deployment and maintenance with most features included in this one Nextcloud instance.

PHP 8,091 889 Updated Oct 14, 2025

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,596 130 Updated Jan 24, 2025

genmoai / mochi

The best OSS video generation models, created by Genmo

Python 3,455 437 Updated Sep 5, 2025

jbilcke-hf / FacePoke

Select a portrait, click to move the head around (please use your own space / GPU!)

JavaScript 898 97 Updated Aug 18, 2025

flowdriveai / flowpilot

flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines.

C 1,775 250 Updated Sep 19, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,997 806 Updated Oct 9, 2025