Lists (8)
Sort Name ascending (A-Z)
Starred repositories
This repository contains the Hugging Face Agents Course.
CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.
A curated list of seminal and influential research papers in artificial intelligence, covering key topics in machine learning, deep learning, NLP, computer vision, reinforcement learning, and AI et…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
maepopi / Zonos
Forked from Zyphra/ZonosZonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
maepopi / Spark-TTS
Forked from SparkAudio/Spark-TTSSpark-TTS Inference Code
maepopi / 3DTopia-XL
Forked from 3DTopia/3DTopia-XL[CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
[SIGGRAPH Asia 2024 (Conference Track)] Boosting 3D Object Generation through PBR Materials
maepopi / CraftsMan3D
Forked from HKUST-SAIL/CraftsMan3DCraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
Bark Voice Cloning and Voice Cloning for Chinese Speech
maepopi / Nari-Dia-TTS
Forked from pinokiofactory/Nari-Dia-TTSNari Dia is a powerful text-to-speech (TTS) application based on the Dia-1.6B model from Nari Labs. This application allows you to convert text into natural-sounding speech with various customizati…
maepopi / TripoSG
Forked from VAST-AI-Research/TripoSGTripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
maepopi / tts-generation-webui
Forked from rsxdalv/TTS-WebUITTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Nari Dia is a powerful text-to-speech (TTS) application based on the Dia-1.6B model from Nari Labs. This application allows you to convert text into natural-sounding speech with various customizati…
LLM Finetuning with peft
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)
The source code for the official CG Texture Upscaler: https://www.mohamedbenaicha.ca/upscaler
A blender addon which allow the upscaling of image directly in blender using ai
This is the official website of our work 3D Appearance Super-Resolution with Deep Learning published on CVPR2019.
0lento / TRELLIS
Forked from microsoft/TRELLISTRELLIS fork with additional memory handling.
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
Used for AI model generation, next-generation Blender rendering engine, texture enhancement&generation (based on ComfyUI)
Code of MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation;
[ICCV 2025] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
some materials about mesh processing, including papers, videos, codes, and so on. Updating every day!