MiroThinker is an open-source search agent model, built for tool-augmented reasoning and real-world information seeking, aiming to match the deep research experience of OpenAI Deep Research and Gem…

Python 4,693 315 Updated Jan 13, 2026

thunderbolt215 / UniPercept

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Python 70 Updated Jan 7, 2026

ASLP-lab / VoiceSculptor

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 147 9 Updated Jan 10, 2026

antgroup / echomimic_v3

[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation

Python 701 74 Updated Nov 24, 2025

Lightricks / LTX-2

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 2,220 247 Updated Jan 12, 2026

TencentARC / VideoPainter

[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"

Python 547 39 Updated Apr 8, 2025

knightyxp / VideoCoF

VideoCoF: Unified Video Editing with Temporal Reasoner

Python 123 7 Updated Jan 2, 2026

ant-research / CoDeF

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,863 382 Updated Apr 7, 2024

sczhou / ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 6,461 763 Updated Feb 19, 2025

Playmate111 / Playmate2

[AAAI 2026] Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback

Python 291 28 Updated Nov 21, 2025

code-yeongyu / oh-my-opencode

The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.

TypeScript 15,737 1,075 Updated Jan 13, 2026

JIA-Lab-research / DreamOmni3

This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'

32 Updated Dec 30, 2025

anthropics / skills

Public repository for Agent Skills

Python 39,199 3,572 Updated Dec 20, 2025

prs-eth / stereospace

Python 61 5 Updated Dec 16, 2025

JIA-Lab-research / RePlan

RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing

Python 52 2 Updated Dec 26, 2025

Tencent-Hunyuan / HY-MT

Python 390 40 Updated Jan 1, 2026

Tencent-Hunyuan / HY-Motion-1.0

HY-Motion model for 3D character animation generation.

Python 1,746 124 Updated Jan 4, 2026

GAIR-NLP / LiveTalk

Python 205 15 Updated Jan 2, 2026

HisMax / RedInk

红墨 (Red Ink) - A one-stop Xiaohongshu image-and-text generator based on the 🍌Nano Banana Pro🍌, "One Sentence, One Image: Generate Xiaohongshu Text …

Python 4,461 868 Updated Dec 29, 2025

Tongyi-MAI / MAI-UI

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,442 152 Updated Jan 12, 2026

jsz-05 / LLM-State-Machine

Framework for building conversational agents using a Finite State Machine (FSM) and LLMs

Python 62 8 Updated Apr 23, 2025

rayray9999 / Genfocus

Python 259 23 Updated Jan 8, 2026

Kevin-thu / StoryMem

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 606 58 Updated Dec 26, 2025

FunAudioLLM / Fun-Audio-Chat

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 667 66 Updated Dec 25, 2025

antvis / Infographic

🦋 An Infographic Generation and Rendering Framework, bring words to life with AI!

TypeScript 3,759 253 Updated Jan 13, 2026

ag-ui-protocol / ag-ui

AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.

TypeScript 11,341 1,041 Updated Jan 13, 2026


PeterYoung PeterYoungQaQ
