-
Tencent
- Shenzhen, China
- https://xinntao.github.io/
Stars
TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal video reasoning.
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
[ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
MoBA: Mixture of Block Attention for Long-Context LLMs
[NeurIPS 2025] Improving Video Generation with Human Feedback
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Excalidraw app for mac. Powered by pure SwiftUI.
Let your Claude able to think
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
SEED-Voken: A Series of Powerful Visual Tokenizers
A PyTorch native platform for training generative AI models
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.