YS-IMTech

Starred repositories

SAM 3D Objects

Python 3,466 224 Updated Nov 21, 2025

Cambrian-S: Towards Spatial Supersensing in Video

Python 383 9 Updated Nov 10, 2025

A list of open source games.

8,059 617 Updated Nov 20, 2025

Depth Anything 3

Jupyter Notebook 2,704 188 Updated Nov 20, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 583 30 Updated Nov 18, 2025

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Python 602 33 Updated Nov 20, 2025

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

Python 204 4 Updated Nov 24, 2025

Native Multimodal Models are World Learners

Python 1,278 44 Updated Nov 19, 2025

Krea Realtime 14B. An open-source realtime AI video model.

Python 392 22 Updated Nov 13, 2025

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 344 12 Updated Oct 27, 2025

This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.

HTML 142 10 Updated Oct 27, 2025

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 497 41 Updated Oct 29, 2025

🚀🚀 "Large model": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 34,385 4,022 Updated Nov 19, 2025

Infinite-Forcing: Towards Infinite-Long Video Generation

Python 92 2 Updated Nov 13, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,476 599 Updated Nov 20, 2025

A tool for running and customizing real-time, interactive generative AI pipelines and models

Python 78 14 Updated Nov 24, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,581 45 Updated Nov 15, 2025

Ctrl-World: A Controllable Generative World Model for Robot Manipulation

Python 183 12 Updated Oct 25, 2025

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 184 6 Updated Oct 12, 2025

A simple state update rule to enhance length generalization for CUT3R

Python 515 14 Updated Oct 1, 2025

AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling

Python 143 5 Updated Oct 17, 2025

The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”

102 1 Updated Oct 7, 2025

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 247 9 Updated Oct 31, 2025

LongLive: Real-time Interactive Long Video Generation

Python 831 55 Updated Nov 3, 2025

An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 145 6 Updated Nov 5, 2025

A minimal implementation of DeepMind's Genie world model

Python 1,033 76 Updated Nov 22, 2025

An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 100 3 Updated Sep 28, 2025

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 592 32 Updated Oct 2, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,310 1,321 Updated Nov 20, 2025