-
GVC Lab, Great Bay University
- Dongguan, China
-
12:46
(UTC +08:00) - http://vinthony.github.io
- @shadocun
Lists (4)
Sort Name ascending (A-Z)
Stars
- All languages
- AppleScript
- Assembly
- BibTeX Style
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Common Workflow Language
- Cuda
- Cython
- D
- Dart
- Dockerfile
- GDScript
- GLSL
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- NASL
- Nunjucks
- Objective-C
- OpenEdge ABL
- PHP
- Perl
- PowerShell
- Protocol Buffer
- Python
- QML
- R
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- SWIG
- Scala
- Scheme
- Shell
- Swift
- Tcl
- TeX
- TypeScript
- Vim Script
- Vue
EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
A Foundation Model for Generalist Gaming Agents
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
An open-source academic paper management tool.
🚀 A curated collection of papers focusing on LLM-based quantitative trading.
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"
Kandinsky 5.0: A family of diffusion models for Video & Image generation
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
This is a framework for evaluating reasoning in foundational Video Models.
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
MentraOS is the leading smart glasses platform + SDK. Stream your view, transcribe audio, talk to AI and capture photos hands-free on compatible glasses.
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
[SIGGRAPH Asia 2025] The official repo for the conference paper "MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis".
[DEIMv2] Real Time Object Detection Meets DINOv3