Skip to content
View wilson1yan's full-sized avatar

Block or report wilson1yan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,787 2,173 Updated Jul 17, 2025

Inference script for Oasis 500M

Python 1,982 172 Updated Nov 8, 2024

Monitor Memory usage of Python code

Python 4,538 387 Updated Apr 29, 2024

Train VAE like a boss

Jupyter Notebook 301 13 Updated Oct 21, 2024
Python 147 31 Updated Nov 7, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,061 521 Updated Jun 9, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,735 272 Updated Jul 18, 2025

ElasticTok: Adaptive Tokenization for Image and Video

Python 86 Updated Nov 4, 2024

Pythonic bindings for FFmpeg's libraries.

Python 3,039 412 Updated Nov 27, 2025

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 132 5 Updated Jun 4, 2025

[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"

Python 1,079 44 Updated Oct 9, 2024

Fast dataset format and loader

Python 23 1 Updated Jan 17, 2025

A simple library for scaling up JAX programs

Python 144 12 Updated Nov 4, 2025

Grok open release

Python 50,576 8,381 Updated Aug 30, 2024

Fast and reliable distributed systems in Python

Python 32 1 Updated Apr 20, 2025

Inference code for Llama models

Python 58,951 9,819 Updated Jan 26, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,380 561 Updated Oct 19, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,405 247 Updated Dec 3, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,066 2,670 Updated Aug 12, 2024

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,440 240 Updated Jul 31, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 946 39 Updated Mar 19, 2025

The official repository of "Video assistant towards large language model makes everything easy"

Python 232 14 Updated Dec 24, 2024

A framework for few-shot evaluation of language models.

Python 10,780 2,879 Updated Nov 27, 2025

Mamba SSM architecture

Python 16,558 1,513 Updated Nov 11, 2025

Youtube-8m Videos, Frames and Ids Generator. Extract videos from youtube-8m. Extract frames from youtube-8m.

Shell 129 32 Updated May 7, 2019

Video-P2P: Video Editing with Cross-attention Control

Python 423 26 Updated Jun 30, 2025

[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,133 57 Updated Sep 13, 2025

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 942 62 Updated Nov 13, 2024

jax-triton contains integrations between JAX and OpenAI Triton

Python 436 53 Updated Nov 24, 2025
Next