Organizations

@microsoft @ivclab

Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.

JavaScript 228 19 Updated Nov 6, 2025

[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Python 418 24 Updated Sep 18, 2025

Kolors Team

Python 4,578 349 Updated Nov 13, 2024

[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Python 69 7 Updated Oct 27, 2024

LLM training in simple, raw C/CUDA

Cuda 28,253 3,297 Updated Jun 26, 2025

Transparent Image Layer Diffusion using Latent Transparency

2,175 35 Updated Jun 16, 2024

Generative models for conditional audio generation

Python 3,511 399 Updated Oct 9, 2025

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 935 43 Updated Sep 27, 2024

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Python 242 18 Updated Apr 6, 2024

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,587 233 Updated Jun 14, 2024

GPT-4V in Wonderland: LMMs as Smartphone Agents

Python 135 2 Updated Jul 17, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,751 6,534 Updated Nov 26, 2025

Consistency Distilled Diff VAE

Python 2,202 77 Updated Nov 7, 2023

Examples and guides for using the OpenAI API

Jupyter Notebook 69,379 11,634 Updated Nov 26, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,131 394 Updated Jul 11, 2024

Rich-Text-to-Image Generation

Python 800 68 Updated Oct 9, 2023

[ACM MM 2023] Official implementation of paper "Language-guided Human Motion Synthesis with Atomic Actions".

Python 29 1 Updated Jun 28, 2024

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Python 317 11 Updated Jan 20, 2025

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,350 469 Updated Nov 12, 2025

Inference code for Llama models

Python 58,946 9,818 Updated Jan 26, 2025

The unofficial python package that returns response of Google Bard through cookie value.

Python 5,234 509 Updated Apr 24, 2024

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Python 291 15 Updated Mar 13, 2024

A PyTorch implementation of EmpiricalMVM

Python 41 2 Updated Dec 18, 2023

pytorch implementation of openpose including Hand and Body Pose Estimation.

Jupyter Notebook 2,291 415 Updated Jul 9, 2024

[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

Python 780 23 Updated Mar 3, 2025

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Python 1,084 105 Updated Jul 22, 2024
Python 15 Updated Jun 21, 2023

PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"

Python 37 Updated Oct 11, 2023

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Python 323 36 Updated Aug 1, 2023

Universal LLM Deployment Engine with ML Compilation

Python 21,648 1,863 Updated Nov 25, 2025