Skip to content
View SihanXU's full-sized avatar
🫥
I may be slow to respond.
🫥
I may be slow to respond.

Highlights

  • Pro

Organizations

@sled-group

Block or report SihanXU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Python 127 4 Updated May 16, 2025

CSE 593 Project

TypeScript 2 Updated Nov 26, 2025
TypeScript 1 Updated Nov 27, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,394 62 Updated Nov 18, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,693 6,159 Updated Sep 18, 2024

PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566

Python 1,219 174 Updated Jan 26, 2023

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

Python 125 7 Updated Apr 10, 2025

About Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.

Python 83 4 Updated Nov 26, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,592 45 Updated Nov 15, 2025

JAX implementation of MeanFlow

Python 472 17 Updated Jul 30, 2025
Python 80 3 Updated Jul 24, 2025
Python 44 2 Updated Jun 22, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,778 6,543 Updated Nov 27, 2025

Self-reimplemented version of 4D-LRM.

63 Updated May 30, 2025

📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.

332 15 Updated Oct 16, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,801 110 Updated Sep 27, 2024

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,430 67 Updated Mar 16, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,271 102 Updated Oct 29, 2025

s1: Simple test-time scaling

Python 6,607 763 Updated Jun 25, 2025

Collect every awesome work about r1!

Python 423 15 Updated May 2, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,198 66 Updated Feb 25, 2025

Official Jax Implementation of MaskGIT

Jupyter Notebook 542 53 Updated Nov 18, 2022

Minimal reproduction of DeepSeek R1-Zero

Python 12,432 1,525 Updated Apr 24, 2025

视觉小说翻译器 / Visual Novel Translator

C++ 9,722 986 Updated Nov 26, 2025

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 935 43 Updated Sep 27, 2024

A tiny paper rating web

HTML 38 Updated Mar 19, 2025

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 580 38 Updated Jul 29, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,110 31,242 Updated Nov 27, 2025

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,616 531 Updated Oct 16, 2024
Python 184 4 Updated Dec 17, 2024
Next