Skip to content
View zjr2000's full-sized avatar
😢
Focusing
😢
Focusing
  • The Hong Kong Polytechnic University
  • Hong Kong

Highlights

  • Pro

Block or report zjr2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 617 35 Updated Jan 15, 2026
Python 578 60 Updated Sep 23, 2025

Efficient Actively Secure DPF and RAM-based 2PC with One-Bit Leakage

C++ 2 Updated Jan 3, 2026

Training library for Megatron-based models with bi-directional Hugging Face conversion capability

Python 359 139 Updated Jan 15, 2026

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 246 44 Updated Jan 15, 2026

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Python 88 3 Updated Dec 19, 2025

Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".

Python 42 2 Updated Dec 16, 2025

CaptionQA: Is Your Caption as Useful as the Image Itself?

Python 30 1 Updated Dec 10, 2025

PyTorch building blocks for the OLMo ecosystem

Python 698 125 Updated Jan 15, 2026

Easy and Efficient dLLM Fine-Tuning

Python 194 8 Updated Dec 15, 2025

Paper list for Efficient Reasoning.

793 31 Updated Jan 14, 2026

TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Python 389 30 Updated Dec 16, 2025

A lightweight Inference Engine built for block diffusion models

Python 39 5 Updated Dec 9, 2025

Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

33 1 Updated Nov 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,755 7,999 Updated Jan 13, 2026

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)

Python 321 17 Updated Dec 15, 2025
Python 129 3 Updated Jan 6, 2026

This is the official repository for the ICCV 2025 paper ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models

Jupyter Notebook 1 Updated Oct 20, 2025

Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…

Python 89 3 Updated Dec 27, 2025

Minimalistic large language model 3D-parallelism training

Python 2,417 267 Updated Dec 11, 2025

We have summarised all 3D anomaly detection methods and datasets (still updating). 多模态,点云和姿势无关异常检测的综述仓库(持续更新)

48 Updated Jan 14, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,537 128 Updated Jan 14, 2026

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 499 37 Updated Nov 11, 2025

The official github repo for "Diffusion Language Models are Super Data Learners".

Python 218 8 Updated Nov 6, 2025

GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.

Python 312 25 Updated Nov 11, 2025

Ongoing research training transformer models at scale

Python 14,911 3,490 Updated Jan 15, 2026

Megatron's multi-modal data loader

Python 304 37 Updated Jan 2, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,619 400 Updated Jan 15, 2026
Python 39 Updated Dec 16, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,692 58 Updated Dec 26, 2025
Next