Skip to content
View xesdiny's full-sized avatar

Block or report xesdiny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Interactive visualization of Manifold-Constrained Hyper-Connections (mHC) for stable deep network training

TypeScript 20 2 Updated Jan 11, 2026

A safetensors extension to efficiently store sparse quantized tensors on disk

Python 233 49 Updated Jan 15, 2026

[FPGA'26 Best Paper Nomination] CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving

C++ 13 1 Updated Nov 23, 2025

Fully Open Framework for Democratized Multimodal Reinforcement Learning.

Python 37 1 Updated Dec 19, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,692 59 Updated Dec 26, 2025

Official repo of Promoting Efficient Reasoning with Verifiable Stepwise Reward

Python 15 1 Updated Sep 9, 2025

Fully Open Framework for Democratized Multimodal Training

Python 689 56 Updated Dec 27, 2025

Official implementation of "DPad: Efficient Diffusion Language Models with Suffix Dropout"

Python 54 5 Updated Nov 22, 2025

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)

Python 321 17 Updated Dec 15, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,181 1,139 Updated Jan 15, 2026

Easy and Efficient dLLM Fine-Tuning

Python 194 8 Updated Dec 15, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 4,724 399 Updated Jan 15, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,664 2,234 Updated Feb 1, 2025

Defeating the Training-Inference Mismatch via FP16

Python 176 15 Updated Nov 14, 2025

The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".

Python 30 1 Updated Nov 12, 2024
Python 848 62 Updated Nov 6, 2025

Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left…

Python 118 3 Updated Jan 10, 2026

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 409 15 Updated Jul 11, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,255 125 Updated Nov 9, 2025

The author's implementation of FUDOKI, a multimodal large language model purely based on discrete flow matching.

Python 67 3 Updated Dec 21, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,557 82 Updated Nov 16, 2025

Dream 7B, a large diffusion language model

Python 1,142 74 Updated Nov 21, 2025

ScaleRL Curve Fitting

Python 14 2 Updated Oct 13, 2025

Automatic Video Generation from Scientific Papers

Python 2,063 303 Updated Oct 20, 2025
Python 6 Updated Oct 22, 2025

Official Jax Implementation of MD4 Masked Diffusion Models

Python 151 15 Updated Feb 27, 2025

[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey

Python 354 3 Updated Nov 1, 2025

Contexts Optical Compression

Python 22,036 2,007 Updated Oct 25, 2025

Falcon is a continuously-evolving, high-quality benchmark for natural-language-to-SQL (Text2SQL) systems.

HTML 42 1 Updated Dec 22, 2025

A high-performance kernel library for LLM training

Python 57 6 Updated Jan 14, 2026
Next