

A safetensors extension to efficiently store sparse quantized tensors on disk

Python 230 49 Updated Jan 8, 2026

[FPGA'26 Best Paper Nomination] CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving

C++ 12 1 Updated Nov 23, 2025

Fully Open Framework for Democratized Multimodal Reinforcement Learning.

Python 34 1 Updated Dec 19, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,685 58 Updated Dec 26, 2025

Official repo of Promoting Efficient Reasoning with Verifiable Stepwise Reward

Python 15 1 Updated Sep 9, 2025

Fully Open Framework for Democratized Multimodal Training

Python 681 54 Updated Dec 27, 2025

Official implementation of "DPad: Efficient Diffusion Language Models with Suffix Dropout"

Python 54 5 Updated Nov 22, 2025

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)

Python 316 17 Updated Dec 15, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,062 1,114 Updated Jan 9, 2026

Easy and Efficient dLLM Fine-Tuning

Python 193 7 Updated Dec 15, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 4,540 384 Updated Jan 9, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,663 2,237 Updated Feb 1, 2025

Defeating the Training-Inference Mismatch via FP16

Python 172 15 Updated Nov 14, 2025

The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".

Python 29 1 Updated Nov 12, 2024
Python 847 62 Updated Nov 6, 2025

Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left…

Python 118 3 Updated Oct 27, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 407 14 Updated Jul 11, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,239 122 Updated Nov 9, 2025

The author's implementation of FUDOKI, a multimodal large language model purely based on discrete flow matching.

Python 66 3 Updated Dec 21, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,549 80 Updated Nov 16, 2025

Dream 7B, a large diffusion language model

Python 1,138 73 Updated Nov 21, 2025

ScaleRL Curve Fitting

Python 14 2 Updated Oct 13, 2025

Automatic Video Generation from Scientific Papers

Python 2,051 304 Updated Oct 20, 2025
Python 6 Updated Oct 22, 2025

Official Jax Implementation of MD4 Masked Diffusion Models

Python 151 15 Updated Feb 27, 2025

[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey

Python 350 3 Updated Nov 1, 2025

Contexts Optical Compression

Python 21,949 1,995 Updated Oct 25, 2025

Falcon is a continuously-evolving, high-quality benchmark for natural-language-to-SQL (Text2SQL) systems.

HTML 42 1 Updated Dec 22, 2025

A high-performance kernel library for LLM training

Python 57 6 Updated Oct 28, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 383 37 Updated Jan 7, 2026