XinDongol

Starred repositories

This project explores transformer improvements through Super-Transformers (scalable softmax, positional encoding ablation) and MiniDeepSeek (multi-latent attention, KV cache, weight absorption) to …

Python 2 Updated Mar 17, 2025

Unofficial Scalable-Softmax Is Superior for Attention

Python 20 Updated May 30, 2025

Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression

Python 266 3 Updated Dec 3, 2025

Unofficial Implementation of Selective Attention Transformer

Python 20 1 Updated Oct 31, 2024

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 482 30 Updated Mar 19, 2024

Implementation of "Efficient Training of Language Models to Fill in the Middle"

Python 4 Updated Jan 30, 2025

A Reproduction of GDM's Nested Learning Paper

Python 525 75 Updated Dec 3, 2025
Python 18 Updated Jul 31, 2025

Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…

Python 81 3 Updated Dec 27, 2025

A community driven list of open source alternatives to proprietary software and applications.

TypeScript 5,190 226 Updated Nov 24, 2025

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 178 23 Updated Dec 19, 2025

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Python 13 1 Updated Nov 11, 2025

CUDA Python: Performance meets Productivity

Cython 3,115 235 Updated Dec 24, 2025

🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.

C++ 8,419 820 Updated Jan 1, 2026

Official repository for Adaptive Parallel Decoding (APD).

Python 17 Updated Oct 27, 2025

(best/better) practices of megatron on veRL and tuning guide

Shell 113 8 Updated Sep 26, 2025

A Google Apps Script for syncing ICS/ICAL files faster than the current Google Calendar speed

JavaScript 1,812 236 Updated Dec 14, 2025

Generate a timeline of your day, automatically

Swift 5,243 261 Updated Jan 1, 2026

Practical productivity tools for Claude Code, Codex-CLI, and similar CLI coding agents.

Python 893 66 Updated Jan 1, 2026

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 127 4 Updated Jun 24, 2025

Render any git repo into a single static HTML page for humans or LLMs

Python 1,986 196 Updated Aug 21, 2025

This is a plugin for Visual Studio Code that highlights selected words. It's very useful when you are reading code.

TypeScript 8 2 Updated Apr 26, 2021

The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI

TypeScript 23,616 1,745 Updated Dec 29, 2025

Learning to Keep a Promise

2 Updated Apr 30, 2025
Python 2 Updated Apr 15, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,750 271 Updated Jul 18, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,359 368 Updated Jan 1, 2026

Official Repo for Open-Reasoner-Zero

Python 2,085 119 Updated Jun 2, 2025

Production-ready platform for agentic workflow development.

Python 124,316 19,328 Updated Jan 1, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,960 2,935 Updated Jan 1, 2026