Skip to content
View Dogacel's full-sized avatar
🚀
To the moon.
🚀
To the moon.
  • Evanston, IL
  • 13:11 (UTC -06:00)

Highlights

  • Pro

Block or report Dogacel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Tinymist [ˈtaɪni mɪst] is an integrated language service for Typst [taɪpst].

Rust 2,711 116 Updated Jan 7, 2026

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,933 294 Updated Aug 9, 2025

Block Diffusion for Ultra-Fast Speculative Decoding

Python 181 7 Updated Jan 5, 2026

Run Claude Code in Obsidian

JavaScript 77 3 Updated Jan 6, 2026

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,737 562 Updated Dec 18, 2025

Open-source framework for the research and development of foundation models.

HTML 704 70 Updated Jan 7, 2026

Package management made easy

Rust 6,047 398 Updated Jan 7, 2026

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model

956 36 Updated Dec 22, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,858 2,158 Updated Jan 7, 2026

All the Mods 10

JavaScript 326 173 Updated Dec 12, 2025

All the mods skyblock pack for 1.21.1 NeoForge

JavaScript 71 54 Updated Dec 15, 2025

🐹 Deep clean and optimize your Mac.

Shell 26,164 698 Updated Jan 6, 2026

🔥 A minimal training framework for scaling FLA models

Python 333 50 Updated Nov 15, 2025

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

Jupyter Notebook 48 2 Updated Jun 17, 2025

A paper list for spatial reasoning

593 32 Updated Dec 24, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,253 757 Updated Jan 5, 2026

Specification and documentation for Agent Skills

Python 4,696 240 Updated Jan 6, 2026
Rust 2 Updated Dec 21, 2025

Making large AI models cheaper, faster and more accessible

Python 41,315 4,545 Updated Dec 22, 2025
TypeScript 9,380 660 Updated Jan 7, 2026

Measure and optimize the energy consumption of your AI applications!

Python 326 40 Updated Jan 4, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,568 1,994 Updated Jan 7, 2026

This repository is the official implementation of "Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE"

Python 36 3 Updated Oct 5, 2025

A terminal assistant that allows you to ask an LLM to run commands.

Python 8 Updated Jan 7, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 614 130 Updated Jan 7, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,227 3,983 Updated Jan 7, 2026

3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding

Python 82 14 Updated Jul 3, 2025

Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton

Python 39 1 Updated Feb 13, 2025

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 349 45 Updated Apr 22, 2025
Next