Lists (3)
Sort Name ascending (A-Z)
Stars
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
EunjuYang / nntrainer
Forked from nnstreamer/nntrainerNNtrainer is Software Framework for Training Neural Network Models on Devices.
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrβ¦
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.
SGLang is a fast serving framework for large language models and vision language models.
Paper list for Personal LLM Agents
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!
A minimalistic C++ Jinja templating engine for LLM chat templates
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
A library for efficient similarity search and clustering of dense vectors.
EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer
[EMNLP 2023] Adapting Language Models to Compress Long Contexts