Skip to content
View EunjuYang's full-sized avatar
πŸ‘
:)
πŸ‘
:)

Block or report EunjuYang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai

C 88 13 Updated Oct 17, 2025

NNtrainer is Software Framework for Training Neural Network Models on Devices.

C++ 1 Updated Oct 15, 2025

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 125 8 Updated Aug 13, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,393 1,381 Updated Oct 14, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,920 245 Updated Jul 7, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 350 11 Updated Jul 11, 2025

[Fully open] [Encoder-free MLLM] Vision as LoRA

Python 340 28 Updated Jun 12, 2025

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 862 63 Updated Sep 26, 2025

NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,060 787 Updated Oct 13, 2025

Code release for DynamicTanh (DyT)

Python 1,020 85 Updated Mar 30, 2025
Python 25 7 Updated May 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,052 3,087 Updated Oct 18, 2025
C++ 314 91 Updated Jul 19, 2025

Paper list for Personal LLM Agents

412 22 Updated May 8, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 16,581 3,917 Updated Jul 21, 2025

A Self-adaptation FrameworkπŸ™ that adapts LLMs for unseen tasks in real-time!

Python 1,154 134 Updated Jan 30, 2025

A minimalistic C++ Jinja templating engine for LLM chat templates

C++ 190 23 Updated Sep 22, 2025

[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Python 105 12 Updated May 24, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,360 107 Updated Feb 19, 2025

Fast Multimodal LLM on Mobile Devices

C++ 1,117 136 Updated Oct 18, 2025

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Cuda 73 5 Updated Jul 14, 2024

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,048 1,602 Updated Oct 17, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,263 82 Updated Jul 14, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 37,541 4,069 Updated Oct 18, 2025

The repo for In-context Autoencoder

Jupyter Notebook 145 19 Updated May 11, 2024

EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer

Python 11 1 Updated Mar 23, 2025

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 313 26 Updated Sep 9, 2024
Next