Skip to content
View pjunjie's full-sized avatar

Block or report pjunjie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Official Implementation of Ada-KV [NeurIPS 2025]

Python 114 5 Updated Nov 26, 2025

[ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models

Python 25 2 Updated Jul 7, 2025

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,280 160 Updated Jan 4, 2025

Xiao-Ming Wu's homepage: https://dravenalg.github.io/.

HTML 5 4 Updated Nov 24, 2025

This is the repository for [Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs](https://arxiv.org/html/2506.05410v1),presented at NeurIPS 2025.

Jupyter Notebook 2 Updated Nov 25, 2025

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".

Python 361 87 Updated Oct 29, 2022

Collection of awesome test-time (domain/batch/instance) adaptation methods

1,132 72 Updated Nov 14, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,017 85 Updated Nov 21, 2025
MATLAB 2 Updated Dec 19, 2024

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

2,087 143 Updated Oct 5, 2023

Latest Advances on Long Chain-of-Thought Reasoning

557 26 Updated Jul 18, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,343 1,327 Updated Nov 20, 2025

[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection

Python 148 10 Updated Feb 20, 2025

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 426 36 Updated Oct 23, 2025
Python 4 Updated Aug 28, 2025

LongBench v2 and LongBench (ACL 25'&24')

Python 1,027 110 Updated Jan 15, 2025

Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.

Python 157 14 Updated Apr 7, 2025

xKV: Cross-Layer SVD for KV-Cache Compression

Python 42 4 Updated Nov 16, 2025

A framework for few-shot evaluation of language models.

Python 10,758 2,874 Updated Nov 25, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

608 17 Updated Sep 30, 2025

本人的科研经验

7,963 457 Updated Aug 12, 2025

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 15,925 4,095 Updated Nov 25, 2025

General starter code for creative model architecture with huggingface transformer library.

Python 7 1 Updated Sep 29, 2025

This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.

251 8 Updated Jul 29, 2025

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

392 24 Updated Mar 3, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,015 31,219 Updated Nov 26, 2025

An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much easier using this template, and keep your freedom to edit a…

Jupyter Notebook 1,529 193 Updated Aug 6, 2023

TensorFlow code and pre-trained models for BERT

Python 39,688 9,712 Updated Jul 23, 2024

Crawl BookCorpus

Python 847 109 Updated Jul 14, 2023
Next