
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,448 1,387 Updated Oct 14, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,738 793 Updated Nov 4, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,377 583 Updated Oct 28, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,741 865 Updated Jun 10, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,227 7,530 Updated Nov 10, 2025

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 131 36 Updated Sep 23, 2025

Inference code for CodeLlama models

Python 16,358 1,936 Updated Aug 12, 2024

Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning

Python 308 29 Updated Oct 24, 2024

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,457 733 Updated Jun 7, 2025

Inference code for Llama models

Python 58,914 9,813 Updated Jan 26, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,588 1,852 Updated Nov 4, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,884 8,300 Updated May 27, 2025

Ongoing research training transformer models at scale

Python 14,155 3,265 Updated Nov 11, 2025

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,006 102 Updated Jul 29, 2024

Making large AI models cheaper, faster and more accessible

Python 41,235 4,538 Updated Nov 11, 2025

A collection of libraries to optimise AI model performance

Python 8,368 632 Updated Jul 22, 2024

Development repository for the Triton language and compiler

MLIR 17,525 2,379 Updated Nov 11, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,248 4,777 Updated Jun 2, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,479 3,126 Updated Nov 11, 2025

Paragraph-by-paragraph close readings of classic and new deep learning papers

31,899 2,741 Updated Mar 22, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python 8,933 838 Updated Nov 11, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,213 4,034 Updated Jul 17, 2024

Training and serving large-scale neural networks with auto parallelization.

Python 3,163 353 Updated Dec 9, 2023

An open-source ML pipeline development platform

Python 996 60 Updated Jan 9, 2025

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Python 350 77 Updated Nov 5, 2025

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 429 161 Updated Nov 11, 2025

The Auron accelerator for distributed computing frameworks (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,640 190 Updated Nov 11, 2025

Cloud Scale Platform for Distributed Analytics and AI

Python 24 5 Updated Oct 12, 2023

A cross-platform way to express data transformations, relational algebra, standardized record expressions, and plans.

Python 1,423 187 Updated Nov 10, 2025

DAOS Storage Stack (client libraries, storage engine, control plane)

C 892 333 Updated Nov 11, 2025