-
Intel
- Shanghai
Stars
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
Accessible large language models via k-bit quantization for PyTorch.
Running large language models on a single GPU for throughput-oriented scenarios.
QLoRA: Efficient Finetuning of Quantized LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Pretrain, finetune and serve LLMs on Intel platforms with Ray
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Universal LLM Deployment Engine with ML Compilation
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Ongoing research training transformer models at scale
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Making large AI models cheaper, faster and more accessible
A collection of libraries to optimise AI model performances
Development repository for the Triton language and compiler
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Code and documentation to train Stanford's Alpaca models, and generate the data.
Training and serving large-scale neural networks with auto parallelization.
An open-source ML pipeline development platform
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
Uniffle is a high performance, general purpose Remote Shuffle Service.
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Cloud Scale Platform for Distributed Analytics and AI
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
DAOS Storage Stack (client libraries, storage engine, control plane)