Highlights

  • Pro

Organizations

@brownsys


ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 286 37 Updated Oct 27, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,649 30,950 Updated Oct 25, 2025

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 232 25 Updated Oct 20, 2025
Jupyter Notebook 79 19 Updated Mar 8, 2025

Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines

Python 195 12 Updated May 6, 2024

Machine Learning Engineering Open Book

Python 15,526 943 Updated Oct 21, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 19,910 2,077 Updated Oct 24, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,071 187 Updated Jun 30, 2025

Pretrained language model with 100B parameters

Python 3,754 297 Updated Jul 10, 2023

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 3,153 632 Updated Sep 15, 2025

Code release for SLIP: Self-supervision meets Language-Image Pre-training

Python 782 72 Updated Feb 9, 2023

Azure HPC/AI VM Images

Shell 119 93 Updated Oct 23, 2025

Library for 8-bit optimizers and quantization routines.

780 48 Updated Aug 18, 2022

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,422 228 Updated Mar 20, 2024

Distribution transparent Machine Learning experiments on Apache Spark

Python 91 14 Updated Feb 21, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,322 1,088 Updated Sep 26, 2025

Guide: Fine-tune GPT2-XL (1.5 billion parameters) and GPT-Neo (2.7 billion parameters) on a single GPU with Hugging Face Transformers using DeepSpeed

Python 436 74 Updated Jun 14, 2023

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Python 481 92 Updated Oct 23, 2024

RDMA and SHARP plugins for the NCCL library

C 211 38 Updated Oct 21, 2025

Example models using DeepSpeed

Python 6,698 1,108 Updated Oct 15, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,510 4,591 Updated Oct 27, 2025

A minimal & modern LaTeX template for your (bachelor's | master's | doctoral) thesis

TeX 1,207 136 Updated Nov 16, 2023

Find the smallest number of switches necessary to build topologies of a given number of hosts and bisection bandwidth for the EGFT, HyperX, and Jellyfish topologies.

Python 2 Updated Jul 24, 2013