Skip to content
View anxuthu's full-sized avatar

Block or report anxuthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 555 33 Updated Oct 28, 2023

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 475 27 Updated Feb 5, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,958 860 Updated Dec 17, 2024

A multi-programming language benchmark for LLMs

Python 280 52 Updated Nov 16, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,224 4,035 Updated Jul 17, 2024
Python 1,495 113 Updated May 12, 2023

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,023 422 Updated Jan 17, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 55,038 7,337 Updated May 14, 2025

A playbook for systematically maximizing the performance of deep learning models.

29,413 2,403 Updated Jun 18, 2024
Python 608 77 Updated Aug 22, 2025

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,732 3,697 Updated Jun 2, 2023

An implementation of a deep learning recommendation model (DLRM)

Python 3,992 867 Updated Oct 2, 2025

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook 1,064 110 Updated Aug 9, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,744 4,638 Updated Nov 19, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,093 806 Updated Sep 30, 2025
Python 7,621 2,212 Updated Oct 23, 2025

Code repo for "Language Models with Transformers" paper

Python 22 13 Updated Sep 18, 2020
Python 45 8 Updated Oct 27, 2019

Transformer implementation in PyTorch.

Python 490 108 Updated Mar 7, 2019

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,840 245 Updated Nov 17, 2025

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python 3 1 Updated Jul 31, 2020

Fair Resource Allocation in Federated Learning (ICLR '20)

Python 250 60 Updated Dec 2, 2023

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python 73 18 Updated Sep 10, 2020

Synchronized Batch Normalization implementation in PyTorch.

Python 1,501 188 Updated Apr 8, 2021

Simple Hierarchical Count Sketch in Python

Python 21 9 Updated Jun 3, 2021

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python 78 20 Updated Aug 30, 2021

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python 10 1 Updated Aug 28, 2019

Code for Federated Learning with Matched Averaging, ICLR 2020.

Python 341 84 Updated Dec 5, 2021
Next