anxuthu

5 followers · 14 following

Stars

OpenLemur / Lemur

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 555 33 Updated Oct 28, 2023

bigcode-project / octopack

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 475 27 Updated Feb 5, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,958 860 Updated Dec 17, 2024

nuprl / MultiPL-E

A multi-programming language benchmark for LLMs

Python 280 52 Updated Nov 16, 2025

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,224 4,035 Updated Jul 17, 2024

sahil280114 / codealpaca

Python 1,495 113 Updated May 12, 2023

openai / human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,023 422 Updated Jan 17, 2025

AntonOsika / gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 55,038 7,337 Updated May 14, 2025

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

29,413 2,403 Updated Jun 18, 2024

google-research / sam

Python 608 77 Updated Aug 22, 2025

Qualcomm-AI-research / oscillations-qat

Python 78 13 Updated Jul 21, 2022

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,732 3,697 Updated Jun 2, 2023

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python 3,992 867 Updated Oct 2, 2025

juntang-zhuang / Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook 1,064 110 Updated Aug 9, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,744 4,638 Updated Nov 19, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,093 806 Updated Sep 30, 2025

MIC-DKFZ / nnUNet

Python 7,621 2,212 Updated Oct 23, 2025

cgraywang / gluon-nlp-1

Forked from dmlc/gluon-nlp

Code repo for "Language Models with Transformers" paper

Python 22 13 Updated Sep 18, 2020

briancheung / superposition

Python 45 8 Updated Oct 27, 2019

tunz / transformer-pytorch

Transformer implementation in PyTorch.

Python 490 108 Updated Mar 7, 2019

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,840 245 Updated Nov 17, 2025

uuujf / IterAvg

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python 3 1 Updated Jul 31, 2020

litian96 / fair_flearn

Fair Resource Allocation in Federated Learning (ICLR '20)

Python 250 60 Updated Dec 2, 2023

epfml / ChocoSGD

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python 73 18 Updated Sep 10, 2020

cybertronai / autograd-hacks

Python 157 32 Updated Jun 8, 2022

vacancy / Synchronized-BatchNorm-PyTorch

Synchronized Batch Normalization implementation in PyTorch.

Python 1,501 188 Updated Apr 8, 2021

nikitaivkin / csh

Simple Hierarchical Count Sketch in Python

Python 21 9 Updated Jun 3, 2021

kiddyboots216 / CommEfficient

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python 78 20 Updated Aug 30, 2021

slowbull / FeaturesReplay

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python 10 1 Updated Aug 28, 2019

IBM / FedMA

Code for Federated Learning with Matched Averaging, ICLR 2020.

Python 341 84 Updated Dec 5, 2021
