imoneoi

🎯

Tuning PPO

One imoneoi

🎯

Tuning PPO

Professional RL(HF) hyperparameter tuner

535 followers · 0 following

http://imone.me

Achievements

x4 x2

Achievements

x4 x2

Organizations

Lists (1)

Sort

🔮 Future ideas

Stars

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,033 459 Updated Jan 12, 2026

sapientinc / HRM

Hierarchical Reasoning Model Official Release

Python 12,235 1,780 Updated Sep 9, 2025

google-deepmind / mctx

Monte Carlo tree search in JAX

Python 2,578 209 Updated Sep 2, 2025

imoneoi / adam-atan2

Fused Adam-atan2 implementation

Python 5 6 Updated Apr 2, 2025

Sea-Snell / grokking

unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Python 81 16 Updated Jul 4, 2022

easimon / maximize-build-space

Github action to maximize the available disk space on Github runners

502 99 Updated Mar 28, 2025

imoneoi / bf16_fused_adam

BFloat16 Fused Adam Operator for PyTorch

Python 16 1 Updated Nov 16, 2024

imoneoi / RSP_JAX

[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?

Python 14 4 Updated Dec 10, 2024

ruixiangcui / AGIEval

Python 771 53 Updated Jun 13, 2024

AmericanPresidentJimmyCarter / test-torch-bfloat16-vit-training

Python 11 1 Updated Apr 4, 2024

xai-org / grok-1

Grok open release

Python 50,568 8,367 Updated Aug 30, 2024

edornd / argdantic

Typed command line interfaces with argparse and pydantic

Python 49 4 Updated Jan 10, 2025

openchatdev / cublas_sm90_grouped_gemm

Forked from tgale96/grouped_gemm

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Codium-ai / AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,914 300 Updated Nov 25, 2024

foldl / chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 763 53 Updated Jan 10, 2026

SupImDos / pydantic-argparse

Typed Argument Parsing with Pydantic

Python 135 22 Updated Nov 24, 2025

imoneoi / cutlass_grouped_gemm

Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 7 1 Updated Dec 27, 2023

NVIDIA / open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

C 16,612 1,562 Updated Dec 18, 2025

microsoft / Table-Pretraining

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 299 39 Updated Feb 6, 2023

sail-sg / symbolic-instruction-tuning

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 66 3 Updated Apr 18, 2023

Sanster / padding_free_llm_train

Python 16 2 Updated Feb 6, 2024

OpenOrca / FLAN_OO2

Forked from google-research/FLAN

Python 1 Updated Nov 22, 2023

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,849 163 Updated May 9, 2023

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,727 7,098 Updated Jan 12, 2026

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,466 434 Updated Sep 13, 2024

imoneoi / mistral-tokenizer

JavaScript 21 2 Updated Apr 1, 2024

imoneoi / EvolvingConnectivity

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 23 4 Updated Oct 23, 2023

SciPhi-AI / synthesizer

A multi-purpose LLM framework for RAG and data creation.

Python 630 51 Updated Jan 13, 2024

h44z / wg-portal

WireGuard Configuration Portal with LDAP connection

Go 1,518 160 Updated Jan 5, 2026

databricks / lilac

Curate better data for LLMs

Python 1,065 103 Updated Mar 19, 2024