imoneoi
🎯 Tuning PPO

Organizations

@OpenOrca @FastEval

Adams Optimizer for Stable and Scalable Training

Python 8 Updated Sep 13, 2025

Efficient Triton Kernels for LLM Training

Python 5,781 421 Updated Oct 28, 2025

Hierarchical Reasoning Model Official Release

Python 11,591 1,689 Updated Sep 9, 2025

Monte Carlo tree search in JAX

Python 2,547 209 Updated Sep 2, 2025

Fused Adam-atan2 implementation

Python 5 5 Updated Apr 2, 2025

Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Python 79 16 Updated Jul 4, 2022

GitHub action to maximize the available disk space on GitHub runners

478 97 Updated Mar 28, 2025

BFloat16 Fused Adam Operator for PyTorch

Python 16 Updated Nov 16, 2024

[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?

Python 13 4 Updated Dec 10, 2024
Python 767 52 Updated Jun 13, 2024

Grok open release

Python 50,540 8,370 Updated Aug 30, 2024

Typed command line interfaces with argparse and pydantic

Python 47 4 Updated Jan 10, 2025

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Python 3,897 302 Updated Nov 25, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 729 53 Updated Oct 29, 2025

Typed Argument Parsing with Pydantic

Python 132 21 Updated Oct 13, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 7 1 Updated Dec 27, 2023

NVIDIA Linux open GPU kernel module source

C 16,302 1,512 Updated Oct 27, 2025

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 297 38 Updated Feb 6, 2023

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 66 3 Updated Apr 18, 2023
Python 1 Updated Nov 22, 2023

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,795 157 Updated May 9, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,590 6,841 Updated Oct 30, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 428 Updated Sep 13, 2024
JavaScript 21 2 Updated Apr 1, 2024

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 23 4 Updated Oct 23, 2023

A multi-purpose LLM framework for RAG and data creation.

Python 626 53 Updated Jan 13, 2024

WireGuard Configuration Portal with LDAP connection

Go 1,331 152 Updated Oct 27, 2025