🎯
Tuning PPO

Organizations

@OpenOrca @FastEval

Efficient Triton Kernels for LLM Training

Python 5,886 439 Updated Nov 28, 2025

Hierarchical Reasoning Model Official Release

Python 11,818 1,725 Updated Sep 9, 2025

Monte Carlo tree search in JAX

Python 2,566 210 Updated Sep 2, 2025

Fused Adam-atan2 implementation

Python 5 5 Updated Apr 2, 2025

Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Python 79 16 Updated Jul 4, 2022

GitHub Action to maximize the available disk space on GitHub runners

488 100 Updated Mar 28, 2025

BFloat16 Fused Adam Operator for PyTorch

Python 16 Updated Nov 16, 2024

[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?

Python 13 4 Updated Dec 10, 2024
Python 768 52 Updated Jun 13, 2024

Grok open release

Python 50,576 8,380 Updated Aug 30, 2024

Typed command line interfaces with argparse and pydantic

Python 48 4 Updated Jan 10, 2025

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Python 3,904 300 Updated Nov 25, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 749 54 Updated Nov 26, 2025

Typed Argument Parsing with Pydantic

Python 133 21 Updated Nov 24, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 7 1 Updated Dec 27, 2023

NVIDIA Linux open GPU kernel module source

C 16,400 1,531 Updated Nov 21, 2025

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 299 39 Updated Feb 6, 2023

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 66 3 Updated Apr 18, 2023
Python 1 Updated Nov 22, 2023

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,819 158 Updated May 9, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,057 6,955 Updated Nov 29, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,444 428 Updated Sep 13, 2024
JavaScript 21 2 Updated Apr 1, 2024

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 23 4 Updated Oct 23, 2023

A multi-purpose LLM framework for RAG and data creation.

Python 628 51 Updated Jan 13, 2024

WireGuard Configuration Portal with LDAP connection

Go 1,456 162 Updated Nov 24, 2025

Curate better data for LLMs

Python 1,061 102 Updated Mar 19, 2024