Skip to content
View jyakaranda's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report jyakaranda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling

Python 42 16 Updated Oct 30, 2025

A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.

C++ 115 21 Updated Nov 1, 2025

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Pape…

Jupyter Notebook 772 182 Updated Jan 22, 2019

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,329 5,811 Updated Aug 14, 2024

A Gym for Agentic LLMs

Python 346 20 Updated Oct 30, 2025

The best ChatGPT that $100 can buy.

Python 35,011 3,968 Updated Nov 1, 2025

Deep Reinforcement Learning

4,276 650 Updated Dec 10, 2022

Wife approved HomeOps driven by Kubernetes and GitOps using Flux

YAML 2,583 211 Updated Nov 1, 2025

My GitOps-managed home Kubernetes cluster... and more! ⛵

Just 88 Updated Nov 2, 2025

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 288 17 Updated Aug 24, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,003 365 Updated Oct 31, 2025

Lightweight coding agent that runs in your terminal

Rust 49,539 6,086 Updated Nov 2, 2025

A C library for creating Excel XLSX files.

C 1,687 368 Updated Oct 31, 2025

GPU documentation for humans

Python 355 45 Updated Oct 3, 2025

💫 Toolkit to help you get started with Spec-Driven Development

Shell 44,221 3,777 Updated Oct 23, 2025

DELT: Data Efficacy for Language Model Training

Python 40 4 Updated Aug 31, 2025

Nano vLLM

Python 7,397 949 Updated Aug 31, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …

Python 154 23 Updated Nov 1, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,332 237 Updated Nov 2, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,799 936 Updated Nov 2, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,849 1,018 Updated Sep 24, 2025

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 940 260 Updated Oct 31, 2025

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 7,821 606 Updated Oct 22, 2025

Open source software for autonomous drones.

C++ 2,944 448 Updated Jul 16, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 12,641 1,197 Updated Oct 28, 2025

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 13,225 1,752 Updated Oct 15, 2025

TARE Exploration Planner for Ground Vehicles

C++ 601 119 Updated Jul 10, 2024

An elegant PyTorch deep reinforcement learning library.

Python 8,886 1,187 Updated Oct 29, 2025

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,203 117 Updated Aug 12, 2024

3D Visualization of an GPT-style LLM

TypeScript 5,101 588 Updated Aug 24, 2024
Next