Skip to content
View ehsk's full-sized avatar

Organizations

@castorini @beir-cellar @project-miracl

Block or report ehsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 9 3 Updated Jul 10, 2025

Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"

Python 335 26 Updated Nov 13, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,307 112 Updated Jan 16, 2026

The personal finance app for everyone

Ruby 53,944 5,123 Updated Jul 24, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 347 34 Updated Jan 18, 2026

Recipes to scale inference-time compute of open models

Python 1,125 131 Updated May 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,476 3,061 Updated Jan 19, 2026

Simple RL training for reasoning

Python 3,826 283 Updated Dec 23, 2025

Fully open reproduction of DeepSeek-R1

Python 25,829 2,411 Updated Nov 24, 2025
Python 1,074 49 Updated Jan 10, 2026

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 68 10 Updated Apr 26, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

The MATH Dataset (NeurIPS 2021)

Python 1,288 111 Updated Sep 6, 2025

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python 302 37 Updated Dec 16, 2025

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 184,073 26,298 Updated Jan 19, 2026

The official Meta Llama 3 GitHub site

Python 29,181 3,506 Updated Jan 26, 2025

A blazing fast inference solution for text embeddings models

Rust 4,399 345 Updated Jan 13, 2026

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 229 32 Updated Jan 14, 2026

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,082 143 Updated Jan 16, 2026

Code for Contrastive Preference Learning (CPL)

Python 178 15 Updated Nov 22, 2024

Firefly III: a personal finances manager

PHP 22,091 2,029 Updated Jan 19, 2026

Home of StarCoder2!

Python 2,030 194 Updated Mar 21, 2024

Easy and Efficient Quantization for Transformers

C++ 202 16 Updated Jun 24, 2025

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 311 61 Updated Sep 16, 2024

Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!

Python 11 1 Updated Jun 12, 2023

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,713 204 Updated Nov 15, 2025
Python 39 7 Updated Mar 29, 2024

utilities for decoding deep representations (like sentence embeddings) back to text

Python 1,047 114 Updated Dec 27, 2025

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 571 81 Updated Dec 26, 2025
Next