-
ServiceNow AI Research
- Canada
-
04:18
(UTC -05:00) - https://ehsk.github.io
- @ehsk0
Lists (2)
Sort Name ascending (A-Z)
Stars
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Recipes to scale inference-time compute of open models
verl: Volcano Engine Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
A bibliography and survey of the papers surrounding o1
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…
A blazing fast inference solution for text embeddings models
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
🌎💪 BrowserGym, a Gym environment for web task automation
Firefly III: a personal finances manager
Easy and Efficient Quantization for Transformers
A Comprehensive Assessment of Trustworthiness in GPT Models
Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!
The hub for EleutherAI's work on interpretability and learning dynamics
utilities for decoding deep representations (like sentence embeddings) back to text
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.