Skip to content
View jxhe's full-sized avatar

Organizations

@asyml

Block or report jxhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 2,826 364 Updated Nov 12, 2025

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 133 9 Updated Nov 12, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

1,653 115 Updated Nov 7, 2025

Contexts Optical Compression

Python 20,321 1,601 Updated Oct 25, 2025

[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Python 81 6 Updated Jul 31, 2025

Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

Python 20 1 Updated Oct 8, 2025

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Java 2,052 184 Updated Mar 18, 2024

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 350 44 Updated Nov 12, 2025

A Gym for Agentic LLMs

Python 356 21 Updated Nov 10, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,455 248 Updated Nov 13, 2025

A clean, modular SDK for building AI agents with OpenHands V1.

Python 153 52 Updated Nov 12, 2025

The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"

Python 87 1 Updated Sep 29, 2025

The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".

Python 15 Updated Sep 3, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,058 381 Updated Nov 12, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 82,289 9,219 Updated Nov 13, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,996 225 Updated Nov 5, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,178 1,918 Updated Nov 1, 2025

LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL! 👐

11,800 2,376 Updated Nov 6, 2025

Connect APIs, remarkably fast. Free for developers.

JavaScript 10,792 5,534 Updated Nov 13, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 72,624 2,221 Updated Nov 12, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 20,026 2,750 Updated Nov 12, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,311 638 Updated Nov 7, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,709 442 Updated Nov 12, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,116 37 Updated Oct 4, 2025

An agent benchmark with tasks in a simulated software company.

Python 580 94 Updated Oct 12, 2025

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,542 1,270 Updated Nov 10, 2025
Python 116 16 Updated Oct 16, 2025
Next