View xijia-tao's full-sized avatar
🌈
Coding



🌎💪 BrowserGym, a Gym environment for web task automation

Python 928 130 Updated Oct 16, 2025

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 118 10 Updated Sep 26, 2025

[NeurIPS 2025🔥] Main source code of the SRPO framework.

Python 173 18 Updated Sep 21, 2025

ScaleCUA: open-source computer use agents that can operate in cross-platform environments (Windows, macOS, Ubuntu, Android).

Python 2 Updated Sep 19, 2025

A version of verl to support diverse tool use

Python 606 43 Updated Oct 17, 2025

Think Beyond Images

Python 501 30 Updated Sep 23, 2025

My learning notes and code for ML SYS.

Python 3,888 234 Updated Oct 6, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,104 1,209 Updated Oct 11, 2025
Python 4 Updated Aug 11, 2025
Python 67 2 Updated Sep 26, 2025

Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas

Python 81 7 Updated Sep 13, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,166 109 Updated Aug 16, 2025

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 748 47 Updated Jul 9, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 288 64 Updated Oct 7, 2025

Scaling RL on advanced reasoning models

Python 612 38 Updated Aug 12, 2025

[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs

Python 479 31 Updated Jan 23, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,918 245 Updated Jul 7, 2025

Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning

Python 33 Updated Oct 15, 2025

[ICML 2025🔥] ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

Python 25 Updated Jun 16, 2025

Study Healthily to Age 150: An Incomplete Guide to Tuning the Human Body System

20,656 1,460 Updated Sep 10, 2025

Learning Safety Constraints for Large Language Models (ICML 2025)

Python 23 4 Updated Aug 4, 2025

MiMo-VL

570 27 Updated Aug 21, 2025

Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"

Python 111 9 Updated Aug 28, 2025

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

TypeScript 117 4 Updated Jun 18, 2025

Citysub: Controlling Bounds

MoonBit 7 Updated Apr 26, 2025

Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)

Python 73 7 Updated Mar 16, 2025