Skip to content
View 33zs's full-sized avatar
😃
Study
😃
Study

Block or report 33zs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ReviewEval: An Evaluation Framework for AI-Generated Reviews

Python 3 Updated Sep 6, 2025

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 58 Updated Oct 15, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 471 47 Updated Sep 11, 2025

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 194 24 Updated Sep 23, 2025

Official implementation of "Decoupling Continual Semantic Segmentation". Novel framework separating class-aware detection from class-agnostic segmentation for effective continual learning.

4 Updated Aug 21, 2025
Python 12 Updated Aug 10, 2025

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 195 7 Updated Aug 5, 2024

[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding

Python 230 8 Updated Nov 9, 2025

📚 A collection of papers about Referring Image Segmentation.

782 64 Updated Oct 27, 2025

Grid Adventure - Java RPG Game 🎮 Grid Adventure is a Java-based, console role-playing game (RPG) where players navigate a 2D grid, battle monsters, and strategize their way to victory.

Java 1 Updated Apr 17, 2025

👋 Hi there, I'm Zishan Xu

1 Updated Nov 6, 2025

🚇 Guangzhou Metro Route Planning System A route planning system for Guangzhou Metro Lines 1, 2, and 3, designed to compute the shortest route, minimum travel time, and least number of transfers bet…

HTML 1 Updated Jun 2, 2025

[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Python 58 3 Updated Nov 4, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,156 97 Updated Oct 20, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,390 185 Updated Nov 10, 2025

S2A-Attention for Multimodal 3D SemanticSegmentation Using LiDAR and Cameras inAutonomous Driving

Python 1 Updated Apr 29, 2025

Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activity

Jupyter Notebook 14 3 Updated May 1, 2025
Vue 4 1 Updated Aug 19, 2024

Flash Attention 2 pre-built wheels for Windows. Drop-in replacement for PyTorch attention providing up to 10x speedup and 20x memory reduction. Compatible with Python 3.10 and CUDA 11.7+. No build …

32 9 Updated Dec 1, 2024

A smart system for real-time knee health monitoring, featuring a HarmonyOS app for visualized data, doctor-patient communication, and personalized joint care.

Java 2 Updated Apr 18, 2025

An AI-powered intelligent learning platform that delivers personalized questioning, automated grading, adaptive learning paths, and community-driven support for enhanced student engagement and unde…

Java 2 Updated Apr 18, 2025

Self-Alignment with Principle-Following Reward Models

Python 169 13 Updated Sep 18, 2025

A Survey of Direct Preference Optimization (DPO)

81 Updated Jul 4, 2025

文本情感分析

Jupyter Notebook 866 225 Updated Dec 30, 2017