Skip to content
View RManLuo's full-sized avatar
😀
LOL
😀
LOL

Highlights

  • Pro

Block or report RManLuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

Python 44 4 Updated Feb 20, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 268 14 Updated Jun 27, 2025

🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,30秒网页部署,1分…

Python 31,565 17,208 Updated Nov 28, 2025

chsrc 全平台通用换源工具与框架. Change Source everywhere for every software

C 6,433 261 Updated Nov 25, 2025

Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"

Python 14 Updated Mar 18, 2025

This repo is reproduction resources for linear alignment paper, still working

Python 17 2 Updated May 19, 2024

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,570 302 Updated Nov 13, 2025

Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Python 29 2 Updated Nov 27, 2025

Bypass MDM Setup for MacOS, up to MacOS Tahoe 26.

Shell 1,010 270 Updated Sep 16, 2025

Agent benchmark for medical diagnosis

Python 259 44 Updated Dec 31, 2024

✨ Agentic IM ChatBot Infrastructure ✨ Integration with multiple IMs, easy-to-use plugin system, supports OpenAI, Gemini, Anthropic, Dify, Coze, built-in Knowledge Base, Agent. ✨ 一站式大模型聊天机器人平台及开发框架 …

Python 13,752 1,039 Updated Nov 28, 2025

Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 154 24 Updated Nov 19, 2025

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,288 314 Updated Nov 28, 2025

[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Python 92 6 Updated Aug 20, 2024

Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".

Python 29 4 Updated Oct 30, 2024

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 379 45 Updated Oct 4, 2025

A modern, responsive, and professional academic portfolio theme for researchers, built with Tailwind CSS, and DaisyUI.

EJS 25 7 Updated Nov 9, 2025

⚡ Hugo Blox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇

HTML 9,093 2,961 Updated Nov 25, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 799 58 Updated Jul 31, 2025

Reasoning-Reinforced Representation for Search

12 Updated Oct 9, 2025

A suite of test scenarios for multi-agent reinforcement learning.

Python 760 146 Updated Nov 27, 2025

Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 155 15 Updated Nov 14, 2025

Biomni: a general-purpose biomedical AI agent

Python 2,360 389 Updated Nov 24, 2025

Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.

Python 932 130 Updated Oct 30, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,228 107 Updated Oct 20, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,940 625 Updated Nov 27, 2025
Python 393 54 Updated Nov 26, 2025

A Headless Steam Docker image supporting NVIDIA GPU and accessible via Web UI

Shell 2,434 172 Updated Jun 23, 2025

The absolute trainer to light up AI agents.

Python 8,992 717 Updated Nov 28, 2025

framework for detecting hallucinations in LLM chain-of-thought reasoning. Features synthetic data corruption, transformer-based classifiers, Streamlit UI, and FastAPI backend.

Python 2 Updated Oct 12, 2025
Next