RManLuo

😀

LOL

Linhao Luo RManLuo

😀

LOL

Research Fellow at Monash University | AI, LLMs, Graph

390 followers · 252 following

Monash University
Melbourne
https://rmanluo.github.io/
in/linhao-luo-36b489134
https://scholar.google.com.au/citations?user=RO46HpcAAAAJ&hl=zh-CN

Achievements

x2 x3

Achievements

x2 x3

Highlights

Lists (14)

Sort

Starred repositories

YangRui2015 / Generalizable-Reward-Model

Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

Python 44 4 Updated Feb 20, 2025

LeapLabTHU / limit-of-RLVR

repo for paper https://arxiv.org/abs/2504.13837

Python 268 14 Updated Jun 27, 2025

sansan0 / TrendRadar

🎯 告别信息过载，AI 助你看懂新闻资讯热点，简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台（抖音、知乎、B站、华尔街见闻、财联社等），智能筛选+自动推送+AI对话分析（用自然语言深度挖掘新闻：趋势追踪、情感分析、相似检索等13种工具）。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送，30秒网页部署，1分…

Python 31,565 17,208 Updated Nov 28, 2025

RubyMetric / chsrc

chsrc 全平台通用换源工具与框架. Change Source everywhere for every software

C 6,433 261 Updated Nov 25, 2025

zowiezhang / Amulet

Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"

Python 14 Updated Mar 18, 2025

Wizardcoast / Linear_Alignment

This repo is reproduction resources for linear alignment paper, still working

Python 17 2 Updated May 19, 2024

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,570 302 Updated Nov 13, 2025

QwenQKing / Prompt-R1

Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Python 29 2 Updated Nov 27, 2025

assafdori / bypass-mdm

Bypass MDM Setup for MacOS, up to MacOS Tahoe 26.

Shell 1,010 270 Updated Sep 16, 2025

SamuelSchmidgall / AgentClinic

Agent benchmark for medical diagnosis

Python 259 44 Updated Dec 31, 2024

AstrBotDevs / AstrBot

✨ Agentic IM ChatBot Infrastructure ✨ Integration with multiple IMs, easy-to-use plugin system, supports OpenAI, Gemini, Anthropic, Dify, Coze, built-in Knowledge Base, Agent. ✨ 一站式大模型聊天机器人平台及开发框架 …

Python 13,752 1,039 Updated Nov 28, 2025

HUST-AI-HYZ / MemoryAgentBench

Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 154 24 Updated Nov 19, 2025

Mirix-AI / MIRIX

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,288 314 Updated Nov 28, 2025

ZHZisZZ / modpo

[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Python 92 6 Updated Aug 20, 2024

srzer / MOD

Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".

Python 29 4 Updated Oct 30, 2024

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 379 45 Updated Oct 4, 2025

jiehua1995 / hexo-theme-researcher

A modern, responsive, and professional academic portfolio theme for researchers, built with Tailwind CSS, and DaisyUI.

EJS 25 7 Updated Nov 9, 2025

HugoBlox / hugo-blox-builder

⚡ Hugo Blox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇

HTML 9,093 2,961 Updated Nov 25, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 799 58 Updated Jul 31, 2025

ytgui / Search-R3

Reasoning-Reinforced Representation for Search

12 Updated Oct 9, 2025

google-deepmind / meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Python 760 146 Updated Nov 27, 2025

TencentCloudADP / youtu-embedding

Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 155 15 Updated Nov 14, 2025

snap-stanford / Biomni

Biomni: a general-purpose biomedical AI agent

Python 2,360 389 Updated Nov 24, 2025

TencentCloudADP / youtu-graphrag

Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.

Python 932 130 Updated Oct 30, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,228 107 Updated Oct 20, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,940 625 Updated Nov 27, 2025

IINemo / lm-polygraph

Python 393 54 Updated Nov 26, 2025

Steam-Headless / docker-steam-headless

A Headless Steam Docker image supporting NVIDIA GPU and accessible via Web UI

Shell 2,434 172 Updated Jun 23, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 8,992 717 Updated Nov 28, 2025

muhkartal / llm-outputVerifier

framework for detecting hallucinations in LLM chain-of-thought reasoning. Features synthetic data corruption, transformer-based classifiers, Streamlit UI, and FastAPI backend.

Python 2 Updated Oct 12, 2025