-
Monash University
- Melbourne
- https://rmanluo.github.io/
- in/linhao-luo-36b489134
- https://scholar.google.com.au/citations?user=RO46HpcAAAAJ&hl=zh-CN
Highlights
- Pro
Lists (14)
Sort Name ascending (A-Z)
Starred repositories
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
repo for paper https://arxiv.org/abs/2504.13837
🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,30秒网页部署,1分…
chsrc 全平台通用换源工具与框架. Change Source everywhere for every software
Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"
This repo is reproduction resources for linear alignment paper, still working
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
Bypass MDM Setup for MacOS, up to MacOS Tahoe 26.
Agent benchmark for medical diagnosis
✨ Agentic IM ChatBot Infrastructure ✨ Integration with multiple IMs, easy-to-use plugin system, supports OpenAI, Gemini, Anthropic, Dify, Coze, built-in Knowledge Base, Agent. ✨ 一站式大模型聊天机器人平台及开发框架 …
Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
Official Repository of "Learning to Reason under Off-Policy Guidance"
A modern, responsive, and professional academic portfolio theme for researchers, built with Tailwind CSS, and DaisyUI.
⚡ Hugo Blox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
A suite of test scenarios for multi-agent reinforcement learning.
Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.
Biomni: a general-purpose biomedical AI agent
Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
A Headless Steam Docker image supporting NVIDIA GPU and accessible via Web UI
The absolute trainer to light up AI agents.
framework for detecting hallucinations in LLM chain-of-thought reasoning. Features synthetic data corruption, transformer-based classifiers, Streamlit UI, and FastAPI backend.