Lincyaw
🎯 Focusing

Highlights

  • Pro

Organizations

@h1trust


🏡 GitHub Pages template for personal academic homepage

HTML 451 254 Updated Oct 31, 2025

(Minimalism Style) Powered by Jekyll, based on the Minimal Mistakes theme and Jason Ansel's website

CSS 737 870 Updated Nov 5, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,047 821 Updated Sep 4, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,018 1,296 Updated Nov 9, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,080 429 Updated Nov 5, 2025

A simple yet powerful agent framework that delivers with open-source models

Python 3,779 369 Updated Nov 6, 2025

⚡ Hugo Blox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇

HTML 9,058 2,956 Updated Nov 8, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,691 441 Updated Nov 9, 2025

🏆 ICML 2025 Spotlight

Python 329 18 Updated Jul 14, 2025

[NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)

Python 30 2 Updated Aug 8, 2025

🎉 A Vue.js 3 UI Library made by Element team

TypeScript 26,771 19,493 Updated Nov 9, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,170 163 Updated Nov 9, 2025
Python 376 30 Updated Oct 16, 2025
Python 309 15 Updated May 24, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,391 186 Updated Nov 7, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,257 11,567 Updated Nov 9, 2025

Script for switching GNU/Linux system software mirrors, plus a script for installing Docker and switching its registry mirrors

Shell 6,232 596 Updated Nov 3, 2025

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Python 196 23 Updated Oct 21, 2025
Jupyter Notebook 15 3 Updated Mar 31, 2022

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 872 54 Updated Jul 22, 2025

JS snippet to send codeblock contents as a query string

HTML 47 4 Updated Jun 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,263 2,455 Updated Nov 9, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,241 76 Updated May 16, 2025

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 652 45 Updated Aug 5, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 2,967 226 Updated Nov 9, 2025

Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"

Python 79 11 Updated Nov 4, 2025

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 66,995 5,220 Updated Nov 7, 2025

A collection of MCP servers.

74,540 6,241 Updated Nov 9, 2025

Writing AI Conference Papers: A Handbook for Beginners

2,972 104 Updated Jul 16, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,344 809 Updated Nov 9, 2025