Skip to content
View cmj2002's full-sized avatar

Highlights

  • Pro

Block or report cmj2002

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cmj2002/README.md

Cao Mingjun (Mingjun Cao, 曹明隽)

我是南京大学人工智能学院的硕士研究生,由章宗长教授指导。我也是周志华教授领导的 LAMDA 实验室的成员。

我于2024年6月在南京大学人工智能学院获得了学士学位。同年,我被免试录取攻读南京大学硕士学位。

我主要研究兴趣集中在强化学习,特别关注离线强化学习和通过监督学习进行的强化学习。除了学术之外,我也是开源和自主托管应用的爱好者。


I'm a M.Sc. student at Nanjing University, studying at the School of Artificial Intelligence under the guidance of Prof. Zongzhang Zhang as a member of the LAMDA Group led by Professor Zhi-Hua Zhou.

I got my B.Sc. degree in School of Artificial Intelligence from Nanjing University in June 2024. In the same year, I was admitted to study for a M.SC. degree in Nanjing University without entrance examination.

My primary research interest lies in Reinforcement Learning, particularly focused on offline reinforcement learning and reinforcement learning through supervised learning. Apart from my academic pursuits, I'm also a big fan of open-source and self-hosted applications.


博客 (blog): blog.caomingjun.com

邮箱 (email): [email protected], [email protected]

Pinned Loading

  1. warp-docker warp-docker Public

    Run Cloudflare WARP in Docker.

    Shell 709 191

  2. r2-dir-list r2-dir-list Public

    Directory Listing for Cloudflare R2

    TypeScript 53 19

  3. typoverflow/flow-rl typoverflow/flow-rl Public

    Flow RL is a high-performance RL library with flow and diffusion models.

    Python 18 2