Skip to content
View lauyikfung's full-sized avatar

Highlights

  • Pro

Block or report lauyikfung

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lauyikfung/README.md

Hi there! 👋 I am a first-year Ph.D. student of UCLA following Prof. Quanquan Gu, mainly researching in optimization and architecture of Large Language Models. Previously I was a Yao Class Student in Tsinghua University advised by Prof. Zhilin Yang.

See my:

My Projects

  • RPG: On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning: Paper, github
  • Kimi K1.5: Scaling Reinforcement Learning with LLMs: Paper
  • TPA: Tensor Product Attention Is All You Need: Paper, github
  • MARS: Unleashing the Power of Variance Reduction for Training Large Models: Paper, github
  • T-Rex: Text-assisted Retrosynthesis Prediction: Paper, github
  • Capricorn: Enhancing Hi-C contact matrices for loop detection with Capricorn, a multi-view diffusion model: Paper, github

More about me

Besides research, I love traveling around and I am also fond of topics including transportation (subways/undergrounds/light rails, etc.), geography (especially Chinese geography), linguistics, Vocaloid (Hatsune Miku & IA in Japanese, Luo Tianyi in Chinese) as well as financing (VC investment etc., welcome to talk with me about AI startups and investment in AI).

Popular repositories Loading

  1. A-Summary-Sheet-of-Optimization-in-Deep-Learning A-Summary-Sheet-of-Optimization-in-Deep-Learning Public

    TeX 9

  2. T-Rex T-Rex Public

    T-Rex: Text-assisted Retrosynthesis Prediction

    Python 8 2

  3. SichuaMahjongAI SichuaMahjongAI Public

    Python 3

  4. blog blog Public

    HTML 3

  5. multiomics multiomics Public

    Python 3

  6. dsrw dsrw Public

    Forked from Haozh20/dsrw

    C++ 2