likaixin2000/README.md

👋 Hi there

I am a Ph.D. student at the National University of Singapore, focusing on multimodal models, autonomous agents, and code generation. 🚀

🔍 About Me

🎓 Ph.D. researcher passionate about exploring AI, deep learning, and computational creativity.

🧠 Deeply interested in building intelligent systems that can understand and generate across modalities (e.g., text, images, videos, and code).

🤝 Open to collaborations, research discussions, and exciting projects in AI and machine learning.

Kaixin Li's GitHub stats

Pinned

  1. QwenLM/Qwen3-VL Public

    Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

    Jupyter Notebook 15.4k 1.2k

  2. ScreenSpot-Pro-GUI-Grounding Public

    GUI Grounding for Professional High-Resolution Computer Use

    Python 274 29

  3. YXB-NKU/SE-GUI Public

    [NeurIPS 2025] "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

    Python 73 4

  4. MMCode Public

    [EMNLP 2024] Multi-modal reasoning problems via code generation.

    Python 26 1

  5. qishenghu/InstructCoder Public

    InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral, ACL 2024 SRW

    Python 62 6

  6. GUI-Agent/HackWorld Public

    PHP 4