I am an Applied Scientist at Amazon AGI, working on the Nova Cross-modal Foundation Model. Before joining Amazon, I spent nearly seven years (2018–2025) at Baidu VIS, where I grew from a research intern into a Senior/Staff Researcher and contributed to multiple large-scale computer vision and multimodal projects. Since 2021, I have been collaborating closely with Chief Scientist Dr. Jingdong Wang (IEEE Fellow). I earned my Ph.D. from the MMLab at The University of Sydney, supervised by Prof. Wanli Ouyang. Previously, I obtained my M.S.E. degree from the University of Chinese Academy of Sciences (UCAS), under the supervision of Prof. Shifeng Chen and Prof. Yu Qiao.
Since 2016, I have worked in AI research and development across academia and industry, with experience at Amazon AGI, Baidu AIG, Snap Research, SenseTime Research, Samsung Research, and iQIYI AI. I have also been affiliated with leading academic institutions, including MMLab@USYD, MMLab@CUHK, and MMLab@SIAT-CAS.
I am honored to have been awarded the Baidu PhD Fellowship (2023) and the DAAD AInet Fellowship (2025).