About me

I am a third-year Ph.D. Candidate in the Robotics Perception and Learning Lab (RIPL) at Georgia Institute of Technology advised by Prof. Zsolt Kira. Previously, I was fortunate to work with Prof. Hongteng Xu in Structured Data Science Lab (SDSL) at Renmin University of China. My research aims to improve the generalizability of foundation models, especially vision–language models. I am particularly interested in robust fine-tuning, reasoning, and vision–language–action models.

News

  • [2025.11] MAPS is online. Excited to share our first work on VLAs!
  • [2025.06] Mimicking or Reasoning was featured on YouTube by Discover AI!
  • [2025.04] I received the CVPR 2025 Travel Support Award, thanks! See you in Nashville!
  • [2025.04] I passed my Qualifying Exam and officially become a Ph.D. Candidate!
  • [2025.02] FRAMES-VQA was accepted by CVPR 2025!
  • [2025.01] DiGraP was accepted by ICLR 2025!
  • [2023.08] I joined Georgia Institute of Technology for the Machine Learning PhD Program!