About Me
I am Phillip (Yuseung) Lee, a second-year PhD student at KAIST Graduate School of AI, advised by Prof. Minhyuk Sung.
I am also a student researcher at Google DeepMind, Mountain View.
I received both my Master's and Bachelor's degrees from KAIST.
I am also fortunate to closely collaborate with Prof. Leonidas Guibas.
I am a recipient of the Qualcomm Innovation Fellowship Korea 2025.
My research interests revolve around analyzing and enhancing the spatial understanding of multimodal foundation models,
with the broader goal of building truly autonomous and embodied agents capable of operating in the physical world.
Specifically, I have worked on advancing the 3D spatial reasoning capabilities of vision-language models (VLMs),
and enhancing the spatial control of diffusion models.
I am open to collaboration opportunities! Please feel free to contact me via email.
My CV is available here: Curriculum Vitae (CV).
Work Experience
News
Publications
CVPR
Token Warping Helps MLLMs Look from Nearby Viewpoints
CVPR 2026
Preprint
Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
Preprint, 2025
Preprint
DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
AI for Creative Visual Content Generation, Editing and Understanding (CVEU) Workshop at CVPR 2026
WACV
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
WACV 2026
ICCV
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
ICCV 2025, Human to Robot (H2R) Workshop at CoRL 2025
NeurIPS
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
(* equal contribution)
NeurIPS 2024
ECCV
ReGround: Improving Textual and Spatial Grounding at No Cost
ECCV 2024
NeurIPS
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
NeurIPS 2023
Teaching
- Teaching Assistant, Fall 2025, Diffusion and Flow Models (CS492-C), KAIST
- Teaching Assistant, Spring 2025, Machine Learning for 3D Data (CS479), KAIST
- Teaching Assistant, Fall 2024, Diffusion Models and Their Applications (CS492-D), KAIST
- Teaching Assistant, Fall 2023, Machine Learning for 3D Data (CS479), KAIST
Academic Services
- Reviewer: NeurIPS (2024, 2025), ICLR (2025, 2026), ICML (2025), ICCV Workshop (SP4V, 2025), 3DV (2026), AAAI (2026), TPAMI (2025), Eurographics (2026)
- Workshop Organizer: 2nd Workshop on Multimodal Spatial Intelligence (MUSI) at CVPR 2026
Achievements