
Phillip Y. Lee

이유승

PhD Student

KAIST Graduate School of AI

About Me

I am Phillip Lee, a second-year PhD student at the KAIST Graduate School of AI, advised by Prof. Minhyuk Sung. I am also a student researcher at Google DeepMind in Mountain View. I received both my Master's and Bachelor's degrees from KAIST. I am also fortunate to collaborate closely with Prof. Leonidas Guibas. I am a recipient of the Qualcomm Innovation Fellowship Korea 2025.

My research interests revolve around analyzing and enhancing the spatial understanding of multimodal foundation models, with the broader goal of building truly autonomous, embodied agents capable of operating in the physical world. Specifically, I have worked on advancing the 3D spatial reasoning capabilities of vision-language models (VLMs) and on enhancing the spatial control of diffusion models.

I am open to collaboration opportunities! Please feel free to contact me via email. My CV is available here: Curriculum Vitae (CV).

Work Experience

News


Publications

Selected publications.

  1. Hidden Sensitivity in Spatial Reasoning Evaluation: Diagnosis and Re-ranking with VSI-Bench
    Combining Theory and Benchmark (CTB) Workshop at ICML 2026
  2. Token Warping Helps MLLMs Look from Nearby Viewpoints
    Phillip Y. Lee*, Chanho Park*, Mingue Park, Seungwoo Yoo, Juil Koo, Minhyuk Sung (* equal contribution)
    CVPR 2026
  3. Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
    Juil Koo*, Daehyeon Choi*, Sangwoo Youn*, Phillip Y. Lee, Minhyuk Sung (* equal contribution)
    Reinforcement Learning from World Feedback (RLxF) Workshop at ICML 2026
  4. DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
    ECCV 2026
  5. Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
    WACV 2026
  6. Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
    ICCV 2025 / Human to Robot (H2R) Workshop at CoRL 2025
  7. GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
    Phillip Y. Lee*, Taehoon Yoon*, Minhyuk Sung (* equal contribution)
    NeurIPS 2024
  8. ReGround: Improving Textual and Spatial Grounding at No Cost
    Phillip Y. Lee, Minhyuk Sung
    ECCV 2024
  9. SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
    NeurIPS 2023

Teaching

Academic Services

Achievements
