I am a first-year Ph.D. student working with Prof. Stella X. Yu in CSE at the University of Michigan. Before joining Stella's group, I worked on ambient sound-conditioned visual synthesis at Korea University. I also worked as a Google Student Researcher, exploring ways to improve image generation and cropping with measurable quality signals.
My research goal is to develop a physically understandable foundation model in an unsupervised manner. Feel free to reach out to chat more about this.
Contact: seungle [at] umich [dot] edu | easter3163 [at] korea [dot] ac [dot] kr
Ph.D. in CSE, 2024~
University of Michigan
MS in Artificial Intelligence, 2022~2024
Korea University, Korea
BS in Computer Science, 2016~2022
University of Seoul, Korea
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
ECCV 2024 Oral (2.3%)
Authors: Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
CVPR 2025
Authors: Seung Hyun Lee*, Jijun Jiang*, Yiran Xu*, Zhuofang Li*, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang
Sound-Guided Semantic Image Manipulation
CVPR 2022
Authors: Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chanyoung Kim, Jinkyu Kim*, Sangpil Kim*
Sound-Guided Semantic Video Generation
ECCV 2022
Authors: Seung Hyun Lee, Gyeongrok Oh, Wonmin Byeon, Chanyoung Kim, Won Jeong Ryoo, Sang Ho Yoon, Jihyun Bae, Jinkyu Kim*, Sangpil Kim*
Robust Sound-Guided Image Manipulation
Neural Networks 2024
Authors: Seung Hyun Lee*, Hyung-gun Chi*, Gyeongrok Oh, Wonmin Byeon, Sang Ho Yoon, Hyunje Park, Wonjun Cho, Jinkyu Kim*, Sangpil Kim*
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion
ICCV 2023
Authors: Yujin Jeong, Wonjeong Ryoo, Seung Hyun Lee, Dabin Seo, Wonmin Byeon, Jinkyu Kim
Functional Hand Type Prior for 3D Hand Pose Estimation and Action Recognition from Egocentric View Monocular Videos
BMVC 2023 Oral
Authors: Wonseok Roh, Seung Hyun Lee, Wonjeong Ryoo, Gyeongrok Oh, Jakyung Lee, Soo Yeon Hwang, Hyung-gun Chi, Sangpil Kim
Audio-guided implicit neural representation for local image stylization
Computational Visual Media 2024
Authors: Seung Hyun Lee, Chanyoung Kim, Wonmin Byeon, Sang Ho Yoon, Jinkyu Kim*, Sangpil Kim*
Soundini: Sound-Guided Diffusion for Natural Video Editing
Under review
Authors: Seung Hyun Lee, Sieun Kim, Innfarn Yoo, Feng Yang, Donghyeon Cho, Youngseo Kim, Huiwen Chang, Jinkyu Kim*, Sangpil Kim*