Minguk Kang (강민국)
I am a Founding Research Scientist at Pika. I received my Ph.D. from the Graduate School of AI at POSTECH in 2026, advised by Prof. Suha Kwak (2023-2026) and Prof. Jaesik Park (2020-2023). Previously, I was a Research Scientist Intern at Adobe Research, where my work on GigaGAN contributed to Adobe Firefly. I received my B.S. from Pusan National University.
At Pika, I have contributed to PikaStream1.0, an audio-driven performance model, and video generation models including Pika 1.5, 2.0, 2.1, and Pika 2.2. My work spans tokenizers, diffusion distillation, fast super-resolution, and components for real-time video agent systems.
My research focuses on efficient generative modeling for real-time content generation across video, audio, and multimodal media. I am particularly interested in high-compression, low-latency tokenizers, few-step diffusion distillation, fast super-resolution, tokenizer design and diffusibility, and multimodal generative modeling.
Education
| Feb, 2020 - Feb, 2026 | Pohang University of Science and Technology (POSTECH), Pohang, South Korea Ph.D. in Graduate School of AI Advisors: Prof. Suha Kwak (2023-2026) and Prof. Jaesik Park (2020-2023) Thesis: Efficient Deep Generative Models for Visual Content Generation |
|---|---|
| Mar, 2013 - Aug, 2019 | Pusan National University, Pusan, South Korea B.S. in Engineering (Major: Mechanical Engineering; Minor: Statistics) Graduated summa cum laude, ranked 1st among 394 students in the College of Engineering. |
Experience
| Nov, 2024 - Present | Pika Labs, South Korea (Remote)
|
|---|---|
| Jun, 2024 - Oct, 2024 | Pika Labs, Korea (Remote) / Palo Alto, USA
|
| Jul, 2022 - May, 2024 | Adobe Research Creative Intelligence Lab, Korea (Remote) / San Francisco, USA
|
| Feb, 2020 - Feb, 2026 | Computer Vision Lab, POSTECH, Pohang, South Korea
|
| Aug, 2017 - Jan, 2020 | Vision and Intelligent System Lab, Pusan National University, Pusan, South Korea
|
Products & Software
PikaStream1.0: core contributor to a real-time video agent system for group video chat, focusing on low-latency generation and multimodal capabilities.
Audio-Driven Performance Model: developed generation and acceleration pipelines.
Pika Video Generation Models: contributed to Pika 1.5, Pika 2.0, Pika 2.1, and Pika 2.2, with work on tokenizers, distillation, and fast super-resolution.
Adobe Firefly is Adobe’s visual generative AI product suite; my GigaGAN research contributed to its development.
PyTorch StudioGAN is an open-source PyTorch library for representative GAN training and evaluation.
Publications
-
Context-Aware Image CompletionIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop, 2023
Honors and Awards
Outstanding Reviewer, European Computer Vision Association (2024)
|