
<(From Left) M.S candidate Soyoung Choi, Ph.D candidate Seong-Hyeon Hwang, Professor Steven Euijong Whang>
Just as human eyes tend to focus on pictures before reading accompanying text, multimodal artificial intelligence (AI)—which processes multiple types of sensory data at once—also tends to depend more heavily on certain types of data. KAIST researchers have now developed a new multimodal AI training technology that enables models to recognize both text and images evenly, enabling far more accurate predictions.
KAIST (President Kwang Hyung Lee) announced on the 14th that a research team led by Professor Steven Euijong Whang from the School of Electrical Engineering has developed a novel data augmentation method that enables multimodal AI systems—those that must process multiple data types simultaneously—to make balanced use of all input data.
Multimodal AI combines various forms of information, such as text and video, to make judgments. However, AI models often show a tendency to rely excessively on one particular type of data, resulting in degraded prediction performance.
To solve this problem, the research team deliberately trained AI models using mismatched or incongruent data pairs. By doing so, the model learned to rely on all modalities—text, images, and even audio—in a balanced way, regardless of context.
The team further improved performance stability by incorporating a training strategy that compensates for low-quality data while emphasizing more challenging examples. The method is not tied to any specific model architecture and can be easily applied to various data types, making it highly scalable and practical.

<Model Prediction Changes with a Data-Centric Multimodal AI Training Framework>

Professor Steven Euijong Whang explained, “Improving AI performance is not just about changing model architectures or algorithms—it’s much more important how we design and use the data for training.” He continued, “This research demonstrates that designing and refining the data itself can be an effective approach to help multimodal AI utilize information more evenly, without becoming biased toward a specific modality such as images or text.”
The study was co-led by doctoral student Seong-Hyeon Hwang and master’s student Soyoung Choi, with Professor Steven Euijong Whang serving as the corresponding author. The results will be presented at NeurIPS 2025 (Conference on Neural Information Processing Systems), the world’s premier conference in the field of AI, which will be held this December in San Diego, USA, and Mexico City, Mexico.
※ Paper title: “MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning,” Original paper: https://arxiv.org/pdf/2509.25831
The research was supported by the Institute for Information & Communications Technology Planning & Evaluation (IITP) under the projects “Robust, Fair, and Scalable Data-Centric Continual Learning” (RS-2022-II220157) and “AI Technology for Non-Invasive Near-Infrared-Based Diagnosis and Treatment of Brain Disorders” (RS-2024-00444862).
< Group photo of the KAIST-MIT Quantum Information Winter School > “Through the KAIST-MIT Quantum Information Winter School, I was able to view research from a broader perspective. The experience of collaborating with students from various universities and majors to complete a project was very refreshing,” said Jun-hyeong Cho, a student at the KAIST School of Electrical Engineering. KAIST announced on the 16th that the Graduate School of Quantum Science and Technology suc
2026-01-16<Jae-Chul Kim, Honorary Chairman of Dongwon Group> "In the era of AI, a new future lies within the sea of data. I ask that KAIST leaps forward to become the world's No. 1 AI research group." — Jae-Chul Kim, Honorary Chairman of Dongwon Group KAIST announced on January 16th that Honorary Chairman Jae-Chul Kim of Dongwon Group has pledged an additional 5.9 billion KRW in development funds to foster Artificial Intelligence (AI) talent and strengthen research infrastructure,
2026-01-16<(From Left) Distinguisehd Professor Sang Yup Lee, Dr. Gi Bae Kim, Professor Bernhard O. Palsson> “We know the genes, but not their functions.” To resolve this long-standing bottleneck in microbial research, a joint research team has proposed a cutting-edge research strategy that leverages Artificial Intelligence (AI) to drastically accelerate the discovery of microbial gene functions. KAIST announced on January 12th that a research team led by Distinguished Professor Sang
2026-01-12<Dr. Jung Won Park, (Upper Right) Professor Jeong Ho Lee, Professor Seok-Gu Kang> IDH-mutant glioma, caused by abnormalities in a specific gene (IDH), is the most common malignant brain tumor among young adults under the age of 50. It is a refractory brain cancer that is difficult to treat due to its high recurrence rate. Until now, treatment has focused primarily on removing the visible tumor mass. However, a Korean research team has discovered for the first time that normal brain cell
2026-01-09