KAIST Illuminates the Eyes of Humanoid Robots with Minimal Memory
<CVPR 2026 poster session. From left to right: Minseok Seo (KAIST, first author), Mark Hamilton (MIT and Microsoft, second author), and Prof. Changick Kim (KAIST, corresponding author)>
From facial recognition on smartphones to humanoid robots, computer vision technology, which serves as the eyes of artificial intelligence (AI), is widely utilized in our daily lives. A joint research team from KAIST and international institutions has developed a technology that allows AI to see the world more clearly with minimal memory, increasing GPU (Graphics Processing Unit) memory efficiency by up to 16 times. This achievement is evaluated as a core technology that will accelerate the era of humanoid robots and on-device AI.
<Overview of Upsample Anything. Given a high-resolution image, it is first downsampled to a low-resolution image and then reconstructed through test-time optimization (TTO). During this process, pixel-wise anisotropic kernel parameters are learned. The learned kernels are subsequently applied to low-resolution foundation feature maps to generate high-resolution feature maps. These feature maps are then used to perform pixel-wise anisotropic Joint Bilateral Upsampling, enabling high-quality reconstruction at high resolution>
KAIST announced on June 17th that a research team led by Professor Changick Kim from the School of Electrical Engineering, through joint research with researchers from MIT and Microsoft in the United States, has developed 'Upsample Anything,' a universal technology that can enhance the visual performance of AI even with limited GPU memory.
Following its acceptance to 'CVPR 2026,' the world's most prestigious conference in the field of artificial intelligence and computer vision, this achievement was awarded the 'CVPR Compute Gold Star' in recognition of its efficient utilization of computational resources. It was also selected as the 'Transparency Champion,' ranking first overall in the category of research process transparency and reproducibility. This is an accomplishment that widely recognizes the core elements of responsible AI research, including research performance, computational resources used, code disclosure, and experimental reproducibility.
Recently, humanoid robots, autonomous driving systems, and AI based on world models (AI models that learn and predict the physical environment and changes of the real world) have been compressing input images into low-resolution features (core information extracted from images by AI) to increase computational speed and reduce memory usage.
However, during the compression process, a problem occurs where important visual information, such as small objects, thin structures, and minute defects, is lost. Conversely, processing all images at high resolution from the beginning requires massive GPU memory and computational resources, making real-time processing difficult. This has remained an unresolved challenge for a long time in situations where small devices like smartphones or robots, where mobility is crucial, must precisely perceive their surrounding environment.
To overcome these limitations, the research team developed a training-free (requiring no additional data training) upsampling technology that restores low-resolution feature information into high resolution by utilizing the edge and structural information of the input image.
Existing technologies required a separate retraining or complex optimization process to be applied to new environments or data. In contrast, 'Upsample Anything' developed by the research team can find the optimal restoration method using just a single input image, allowing it to be immediately applied to various environments.
In addition, by compressing and utilizing only core information instead of storing and processing all visual information at high resolution, GPU memory usage was significantly reduced. Based on a 224×224 size image (approximately 50,000 pixels) widely used in AI research, the research team restored visual information close to the original with a short calculation of about 0.4 seconds, achieving a performance that improves GPU memory efficiency by up to 16 times.
This means that artificial intelligence can perceive its surrounding environment more precisely even with limited computational resources. Therefore, this technology is expected to be widely used in various next-generation artificial intelligence fields, such as small devices like smartphones, as well as humanoid robots that need to accurately identify and manipulate small objects, autonomous driving systems, and on-device AI.
<Comparison image illustrating the performance gap with conventional methods (AI-generated). Conventional vision foundation models understand a scene by converting the input image into low-resolution features at a small patch level (left). Upsample Anything restores these low-resolution features to the original resolution level, enabling the AI to comprehend the scene's structure and boundaries with significantly higher precision (right)>
Professor Changick Kim said, “This technology is an algorithm that can significantly increase the visual precision of artificial intelligence with fewer resources, and it is expected to accelerate the commercialization of humanoid robots and on-device AI.” He added, “It is even more meaningful because it was recognized at CVPR not only for its performance but also for its computational efficiency and research transparency.”
This research was participated in by KAIST PhD student Minseok Seo as the first author, and this achievement was presented on June 7 at 'CVPR 2026,' the world's most prestigious conference in the field of artificial intelligence and computer vision.
※ Paper Title: Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling, DOI:10.48550/arXiv.2511.16301
※ Author Information: Minseok Seo (KAIST, First Author), Mark Hamilton (MIT, Microsoft, Second Author), Changick Kim (KAIST, Corresponding Author)
Seeking Innovative Financial Solutions to Help Deep Tech Startups Overcome the Valley of Death
< Poster of STARTUP NATION KOREA 2026 >
KAIST announced on June 16 that it will co-host 'STARTUP NATION KOREA 2026' (2026 Innovation Entrepreneurship Nation Korea International Forum) with Seoul National University and The JoongAng from June 17 to 18 at the Haedong Advanced Engineering Building on Seoul National University's Gwanak Campus.
Celebrating its 5th anniversary this year, the forum serves as a platform to overcome the so-called 'R&D Paradox'—where outstanding research and development achievements fail to fully connect with entrepreneurship and industry—and to seek solutions for realizing a science and technology-based innovation entrepreneurship nation. Universities, government agencies, research institutions, investment firms, conglomerates, startups, and media will participate to discuss cooperative methods for connecting technological potential to market and industrial value.
In particular, this year's forum focuses on the role of patient capital and innovative finance, which are critical challenges for the growth of deep tech startups, under the theme “Deep Tech: Beyond the Valley of Death.”
Deep tech startups require long-term investment and large-scale funding throughout the entire process from research and development to technology verification, demonstration, and market entry. However, the domestic venture investment market is relatively focused on short-term returns, often causing these startups to face severe difficulties during the commercialization phase. Therefore, this forum will intensively discuss strategies to establish an innovative financial ecosystem where 'patient capital'—which invests based on the long-term growth potential of technology—is organically linked with investments, guarantees, and policy finance required for the entire life cycle of technology commercialization.
Kwang Hyung Lee, President of KAIST, said, “Although Korea possesses world-class R&D capabilities, major barriers still exist in the process of translating research achievements into startups and new industries. I hope this forum serves as a collective effort to seek solutions through patient capital, innovative finance, and a sustainable entrepreneurial ecosystem so that deep tech startups can overcome the valley of death and grow into the global market.”
Hyun Min Bae, Director of the KAIST Startup Institute, stated, “The success of deep tech entrepreneurship cannot be achieved through technology alone. Through this forum, we look forward to discussing the growth ladders and collaborative measures that connect laboratory technologies to markets, investments, and global expansion, thereby discovering a new direction for Korea's deep tech startup ecosystem.”
On the first day of the forum, Professor Jeong Dong Lee of the College of Engineering at Seoul National University, author of The Way of Accumulation and The First Question, will deliver a keynote speech emphasizing the role of finance in accelerating technological innovation and the importance of patient capital. This will be followed by a roundtable featuring domestic and international experts to discuss long-term investment case studies and policy directions for innovative finance.
In addition, the presentation ceremony for the '2026 Korea Innovation Entrepreneurship Awards' will be held simultaneously. This year, a total of 13 awards will be presented across three categories: the Innovation Entrepreneurship Award, the Innovation Entrepreneurship Challenge Award, and the Innovation Entrepreneurship Ecosystem Contribution Award.
The recipients of the Innovation Entrepreneurship Award include Sovagen, EndoRobotics, WIRobotics, Exo Systems, Marine Drone Tech, Daughter, MUSTBIO, IMNEWRUN, and Narnia Labs. These companies are promising deep tech enterprises recognized for their technological innovation and growth potential in national strategic sectors such as AI, robotics, bio/healthcare, drones, and mobility.
The Innovation Entrepreneurship Challenge Award will be presented to Scionic AI, RX, and TDS Innovation, while the Innovation Entrepreneurship Ecosystem Contribution Award will be awarded to the Commercialization Promotion Agency for R&D Outcomes (COMPA).
Kyeong Hwan Kim, Chair of the Evaluation Committee (Dean of the Graduate School of Global Entrepreneurship at Sungkyunkwan University), remarked, “Promising deep tech companies that will lead future industries in fields like AI, robotics, semiconductors, and bio have stood out remarkably. This contest was a meaningful opportunity to confirm the robust growth potential of Korea's innovation entrepreneurship ecosystem.”
An exhibition featuring innovative startups and various programs linked with government ministries will also run throughout the event. Participating companies will showcase their breakthrough technologies in sectors including AI, robotics, bio/healthcare, advanced materials, energy, and mobility, while exploring collaboration opportunities with investment institutions and industry leaders. Notably, WIRobotics plans to exhibit its humanoid robot 'ALLEX', which was previously unveiled at CES 2026.
The forum is co-hosted by KAIST, Seoul National University, and The JoongAng, and sponsored by major agencies including the Ministry of Science and ICT, the Ministry of Trade, Industry and Energy, and the Ministry of SMEs and Startups. The event will be broadcast live via the official YouTube channel and website of STARTUP NATION KOREA 2026.
KAIST Breaks a Major AI Bottleneck with Liquid Cooling Technology 10 Times More Efficient Than the Previous Record
<(From Left) Professor Sung Jin Kim, Professor Ikjin Lee, Dr. Yong Jin Lee, Ph.D candidate Hansol Lee, Ph.D candidate ChulHyun Hwang>
AI data centers are often described as “power-hungry giants.” Not only do artificial intelligence computations consume enormous amounts of electricity, but a significant amount of energy is also required to cool the semiconductor chips that heat up during operation. As AI chips continue to deliver higher performance, the amount of heat they generate is increasing rapidly. As a result, conventional air cooling and external copper heat spreaders are approaching their practical limits. To address this challenge, KAIST research team has developed an ultra-high-efficiency liquid-cooling technology that cools semiconductor chips from within.
KAIST (President Kwang Hyung Lee) announced on the 16th that a joint research team led by Professor Sung Jin Kim of the Department of Mechanical Engineering and Professor Ikjin Lee of the School of AI and Computing has developed a highly efficient liquid-cooling technology that directly cools high-heat-flux semiconductor chips using room-temperature water. The researchers achieved this by embedding liquid-cooling channels, thinner than a human hair, directly inside a silicon semiconductor chip. The team successfully maintained the chip temperature below 100°C even under extreme heat-generation conditions exceeding 2,000 watts per square centimeter (W/cm²).
<Manifold Microchannel Cooling Device Structure for Cooling High-Heat-Generation Semiconductor Chips>
The researchers focused on a Manifold MicroChannel (MMC hereafter) structure embedded directly inside a silicon chip. Microchannel cooling removes heat through microscopic fluid channels that are smaller than a human hair. In conventional designs, coolant must travel through numerous microchannels from one end of the chip to the other. This long flow path increases flow resistance and requires greater pumping power to circulate the coolant.
The manifold structure developed by the research team distributes coolant through multiple inlet channels and collects it through multiple outlets. An analogy can be drawn to a logistics network: instead of shipping all goods from a single origin to a distant destination, multiple distribution centers are strategically placed to shorten transportation distances. Because the coolant travels only a short distance within each channel, flow resistance is significantly reduced, and the required pumping pressure becomes much lower.
Previous MMC designs often suffered from uneven coolant distribution, with some channels receiving too much coolant while others received too little. The research team overcame this challenge by optimizing the structure so that coolant could flow evenly through all microchannels. Through extensive design analysis and advanced simulations, they identified an optimized cooling architecture that delivers high cooling performance while minimizing energy loss.
The optimized structure was then fabricated in an actual silicon semiconductor chip and experimentally validated. Under the same temperature-rise condition, the cooling system achieved a coefficient of performance (COP) of 106,000. This is approximately ten times higher than the previous world-leading result of around 10,000 reported in Nature (van Erp et al.) in 2020. In practical terms, it means that only about one-tenth of the pumping power is required to remove the same amount of heat.
Notably, this performance was achieved without relying on phase-change cooling, nanoscale surface modifications, or expensive materials such as diamond. Ordinary room-temperature water was used as the coolant. In addition, the device was fabricated using a low-temperature process below 350°C that is compatible with conventional semiconductor manufacturing. This means the technology could be implemented in existing semiconductor foundries without requiring major additional equipment investments.
<Fabricated Cooling Device, Experimental Results, and Applications>
The technology is expected to help address thermal management challenges in a wide range of high-heat-flux electronic systems, including AI accelerators, high-performance computing (HPC) systems, three-dimensional semiconductor packaging, power electronics, and defense electronics. In particular, data centers are increasingly constrained not only by computing performance but also by cooling power consumption and cooling infrastructure requirements. Technologies that reduce pumping power at the chip level could therefore play an important role in improving the energy efficiency of next-generation data centers and alleviating thermal bottlenecks.
<Research Image(AI-generated)>
Professor Sung Jin Kim said, “As the performance of AI semiconductors and advanced electronic packaging becomes increasingly limited by heat, we expect this technology to serve as a foundational cooling solution for future high-performance computing systems.”
The study was co-first-authored by Young Jin Lee, ChulHyun Hwang, and Hansol Lee from the Department of Mechanical Engineering at KAIST. The research was published on June 15 in the international journal Energy Conversion and Management.
Paper title: Highly energy-efficient manifold microchannel for cooling electronics with a coefficient of performance over 100,000
DOI: 10.1016/j.enconman.2026.121422
This research was supported by the Mid-Career Researcher Program of the National Research Foundation of Korea (NRF), funded by the Ministry of Science and ICT (Grant No. 2021R1A2C3011944), and by the Specialized Research Laboratory Program for Ultra-High-Heat-Flux Cooling Systems of the Korea Research Institute for Defense Technology Planning and Advancement (KRIT), funded by the Defense Acquisition Program Administration (Grant No. KRIT-CT-22-022).
Graduate School of Global Digital Innovation (GDI) Hosts 'AI⁺ Global Prosperity Forum 2026'
The Graduate School of Global Digital Innovation (GDI) of KAIST will host the "AI⁺ Global Prosperity Forum 2026" on June 24 at the Chung Kunmo Conference Hall (5F), KAIST Academic Cultural Complex (E9).
KAIST Graduate School of Global Digital Innovation (GDI) is carrying out the "ICT Global Specialized Convergence Talent Cultivation Program" supported by the Ministry of Science and ICT and the Institute of Information & Communications Technology Planning & Evaluation (IITP). Since the launch of the Global IT Technology Program (ITTP) in 2006, GDI has grown into South Korea's representative global digital talent fostering platform over the past 20 years, nurturing approximately 260 government officials, public institution experts, and industry leaders from over 80 countries. GDI serves as a vehicle for global cooperation, sharing Korea's digital innovation experience and policy know-how with the international community, and promoting various collaborative projects such as international joint research, policy cooperation, and digital transformation projects based on its global network.
This forum, organized by GDI as part of the ICT Global Specialized Convergence Talent Cultivation Program under the theme 'Advancing Global AI Leadership Through Partnership and Innovation,' is designed to discuss global cooperation strategies for international partnership, digital transformation, and sustainable development in the AI era.
The event is expected to bring together approximately 60 government officials, international organization experts, researchers, and industry leaders from around 30 countries across Asia, Africa, Latin America, the Middle East, and Europe. Notably, representatives from the African Development Bank (AfDB), the Ministry of Communications and Digital Affairs of Indonesia, and various foreign governments, public institutions, and international organizations will participate to share AI governance, digital transformation, innovation policies, and international cooperation cases.
The forum will feature two main sessions:
Session 1: Global AI Partnership and Collaboration
Session 2: AI Policy, Governance, AI Innovation, and Applications
Participants will discuss global cooperation models in the AI era, digital transformation in the public sector, and methods for establishing AI policy and governance frameworks.
In addition, an 'AI⁺ Industry Showcase' featuring Korean AI and digital innovation enterprises alongside corporate exhibition booths will be operated simultaneously. Participating companies will introduce their innovative AI-based technologies and services, and seek opportunities for Proof of Concept (PoC), joint research, digital transformation projects, and overseas market expansion through business matching sessions with foreign government and public institution officials.
This forum is highly anticipated to serve as a global platform connecting AI technology, policy, industry, and international cooperation, while sharing Korea's AI capabilities and digital innovation experiences with the world and creating practical cooperative outcomes such as international joint research and digital transformation projects.
Seunghun Han, Head of the Graduate School of Global Digital Innovation, stated, "AI has moved beyond mere technology to become a core agenda for national development and international cooperation." He added, "We expect this forum to serve as a venue where policymakers, experts, and companies from around the world gather to explore future AI cooperation strategies and build new global partnerships."
The forum is open to any researchers, students, and the general public interested in AI and global cooperation through pre-registration (https://docs.google.com/forms/d/1QmYMqaD4uoT11NxUZb4ZSQBVgDxex5DJ3_4-eCoqgVI/edit).
※ Inquiries: KAIST Graduate School of Global Digital Innovation (gdi.adm@kaist.ac.kr / 042-350-6845)
"Age of Robots Making Human-Like Judgments, KAIST Solves Key Challenge in Physical AI
< (From left) Professor Chang D. Yoo, Tung M. Luu (PhD candidate, first author) at the back center, and Hwanhee Kim (M.S candidate, second author) at the front right >
“Robots that make judgments like humans are coming faster than we think.” A core technology that will accelerate the era where robots understand human intentions and choose the correct actions on their own has been developed in South Korea. KAIST researchers solved a key challenge in the commercialization of physical AI by developing a technology where AI learns human judgment criteria on its own with just a few videos. KAIST announced on June 10th that a research team led by Professor Chang D. Yoo from the School of Electrical Engineering has developed 'VOTP (Video-based Optimal TransPort Preference)' for the first time in the world, a new technology that allows AI to learn human intentions and judgment criteria using just a few preference videos instead of thousands to tens of thousands of human evaluation data points.
< VOTP Overview Diagram >
The research team's paper has been accepted to ICML (International Conference on Machine Learning) 2026, the world's most prestigious AI conference, which will be held at COEX in Seoul this July. It was selected for an Oral presentation, an honor given to only the top 0.7% (168 papers) out of all submitted papers (23,918 papers), recognizing the excellence of the research. ICML is considered one of the most influential international conferences in the fields of AI and machine learning. Recently, AI technology is rapidly evolving beyond generative AI that writes text and draws pictures into the era of 'Physical AI,' which moves actual machines and acts in the real world. Representative examples include robots that perform dangerous tasks in factories instead of humans, autonomous vehicles that judge road situations on their own, and medical robots that perform delicate surgeries. However, there was a barrier that had to be overcome for the practical application of physical AI. It is the problem of learning human-level evaluation criteria to judge whether the actions performed by a machine match human intentions and which actions are more desirable. For example, when a surgical robot performs suturing or an autonomous vehicle passes through a complex intersection, the AI must choose the most appropriate action among numerous options. To achieve this, a 'Reward Function' that reflects human preferences and judgment criteria is required. However, until now, humans had to directly evaluate thousands to tens of thousands of action data points to build this, which required an enormous amount of time and cost. The research team focused on the way humans learn new tasks after seeing just a few demonstrations. VOTP, developed by the research team, helps AI understand human-preferred action patterns on its own with just a few videos of good and bad examples. Even without humans evaluating a vast amount of data one by one as before, AI can understand human judgment criteria and expand its learning to various situations. The core idea of this research is that intelligent machines such as robots or autonomous vehicles can quickly grasp human intents with only a small number of videos containing human preferences. The algorithm developed for this purpose proved its effectiveness and generalization performance through extensive experiments across various environments and tasks. This method can significantly reduce human feedback and data construction costs required for physical AI development. Since robots, autonomous vehicles, and industrial machinery can learn actions that meet human expectations with only a small number of examples, it is expected to drastically shorten development time and costs. The technology can be widely applied not only to robot arm control, humanoid robots, autonomous vehicles, smart factories, drones, and surgical robots, but also to AI agents that directly operate computers. In particular, it is expected to be utilized as a core foundational technology for all physical AI systems that need to learn human intention and satisfaction.
< VOTP Research Image (AI Generated) >
Professor Chang D. Yoo said, "The core of physical AI is making machines understand human intentions and choose the correct actions," and added, "Since VOTP can learn human judgment criteria with only a small number of videos, it is a core technology that will accelerate the era of robots making human-like judgments." This research, in which PhD student Tung M. Luu from the School of Electrical Engineering participated as the first author, was selected as an Oral presentation paper at ICML (International Conference on Machine Learning) 2026, the world's most prestigious AI conference. ※ Paper Title: Video-Based Optimal Transport for Feedback-Efficient Offline Preference-Based Reinforcement Learning, Paper File: https://sanctusfactory.com/data/file/publications/202606091714078906.pdf This research was conducted with support from the Institute for Information & Communication Technology Planning & Evaluation (IITP) and the National Research Foundation of Korea (NRF), funded by the Ministry of Science and ICT."
KAIST Launches Mind Care & Growth Center, an Integrated Mental Health Platform for the AI Era
<Human Behavior and Mental Health Symposium Poster>
KAIST announced the official launch of the KAIST Mind Care & Growth Center (KMCG), a new integrated platform that strengthens mental health support for students and faculty while advancing digital mental health research. To mark the occasion, KAIST hosted an international symposium titled "Human Behavior and Mental Health" on June 10, 2026, at the Cho Su-mi Hall in the Chang Young Shin Student Activity Center on its main Daejeon campus.
The symposium drew significant interest from both academia and the public, with pre-registration reaching capacity within just one week of opening. More than 300 KAIST faculty members, researchers, students, and global digital health experts attended, underscoring the urgent demand for advanced mental health frameworks in the era of artificial intelligence.
A Unified Hub for Integrated Mental Health Support
The KMCG consolidates psychological counseling, psychiatric care, and crisis intervention services that were previously dispersed across campus, expanding and reorganizing the existing student counseling center. By eliminating the inconvenience of navigating multiple support channels, the center provides more systematic, seamless, and consistent care under one roof.
Going beyond a traditional counseling center, the KMCG serves as a living laboratory that fuses real-world mental health expertise with KAIST's research capabilities. Researchers from artificial intelligence, brain science, industrial design, digital humanities, mathematics, and computer science collaborate to develop and evaluate new approaches to mental health support — delivering evidence-based interventions to students, gathering practice-based insights, and continuously improving services.
Keynote: Global Collaboration with UCSF Neuroscape
The keynote address was delivered by Professor Adam Gazzaley, Founder and Executive Director of Neuroscape at the University of California, San Francisco (UCSF), and co-founder of Akili Interactive, developer of EndeavorRx — the world's first FDA-authorized prescription video game treatment.
Professor Gazzaley presented the latest advances in digital mental health, including VR-based cognitive training and multimodal biosensing. Neuroscape's interdisciplinary model — integrating clinical neuroscience, engineering, design, and AI — stands as a leading example of the convergence research that KMCG seeks to promote, and serves as a benchmark for building a multidisciplinary mental health innovation ecosystem at KAIST.
In subsequent lightning talk sessions, KAIST faculty from mathematics, brain engineering, AI, computer science, industrial design, and digital humanities presented future research directions and discussed opportunities for global collaboration with Neuroscape.
New Research: Generative AI as a Clinically Ambivalent Technology
In parallel with its service mission, the KMCG is already generating impactful scientific insights. Center Director Dooyoung Jung recently co-authored a study with Professor Chul-Hyun Cho of Korea University College of Medicine, published on June 2, 2026, in the Journal of Medical Internet Research (JMIR), examining the impact of generative AI on psychiatric practice.
<(From Left) Ph. D candidate Myungsung Kim, Professor Yoosuk An, Professor Dooyoung Jung, Professor Chul-Hyun Cho>
Based on the real-world clinical experiences of 311 Korean psychiatrists, the study found that generative AI may help organize emotions, support self-care, and improve access to treatment — but may also create risks such as overdependence, reinforcement of distorted beliefs, and potentially dangerous situations depending on patient vulnerability and context of use.
<An AI-generated image of a psychiatrist consulting with a patient who uses AI.>
The research team described generative AI as a "clinically ambivalent technology" and emphasized that it should assist, not replace, human therapists. Safe adoption requires technological reliability, clinical validation, crisis response systems, and robust governance by medical professionals.
<Structure of Item-by-Item Topic Analysis and Co-occurrence Analysis>
<Key Connections Among Clinical Signals, Interpretations, and Priority Implementation Tasks for Generative AI>
Leadership Perspectives
Dooyoung Jung, Director of the KMCG, stated: "A healthy mind is the foundation for achieving outstanding research. We will develop the Mind Care & Growth Center into a platform that leads mental health solutions for universities across Korea."
KAIST President Kwang Hyung Lee remarked: "In an era where artificial intelligence is replacing even high-level human intellectual labor, our greatest concern is not technological deficiency, but the potential erosion of human-centered values and culture. Technology must remain a tool guided by human wisdom and philosophy — not the other way around."
As AI becomes more deeply embedded in everyday life, the KMCG is positioned to address new mental health challenges arising from AI dependency while harnessing digital technology in ways that support human flourishing. Pre-registration for the symposium has closed, but interested members of the public were welcome to attend on-site. Some services and amenities were limited.
Related Publication
Title: Mapping Practice-Based Signals of Generative AI in Psychiatric Care: Qualitative Study of Korean Psychiatrists' Experiences, Interpretations, and Implementation Priorities
Journal: Journal of Medical Internet Research (JMIR), June 2, 2026
DOI: 10.2196/96556
Authors: Myungsung Kim (UNIST, co-first author), Yoosuk An (Seoul National University / National Traffic Rehabilitation Hospital, co-first author), Chul-Hyun Cho (Korea University, co-corresponding author), Dooyoung Jung (KAIST, co-corresponding author)
KAIST Develops AI Technology That Automatically Generates Sounds as If a “Jurassic Park” Dinosaur Were Actually Walking Toward You
<(From Left) Hyun-Bin Oh, Takida Yuhta, Uesaka Toshimitsu, Tae-Hyun Oh, Mitsufuji Yuki>
When people watch a scene in the film Jurassic Park where a giant dinosaur walks toward them, they naturally imagine a heavy, rumbling sound, as if the ground were shaking. This is because humans predict sound by considering not only the shape of an object, but also physical properties such as its size, weight, and speed of movement. However, existing video-to-audio generation AI mainly generates sound based on the category of objects or scene information in the video, and has not sufficiently reflected physical properties that vary depending on weight or speed.
KAIST (President Kwang Hyung Lee) announced on the 26th of May that a collaborative research team involving Professor Tae-Hyun Oh of the School of Computing, KAIST, together with joint researchers from POSTECH (President Sung Keun Kim) and Sony AI, has developed “PAVAS (Physics-Aware Video-to-Audio Synthesis),” an artificial intelligence (AI) technology that understands the physical situation in a video and generates more realistic sound.
<Concept Diagram of PAVAS (Physics-Aware Video-to-Audio Synthesis) Technology>
The key feature of this technology is that it is designed so that AI can infer invisible physical information such as the mass and velocity of objects in a video on its own. Ordinary videos do not provide exact numerical values for an object’s weight or speed, but the research team enabled AI to estimate them by analyzing the surrounding environment and movement context, and to reflect the results in the sound generation process.
In other words, the AI was designed to go beyond simply recognizing “what is visible” and to understand the physical cause of “why this sound should occur.”
As a result of technical validation, the research team’s AI generated sounds very similar to real-world environments in scenes involving physical interactions such as collisions or impacts between objects. In particular, it produced more realistic audio in which loudness and tone naturally changed when the mass and velocity of objects varied.
Recently, generative AI technologies that simultaneously generate video and audio have been advancing rapidly. Representative examples include Google’s “Veo 3” and ByteDance’s “Seedance 2.0.” However, in actual film, advertising, and game production sites, there is far greater demand for post-production work that adds sound effects suited to existing video scenes or supplements audio than for generating entirely new videos.
While existing commercial AI models have focused on generating video and audio together, PAVAS is differentiated by its ability to analyze the movement and collision characteristics of objects in a video and generate realistic sound effects that precisely match the scene.
<Comparison of Spectrograms Generated by Conventional Video-to-Audio Models and PAVAS>
The research team explained that this technology presents new possibilities in the field of “Physical AI,” or physically consistent generative AI. Physically consistent generative AI refers to AI that goes beyond simply producing plausible results and understands the laws of physics and causal relationships in the real world.
In the future, this technology is expected to provide more immersive user experiences in a wide range of fields, including the automation of content sound production, augmented reality (AR) and virtual reality (VR) content, the metaverse, and robotics simulation.
Professor Tae-Hyun Oh stated, “While existing generative AI has developed by increasing the scale of data and models, this research is meaningful in that it was designed so that AI directly understands physical quantities and causal relationships,” adding, “In the future, it can be expanded into a core foundational technology for next-generation multimodal AI that simultaneously understands and processes diverse types of information, including text, video, and speech.”
This study was led by POSTECH integrated M.S.-Ph.D. student Hyun-Bin Oh as the first author, with KAIST Professor Tae-Hyun Oh and Sony AI researchers Yuhta Takida, Toshimitsu Uesaka, and Yuki Mitsufuji participating as co-authors. This research was selected as an Oral presentation paper at CVPR 2026 (Computer Vision and Pattern Recognition 2026), the world’s most prestigious academic conference in the field of computer vision (image-based artificial intelligence technology), where only the top 0.88% of all papers are selected for oral presentation, recognizing the excellence of the work. The presentation is scheduled to take place on June 6.
※ Paper title: “PAVAS: Physics-Aware Video-to-Audio Synthesis,” DOI: https://arxiv.org/abs/2512.08282
This research was supported by the Mid-Career Research Program under the Basic Research Program of the Ministry of Science and ICT, the Pioneer Research Program for Future Converging Technology of the Ministry of Science, ICT and Future Planning, the AGI Program of the Ministry of Science and ICT, and the KAIST InnoCORE Program.
Welcome to South Korea's Hosting of the 'Global AI Hub'
KAIST announced on May 22nd that the entire faculty of the Graduate School of AI welcomes South Korea's hosting of the 'Global AI Hub.' The faculty determined that hosting this will serve as a crucial momentum builder for South Korea to earnestly contribute to international cooperation and the responsible use of technology in the artificial intelligence (AI) era.
In a joint statement, the faculty of the KAIST Graduate School of AI expressed, "Hosting the Global AI Hub goes beyond simply attracting an international organization or center to the country. It marks a significant turning point where South Korea, leveraged by its competitive edge in AI technology, can shape the direction of responsible AI together with the international community."
They further emphasized, "Moving forward, the standards for an AI powerhouse will not be determined solely by larger models and more infrastructure. How safely and responsibly we utilize AI, and how we share its benefits with the international community, is becoming increasingly vital."
The statement also noted, "South Korea has accumulated significant competitiveness in the AI sector based on its outstanding researchers and corporations, semiconductor and manufacturing capabilities, and rapid experience in digital transformation. This hosting is a precious opportunity to connect these capabilities with the international community and contribute to spreading the benefits of AI more widely."
KAIST President Kwang Hyung Lee stated, "AI is no longer just a technological competition between individual nations; it is a field where the international community must work together to shape a responsible direction for development. South Korea’s hosting of the Global AI Hub will serve as a major milestone for our country to leap forward as a central nation of trust and cooperation, not just in AI technology."
He added, "KAIST will also actively contribute to expanding cooperation with the international community, built upon our world-class AI research and talent cultivation."
KAIST is actively conducting research in a variety of related fields, including large-scale AI models, multimodal AI, AI semiconductors, robotics, manufacturing/bio/medical AI, as well as AI safety and reliability. Through convergence research spanning from foundational studies to empirical demonstrations in industrial and public sectors, we are contributing to strengthening South Korea's AI competitiveness and developing the global AI ecosystem.
Furthermore, through the establishment and operation of the Graduate School of AI, KAIST has built a world-class AI research capacity and a system for nurturing convergence-type talents. We are continuously expanding responsible AI education and research that encompasses safety, ethics, and governance alongside AI technology itself.
The faculty of the KAIST Graduate School of AI stated, "The KAIST Graduate School of AI is constantly pursuing education and research that comprehensively address AI algorithms and models, AI semiconductors and systems, convergence with industry and science/technology, and AI safety and governance. We intend to contribute to connecting South Korea's AI capabilities with the international community through cooperation with domestic universities, research institutions, corporations, and public agencies."
Lastly, they concluded, "The true meaning of the Global AI Hub lies not in the hosting itself, but in what kinds of talents we nurture afterward, what research and empirical evidence we share with the world, and what cooperative structures we build with the international community. We will continue to faithfully fulfill our necessary roles for talent cultivation and research cooperation."
“Entrepreneurial Mutual Growth Fair 2026” to be Held... KAIST Super Star Companies Gather for AI Solopreneurship, Tech Commercialization, Investment, and Youth Job Fair
KAIST announced that it will host the ‘AI Agent-Based Solopreneurship Program Information Session’ and the ‘Entrepreneurial Mutual Growth Fair 2026’ for two days from May 18th to 19th.
In this event, KAIST’s new AI-based solopreneurship model, which utilizes AI not merely as an operational tool but as a ‘Co-founder,’ will be introduced in depth. The university will hold an information session for the ‘AI Solopreneur Support Project,’ which enables a single individual to carry out the entire entrepreneurial process—including planning, development, marketing, and fundraising—using AI agents prepared by the university.
In this program, 100 prospective entrepreneurs will be selected nationwide, and faculty from the KAIST Institute for Entrepreneurship and the KAIST Graduate School of AI will provide eight weeks of intensive training. Additionally, a network of top-tier domestic and global mentors will be established to support business optimization and linkage with overseas investments.
In particular, outstanding teams will be provided with seed investment of up to 100 million KRW, prototype production support, and infrastructure for GPU and AI services. By fostering world-class AI utilization skills in prospective entrepreneurs with diverse domain knowledge, KAIST plans to accelerate the introduction of AI into various domestic industries while nurturing AI business models with global competitiveness.
This event is organized as a venue to introduce the KAIST-style full-cycle entrepreneurial ecosystem, encompassing artificial intelligence (AI)-based entrepreneurship, technology commercialization, industry-academic cooperation, investment linkage, and youth job creation. In particular, it will showcase the competitiveness of the deep-tech (advanced technology-based) startup ecosystem from multiple perspectives, focusing on the technological prowess and industrial application cases of KAIST startup companies.
Global big tech companies' choice of AI solution providers will also participate to reveal various technologies reflecting the AX (AI Transformation) trend across industries. Actual application cases that supported the digital transformation of major domestic corporations through factory and office automation solutions will also be announced.
In the field of robotics, Lion Robotics will introduce field-application technologies based on quadruped robots and leading R&D cases for humanoid robots. In addition, next-generation AI semiconductor startups such as Panmnesia and HyperAccel will present next-generation chip design technologies for implementing On-Device AI. These companies will showcase technologies and business models that can run Large Language Model (LLM)-based AI services faster while reducing dependence on GPUs (Graphics Processing Units). In the deep-tech bio and healthcare AI field, Barreleye will introduce an innovative solution that complements the limitations of traditional MRI (Magnetic Resonance Imaging)-centered diagnosis through AI-based quantitative ultrasound analysis technology. In the bio and medical robot field, Roen Surgical will present next-generation medical innovation cases based on precision surgical robot technology.
On the first day of the event, May 18th, the ‘Entrepreneurial Mutual Growth Fair’ will be held in the main hall on the 1st floor of the KI Building along with the ‘AI Agent-Based Solopreneurship Program Information Session.’ Representative startup companies that have led KAIST’s technology commercialization success will participate in this session to share successful technology commercialization models that connected R&D achievements to actual market results.
Through this, they plan to present a virtuous cycle for the KAIST startup ecosystem leading from ‘Research → Startup → Investment → Growth.’ Furthermore, KAIST startup companies will operate recruitment sessions alongside technology exhibitions. Participating companies will conduct direct recruitment consultations and talent discovery on-site, providing youth with high-quality, technology-based job opportunities. Through this event, the university plans to support scientific and technological talents so they can advance into startups and industrial fields rather than staying in research, and to lead technology-based entrepreneurship and employment creation. On the second day, May 19th, an ‘Open Innovation Information Session’ will be held to connect KAIST’s research capabilities with industrial demand.
At the event, the ‘1 Lab N Startup’ model, which connects KAIST faculty’s technology with corporate R&D needs to promote joint research and commercialization, will be introduced. Industry-academic cooperation strategies that expand beyond technology transfer to joint entrepreneurship and new business creation will also be announced. Following this, in the ‘KAIST Startup Investment Linkage IR Pitching Session,’ the investment attraction program ‘Tech Plaza’ will be operated, featuring five Korean deep-tech bio companies. Companies selected based on the KAIST Startup Platform (KSTP) will present their business models and technological prowess to investors, and tangible investment results are expected through linkage with venture capital (VC) and accelerators. Bae Hyeon-min, Dean of the KAIST Institute for Entrepreneurship, said, “This Entrepreneurial Mutual Growth Fair is an integrated startup platform that connects the entire process from AI-based individual entrepreneurship to technology commercialization, industry-academic cooperation, investment, and job creation.
We expect it to serve as an opportunity to present a new direction for the domestic deep-tech startup ecosystem through the success stories of KAIST’s representative startups.” This event is open to students, the general public, corporations, and investment institutions interested in entrepreneurship, and is prepared as a place to directly confirm the innovative achievements and expansion possibilities of the KAIST startup ecosystem. Information regarding the KAIST AI Solopreneurship Program information session and participation applications can be found on the website (https://www.kaist-overedge.com/).
By accessing the website, people can watch the information session on YouTube and apply for participation.
Professor Yiyun Kang Selected as TED 2026 Main Stage Speaker
< Professor Yiyun Kang (Photo Credit: Ryan Lash / TED) >
KAIST announced on April 17th that Professor Yiyun Kang of the Department of Industrial Design has been selected as a speaker for the Main Stage at TED 2026, the world-renowned knowledge conference.
Founded in 1984 under the motto "Ideas Worth Spreading," TED is an American non-profit knowledge platform where scholars, innovators, and artists from around the globe gather annually to lead global discourse. Previous Korean speakers on the Main Stage include novelist Young-ha Kim (2012) and violinist Ji-hae Park (2013). In 2011, roboticist Professor Dennis Hong stood on the main conference stage as the first Korean-American speaker.
< TED Lecture Photo (Photo Credit: Ryan Lash / TED) >
Professor Kang’s selection is particularly significant as it marks the first time since TED moved its venue to Vancouver, Canada, in 2014 that a Korean national—an artist and scholar actively based in South Korea, rather than an overseas resident or defector—has been invited to the Main Stage. Furthermore, it marks the return of a Korean speaker to the main stage after a 12-year hiatus, serving as a symbolic milestone.
The TED 2026 annual conference is being held from April 13 to 17 at the Vancouver Convention Centre in Canada, under the theme "ALL OF US." Professor Kang took the Main Stage on April 15, the third day of the conference, to present visual insights and philosophical solutions for a future where Artificial Intelligence (AI), humans, and nature must coexist. The lecture video will be edited and released globally via the official TED website and YouTube channel this coming July.
In this talk, Professor Kang defines AI and the climate crisis as "problems we understand intellectually but fail to feel physically," noting that data- and information-centric communication methods often lower our sense of reality. She proposes the potential of art as a means to bridge this gap. Specifically, Professor Kang will demonstrate on stage how to transform complex challenges into visual and sensory experiences through cases from her own projects.
Notably, this presentation transcends traditional lecture formats, structured as an "Immersive Talk" that transforms the entire stage into an artistic space. Rather than just listening, the audience participates by experiencing the content with their entire bodies.
Professor Yiyun Kang is a world-class media artist and researcher who crosses the boundaries between sensation and technology, and materiality (physical forms) and immateriality (elements like light, video, and data). She leads the Experience Design Lab (XD Lab) at KAIST and has consistently explored the convergence of technology and art through collaborations with NASA, Google Arts & Culture, and the Victoria and Albert Museum (V&A).
"Humanity is currently at a critical turning point that will determine the coexistence of technology and nature," Professor Kang stated. "Through this TED stage, I aim to ensure that AI and the climate crisis are perceived not just as mere information, but as realities of our lives. I hope to create a practical opportunity to expand fragmented individual perceptions into collective human solidarity through the creative energy of art."
< TED 2026 Professor Yiyun Kang (Source: TED Website) >
Breakthrough in Data Processing via Light Control... Enhancing AI Accelerators and Quantum Communication
< (From left) Undergraduate researcher Taewon Kim and Professor Sangsik Kim >
A new technology has been developed that allows light to be "designed" into desired forms, potentially making Artificial Intelligence (AI) and communication technologies faster and more accurate. A KAIST research team has developed an "integrated photonic resonator"—a core component of next-generation optical integrated circuits that process data using light. The research is particularly significant as it was led by an undergraduate student. This technology is expected to serve as a key foundation for next-generation security technologies such as high-speed data processing and quantum communication.
KAIST announced on the 15th that a research team led by Professor Sangsik Kim from the School of Electrical Engineering, in collaboration with Professor Jae Woong Yoon’s team from the Department of Physics at Hanyang University (President Kigeong Lee), has developed a new integrated photonic resonator structure capable of freely controlling optical signals by utilizing light interference (the phenomenon where two light waves meet and influence each other).
Photonic Integrated Circuits (PICs) process data at ultra-high speeds and with low power consumption using light. They are garnering significant attention as a fundamental platform technology for next-generation fields such as AI, data centers, and quantum information processing.
The core of this technology lies in the precision with which light can be controlled. Specifically, the ability to freely adjust the spectrum (color or wavelength distribution) and phase response (timing or wave position) of optical signals is essential for implementing high-performance optical communication and computing. However, conventional methods have faced fundamental limitations.
The integrated photonic resonator (optical resonator) focused on by the research team is a key optical device that traps light in a specific space to amplify it or select specific colors (wavelengths), similar to how the body of a musical instrument amplifies sound. However, existing single-bus resonators have had limitations in precisely adjusting the phase and spectrum of optical signals.
To overcome these challenges, the research team introduced a "dual-bus" structure. This design allows light that has passed through the resonator to recombine with light that has not, enabling precise control over interference. This allows for the free design of optical signals into desired forms, making it possible to control various types of light signals that were previously difficult to implement.
By applying this technology, the research team secured new characteristics for more precise control of wavelength properties and presented new possibilities for non-linear frequency conversion research (changing the color of light). Utilizing this technology enables faster and more accurate data processing, which is expected to provide the groundwork for performance enhancements in future high-speed data centers, AI accelerators, and quantum communication systems.
This research is especially meaningful as it was led by an undergraduate student. Taewon Kim, an undergraduate student who conducted the study through the KAIST Undergraduate Research Program (URP), stated, "I was able to develop the resonator principles I learned in the Introduction to Integrated Optics class into actual device designs and a published paper."
< Research Image of the Dual-bus Resonator >
Professor Sangsik Kim remarked, "This study goes beyond proposing a new device; it demonstrates that by precisely analyzing previously overlooked optical characteristics, physical limitations can be overcome. We expect this to contribute broadly to the development of optics-based AI accelerators and optical communication technologies."
KAIST undergraduate student Taewon Kim participated as the lead author of this study, and the results were published on March 6th in the international optics journal, Laser & Photonics Reviews.
Paper Title: Dual-bus resonator for multi-port spectral engineering DOI: 10.1002/lpor.202502935 Authors: Taewon Kim, Mehedi Hasan, Yu Sung Choi, Jae Woong Yoon, and Sangsik Kim
This research was supported by the KAIST URP Program, the Institute of Information & Communications Technology Planning & Evaluation (IITP), the U.S. Asian Office of Aerospace Research and Development (AOARD), and the National Research Foundation of Korea (NRF).
AI, Humanoid Robots, and Space Rovers to Gather: Experience Future Technologies at the Science Festival
<(From left) Photos of the KAIST Science Festival exhibition hall and booths from the previous year>
KAIST announced on April 10th that KAIST will participate in the ‘2026 Korea Science and Technology Festival,’ the largest science festival in the country, to mark Science Month in April. KAIST will operate ‘KAIST Play World,’ an interactive exhibition hall showcasing the pinnacle of AI and robotics. This year’s festival will be held in two parts: ‘2026 Korea Science Festival in Daejeon (April 17–19)’ and ‘2026 Korea Science Festival in Gyeonggi (April 24–26).’ KAIST will host consecutive exhibitions at the Daejeon DCC (Second Exhibition Hall) and KINTEX in Ilsan. Under the ‘Play World’ concept, KAIST plans to offer differentiated interactive content tailored to various generations. In particular, on-site events and souvenirs featuring the KAIST character ‘Nupjuk-i’ will be provided to enhance visitor engagement.
□ [Daejeon] From Humanoid Robots to Space Rovers and AI Semiconductor Friend ‘BROCA’ The exhibition at Daejeon DCC from April 17 to 19 will feature ‘Future Tech Experience Content’ centered on advanced robotics, space technology, and AI semiconductor technology, allowing visitors to experience KAIST's core research achievements firsthand. First, a humanoid robot equipped with control technology developed by Eurobotics Co., Ltd., a startup from Professor Myung Hyun’s research team in the School of Electrical Engineering, will be unveiled on the 17th. This robot is gaining attention as a next-generation platform capable of natural walking in both industrial and urban environments. Additionally, on the 19th, a humanoid robot from Professor Park Hae-won’s team in the Department of Mechanical Engineering will demonstrate high-difficulty human movements such as the duck walk and moonwalk, showcasing its potential for practical industrial use. Professor Lee Dae-young’s team in the Department of Aerospace Engineering will present the world’s first deployable lunar rover wheel based on origami technology. Visitors can touch the transformable wheel model and observe space rover demonstrations and displays by the co-developer, Unmanned Exploration Laboratory (UEL). Educational sessions for folding various space systems using origami will also be available. Along with this, visitors can experience advanced human-machine interaction through ‘BROCA,’ a mobile social AI agent that builds relationships with users beyond simple Q&A, and the voice-capable guide robot ‘On-Newro,’ developed by Professor Yoo Hoi-jun’s team at the AI Semiconductor Graduate School. The student startup ‘Liar Games’ will operate a trial zone for ‘Dual Focus,’ an abstract strategy board game where players compete 1:1 against AI. Similar to the deep strategic play of chess or Go, the rules are intuitive enough to learn in 5 minutes, which is expected to stimulate the challenge-seeking spirit of visitors.
< (Top row from left) Professor Park Hae-won’s humanoid robot, Professor Yoo Hoi-jun’s BROCA, (Bottom row from left) Eurobotics’ humanoid walking technology capable of overcoming any terrain based on a mobile kit, Professor Lee Dae-young’s storable and deployable rover for lunar exploration >
□ [Gyeonggi] ‘Raibo’ the Rough-Terrain Robot and AI-Based Future Experiences The Gyeonggi exhibition at KINTEX from April 24 to 26 will focus on ‘Life-Oriented Experience Content’ centered on AI and everyday technology. ‘Raibo,’ a quadrupedal robot developed by Professor Hwangbo Jemin’s team in the Department of Mechanical Engineering, is capable of high-speed movement on complex terrains such as sand, stairs, and debris, and is expected to be utilized for disaster relief and search missions. Visitors can experience Raibo’s driving technology directly at the site. The ‘Future Memories Studio’ from Professor Nam Tek-jin’s team in the Department of Industrial Design will provide a new experience where visitors can meet and talk to their future selves 10 years later, recreated using AI-generated visuals and voices. Participants will receive a four-cut photo capturing a moment that is the future for their current self but a memory for their future self. Professor Yun Yun-jin’s team at the KAIST Urban AI Research Center will present technology that analyzes the impact of climate change on small business sales through ‘AI-based Sight and Sound for Heatwave Consumption Index.’ They will showcase time-series AI-based sales prediction technology and generative AI technology that expresses this visually and audibly. Furthermore, Professor Yun’s lecture, “City Walk of Artificial Intelligence: Urban AI and the Future of Cities,” will be held on April 24 (Fri) at 15:00 in KINTEX Meeting Room 206. In addition, Professor Yoo Hoi-jun’s team from the AI Semiconductor Graduate School will continue from the Daejeon exhibition to operate an experience zone for various mobile AI agents based on AI semiconductors. Also, the student startup Rabbithole Company will introduce a new type of game where AI NPCs (Non-Player Characters) converse and cooperate to solve given problems. Visitors can participate by observing the process where AI characters create their own stories by being presented with situations or goals instead of being directly controlled.
< (Top row from left) Professor Hwangbo Jemin’s Raibo, Professor Nam Tek-jin’s team: Met My Future Self 10 Years Later, (Bottom row from left) Professor Yun Yun-jin’s Seeing and Hearing Heatwave Consumption Index through AI, Game image from CEO Kim Na-hoon’s Rabbithole Company >
Through the exhibitions in both regions, KAIST plans to operate various participatory programs to make science and technology easy and fun to approach, vividly conveying how technology from the laboratory transforms our lives. KAIST President Lee Kwang-hyung remarked, “This year’s science festival is a large-scale event connecting Daejeon and Gyeonggi, allowing more citizens to experience KAIST’s innovative research achievements firsthand.” He added, “I hope this will be a precious time for people to experience the future created by robots and AI, fostering their dreams and curiosity about science.”