Professor Insik Shin Becomes First Korean to Win the RTSS Most Influential Paper Award
< KAIST Professor Insik Shin >
KAIST announced on December 21st that Professor Insik Shin from the School of Computing has received the Influential Paper Award 2025 at the IEEE Real-Time Systems Symposium (RTSS), the world's most prestigious international conference in the field of real-time systems.
This honor is a "Test of Time Award," presented to papers that have exerted a sustained and significant influence on both academia and industry for more than 10 years after publication. This marks the first time a Korean researcher has received this prestigious award. The ceremony took place at IEEE RTSS 2025 in Boston, USA, on December 4th (local time).
Professor Shin’s award-winning research is the "Periodic Resource Model," co published in 2003 with Professor Insup Lee of the University of Pennsylvania. Rather than trying to verify a complex machine or system all at once, this study developed a method to verify individual components—much like LEGO blocks—to ensure each meets its designated timing requirements. It mathematically guarantees that when these components are assembled, the entire system will operate safely.
Paper Title: Periodic Resource Model for Compositional Real-Time Guarantees
DOI: 10.1109/REAL.2003.1253249
Thanks to this research, it has become possible to design real-time systems that cannot tolerate even a moment of delay—such as autonomous vehicles, aircraft, and industrial robots—with greater precision and safety. This breakthrough overcame the limitations of existing methods that required analyzing an entire system at once, which had become nearly impossible as the complexity of modern real-time systems increased rapidly.
Professor Shin presented a method to divide a system into small modules, verify that each module satisfies its time constraints, and mathematically prove that the safety of the entire system is guaranteed upon integration. This work is credited with establishing the foundation for modern compositional real-time scheduling theory.
At the time of its initial publication in 2003, this paper won the 'Best Paper Award' at RTSS—another first for a Korean researcher. Now, 20 years later, its academic and industrial value has been officially recognized once again. This is because the theory has transcended academic boundaries to become a core analytical tool in various safety-critical industries, including autonomous driving, aerospace control, and industrial automation.
The IEEE Technical Committee stated, "This model has established itself as a core language for modern real-time system design and has guided the direction of research and industry for the past 20 years." The paper is currently featured in textbooks at major universities in the United States and Europe, serving as a standard theory in the field.
"As a scholar, this is the award I have wanted most in my life," said Professor Shin. "I am honored to have it recognized that research from 20 years ago has truly had a major impact on the world. This was made possible by the many researchers and companies who applied this theory to actual systems."
Meanwhile, Professor Shin is expanding his research beyond real-time systems into the field of Artificial Intelligence (AI). He founded the faculty-led startup Fluiz and developed FluidGPT, a mobile AI agent technology that allows users to execute smartphone apps via voice commands. This technology recently won the AI Champion Competition hosted by the Ministry of Science and ICT. Experts evaluate Professor Shin as achieving rare success by bridging basic theory and applied technology, effectively linking research to industry.
AI Technology World No. 1 in Finding the Exact Moment in a Video: Where is the First Place?
< (From left) Professor Joon Hyuk Noh (Assistant Professor, Department of Artificial Intelligence, Ewha Womans University), Seojin Hwan, Yoonki Cho (Ph.D. Candidate), Professor Sung-Eui Yoon (School of Computing, KAIST) >
When faced with a complex question like 'What object disappeared while the camera was pointing elsewhere?', a common problem is that AI often relies on language patterns to guess a 'plausible answer,' instead of actually observing the real situation in the video. To overcome this limitation, our university's research team developed a technology that enables the AI to autonomously identify the 'exact critical moment (Trigger moment)' within the video, and the team’s excellence was proven by winning an international AI competition with this technology. The university announced on the 28th that the research team led by Professor Sung-Eui Yoon from the School of Computing, in collaboration with Professor Joon Hyuk Noh's team from Ewha Womans University, took 1st place in the Grounded Video Question Answering track of the Perception Test Challenge held at ICCV 2025, a world-renowned computer vision conference. The Perception Test Challenge held at ICCV 2025 was organized by Google DeepMind with a total prize pool of 50,000 Euros (approximately 83 million KRW). It assesses the cognitive and reasoning abilities of multimodal AI, which must comprehensively understand various data, including video, audio, and text. Crucially, the core evaluation factor is the ability to make judgments based on actual video evidence, moving beyond language-centric bias. Unlike conventional methods that analyze the entire video indiscriminately, our university's research team developed a new technology that instructs the AI to first locate the core scene (Trigger moment) essential for finding the correct answer. Simply put, this technology is designed to make the AI autonomously determine: “This scene is decisive for answering this question!” The research team calls this framework CORTEX (Chain-of-Reasoning for Trigger Moment Extraction). The research team's system consists of a three-stage structure where three models performing different functions operate sequentially. First, the Reasoning AI (Gemini 2.5 Pro) reasons about which moment is required to answer the question and finds candidate Trigger moments. Next, the Object Location Finding Model (Grounding Model, Molmo-7B) accurately identifies the exact location (coordinates) of people, cars, and objects on the screen during the selected moment. Finally, the Tracking Model (SAM2) precisely tracks the movement of objects in the time frame before and after the selected scene, using that scene as a reference, thereby reducing errors. In short, the 'method of accurately pinpointing a key scene and tracking the evidence for the answer centered on that scene' significantly reduced problems like initial misjudgment or occlusion in the video. In the Grounded Video Question Answering (Grounded VideoQA) track, which saw 23 participating teams, the KAIST team SGVR Lab (Scalable Graphics, Vision & Robotics Lab) recorded 0.4968 points in the HOTA (Higher Order Tracking Accuracy) metric, overwhelmingly surpassing the 2nd place score of 0.4304 from Columbia University, USA, to secure 1st place. This achievement is nearly double the previous year's winning score of 0.2704 points. This technology has wide-ranging applications in real-life settings. Autonomous driving vehicles can accurately identify moments of potential accident risk, robots can understand the surrounding environment smarter, security and surveillance systems can rapidly locate critical scenes, and media analysis can precisely track the actions of people or objects in chronological order. This is a core technology that enables AI to judge based on "actual evidence in the video." The ability to accurately pinpoint how objects behave over time in a video is expected to greatly expand the application of AI in real-world scenarios in the future.
< Pipeline image of the grounding framework for video question answering proposed by the research team >
This research was presented on October 19th at ICCV 2025, the 3rd Perception Test Challenge conference. The achievement was supported by the Ministry of Science and ICT's Basic Research Program (Mid-Career Researcher), the SW Star Lab Project's 'Development of Perception, Action, and Interaction Algorithms for Open-World Robot Services,' and the AGI Project's 'Reality Construction and Bi-directional Capability Approach based on Cognitive Agents for Embodied AGI' tasks."
Professor Youngjin Kwon's Team Wins Google Award 'Catches Bugs Without a Real CPU
< Professor Youngjin Kwon >
Modern CPUs have complex structures, and in the process of handling multiple tasks simultaneously, an order-scrambling error known as a 'concurrency bug' can occur. Although this can lead to security issues, these bugs were extremely difficult to detect using conventional methods. Our university's research team has developed a world-first-level technology to automatically detect these bugs by precisely reproducing the internal operation of the CPU in a virtual environment without needing a physical chip. Through this, they successfully found and fixed 11 new bugs in the latest Linux kernel.
Our university announced on the 21st that the research team led by Professor Youngjin Kwon of the School of Computing has won the 'Research Scholar Award' (Systems category) presented by Google.
The Google Research Scholar Award is a global research support program, implemented since 2020, to support Early-Career Professors conducting innovative research in various fields such as AI, Systems, Security, and Data Management.
It is known as a highly competitive program, with the selection process conducted directly by Google Research scientists, and only a tiny fraction of the hundreds of applicants worldwide are chosen. In particular, this award is recognized as one of the most prestigious industry research support programs globally in the field of AI and Computer Systems, and domestic recipients are rare.
■ Technology Developed to Detect Concurrency Bugs in the Latest Apple M3 and ARM Servers
Professor Kwon's team developed a technology that automatically detects concurrency bugs in the latest ARM (a CPU design method that uses less power and is highly efficient) based servers, such as the Apple M3 (Apple's latest-generation computer processor chip).
A concurrency bug is an error that occurs when the order of operations gets mixed up while the CPU handles multiple tasks simultaneously. This is a severe security vulnerability that can cause the computer to suddenly freeze or become a pathway for hackers to attack the system. However, these errors were extremely difficult to find with existing testing methods alone.
■ Automatically Detects Bugs by Reproducing CPU Internal Operations Without a Real CPU
The core achievement of Professor Kwon's team is the 'technology to reproduce the internal operation of the CPU exactly in a virtual environment without a physical chip.' Using this technology, it is possible to precisely analyze the order in which instructions are executed and where problems occur using only software, without having to disassemble the CPU or use the actual chip.
By running the Linux operating system based on this system to automatically detect bugs, the research team discovered 11 new bugs in the latest Linux kernel* and reported them to the developer community, where they were all fixed.
*Linux kernel: The core operating system engine that forms the basis of servers, supercomputers, and smartphones (Android) worldwide. It acts as the 'heart' of the system, managing the CPU, memory, and storage devices.
Google recognized this technology as 'very important for its own infrastructure' and conferred the Award.
< Google Scholar Award Recipient Page >
This technology is evaluated to have general applicability, not only to Linux but also to various operating systems such as Android and Windows. The research team has released the software as open-source (GitHub) so that anyone in academia or industry can utilize it.
Professor Youngjin Kwon stated, "This award validates the international competitiveness of KAIST's systems research," and "We will continue our research to establish a safe and highly reliable computing environment."
※ Google Scholar Award Recipient Page: https://research.google/programs-and-events/research-scholar-program/recipients/ GitHub (Technology Open-Source): https://github.com/casys-kaist/ozz
Automatic C to Rust Translation Technology Gains Global Attention for Accuracy Beyond AI
<(From Left) Professor Sukyoung Ryu, Researcher Jaemin Hong>
As the C language, which forms the basis of critical global software like operating systems, faces security limitations, KAIST's research team is pioneering core original technology research for the accurate automatic conversion to Rust to replace it. By proving the mathematical correctness of the conversion, a limitation of existing Artificial Intelligence (LLM) methods, and solving C language security issues through automatic conversion to Rust, they presented a new direction and vision for future software security research. This work has been selected as the cover story for CACM, the world's highest-authority academic journal, thereby demonstrating KAIST's global research leadership in the field of computer science.
KAIST announced on the 9th of November that the paper by Professor Sukyoung Ryu's research team (Programming Language Research Group) from the School of Computing was selected as the cover story for the November issue of CACM (Communications of the ACM), the highest authority academic journal published by ACM (Association for Computing Machinery), the world's largest computer society.
<Photo of the Paper Selected for the Cover of Communications of the ACM>
This paper comprehensively addresses the technology developed by Professor Sukyoung Ryu's research team for the automatic conversion of C language to Rust, and it received high acclaim from the international research community for presenting the technical vision and academic direction this research should pursue in the future.
The C language has been widely used in the industry since the 1970s, but its structural limitations have continuously caused severe bugs and security vulnerabilities. Rust, on the other hand, is a secure programming language developed since 2015, used in the development of operating systems and web browsers, and has the characteristic of being able to detect and prevent bugs before program execution.
The US White House recommended discontinuing the use of C language in a technology report released in February 2024, and the Defense Advanced Research Projects Agency (DARPA) also explicitly stated that Rust is the core alternative for resolving C language security issues by promoting a project to develop technology for the automatic conversion of C code to Rust.
Professor Sukyoung Ryu's research team proactively raised the issues of C language safety and the importance of automatic conversion even before these movements began in earnest, and they have continuously developed core related technologies.
In May 2023, the research team presented the Mutex conversion technology (necessary for program synchronization) at ICSE (International Conference on Software Eng), the top authority conference in software engineering. In June 2024, they presented the Output Parameter conversion technology (used for result delivery) at PLDI (Programming Language Design and Implementation), the top conference in programming languages, and in October of the same year, they presented the Union conversion technology (for storing diverse data together) at ASE (Automated Software Eng), the representative conference in software automation.
These three studies are all "world-first" achievements presented at top-tier international academic conferences, successfully implementing automatic conversion technology for each feature with high completeness.
Since 2023, the research team has consistently published papers in CACM every year, establishing themselves as global leading researchers who consistently solve important and challenging problems worldwide.
This paper was published in CACM (Communications of the ACM) on October 24, with Dr. Jaemin Hong (Postdoctoral Research Fellow at KAIST Information and Electronics Research Institute) as the first author. ※Paper Title: Automatically Translating C to Rust, DOI: https://doi.org/10.1145/3737696
Dr. Jaemin Hong stated, "The conversion technology we developed is an original technology based on programming language theory, and its biggest strength is that we can logically prove the 'correctness' of the conversion." He added, "While most research relies on Large Language Models (LLMs), our technology can mathematically guarantee the correctness of the conversion."
Dr. Hong is scheduled to be appointed as an Assistant Professor in the Computer Science Department at UNIST starting in March 2025.
Furthermore, Professor Ryu's research team has four papers accepted for presentation at ASE 2025, the highest-authority conference in software engineering, including C→Rust conversion technology.
These papers, in addition to automatic conversion technology, cover various cutting-edge software engineering fields and are receiving high international acclaim. They include: technology to verify whether quantum computer programs operate correctly, 'WEST' technology that automatically checks the correctness of WebAssembly programs (technology for fast and efficient program execution on the web) and creates tests for them, and technology that automatically simplifies complex WebAssembly code to quickly find errors. Among these, the WEST paper received the Distinguished Paper Award.
This research was supported by the Leading Research Center/Mid-career Researcher Support Program of the National Research Foundation of Korea, the Institute of Information & Communications Technology Planning & Evaluation (IITP), and Samsung Electronics.
3D Worlds from Just a Few Phone Photos
<(From Left) Ph.D candidate Jumin Lee, Ph.D candidate Woo Jae Kim, Ph.D candidate Youngju Na, Ph.D candidate Kyu Beom Han, Professor Sung-eui Yoon>
Existing 3D scene reconstructions require a cumbersome process of precisely measuring physical spaces with LiDAR or 3D scanners, or correcting thousands of photos along with camera pose information. The research team at KAIST has overcome these limitations and introduced a technology enabling the reconstruction of 3D —from tabletop objects to outdoor scenes—with just two to three ordinary photographs. The breakthrough suggests a new paradigm in which spaces captured by camera can be immediately transformed into virtual environments.
KAIST announced on November 6 that the research team led by Professor Sung-Eui Yoon from the School of Computing has developed a new technology called SHARE (Shape-Ray Estimation), which can reconstruct high-quality 3D scenes using only ordinary images, without precise camera pose information.
Existing 3D reconstruction technology has been limited by the requirement of precise camera position and orientation information at the time of shooting to reproduce 3D scenes from a small number of images. This has necessitated specialized equipment or complex calibration processes, making real-world applications difficult and slowing widespread adoption.
To solve these problems, the research team developed a technology that constructs accurate 3D models by simultaneously estimating the 3D scene and the camera orientation using just two to three standard photographs. The technology has been recognized for its high efficiency and versatility, enabling rapid and precise reconstruction in real-world environments without additional training or complex calibration processes.
While existing methods calculate 3D structures from known camera poses, SHARE autonomously extracts spatial information from images themselves and infers both camera pose and scene structure. This enables stable 3D reconstruction without shape distortion by aligning multiple images taken from different positions into a single unified space.
<Representative Image of SHARE Technology>
"The SHARE technology is a breakthrough that dramatically lowers the barrier to entry for 3D reconstruction,” said Professor Sung-Eui Yoon. “It will enable the creation of high-quality content in various industries such as construction, media, and gaming using only a smartphone camera. It also has diverse application possibilities, such as building low-cost simulation environments in the fields of robotics and autonomous driving."
<SHARE Technology, Precise Camera Information and 3D Scene Prediction Technology>
Ph.D. Candidate Youngju Na and M.S candidate Taeyeon Kim participated as co-first authors on the research. The results were presented on September 17th at the IEEE International Conference on Image Processing (ICIP 2025), where the paper received the Best Student Paper Award.
The award, given to only one paper among 643 accepted papers this year—a selection rate of 0.16 percent—once again underscores the excellent research capabilities of the KAIST research team.
Paper Title: Pose-free 3D Gaussian Splatting via Shape-Ray Estimation, DOI: https://arxiv.org/abs/2505.22978
Award Information: https://www.linkedin.com/posts/ieeeicip_congratulations-to-the-icip-2025-best-activity-7374146976449335297-6hXz
This achievement was carried out with support from the Ministry of Science and ICT's SW Star Lab Project under the task 'Development of Perception, Action, and Interaction Algorithms for Unspecified Environments for Open World Robot Services.'
KAIST's 'FluidGPT' Wins Grand Prize at the 2025 AI Champion Competition
<Commemorative Photo After Winning at the 2025 AI Champions Award Ceremony>
The era has begun where an AI assistant goes beyond simple conversation to directly view the screen, make decisions, and complete tasks such as hailing a taxi or booking an SRT ticket.
KAIST (President Kwang Hyung Lee) announced on the 6th that the AutoPhone Team (Fluidez, KAIST, Korea University, Sungkyunkwan University), led by Professor Insik Shin (CEO of Fluidez Co., Ltd.) of the School of Computing, was selected as the inaugural AI Champion (1st place) in the '2025 Artificial Intelligence Champion (AI Champion) Competition,' hosted by the Ministry of Science and ICT.
This competition is the nation's largest AI technology contest, comprehensively evaluating the innovativeness, social impact, and commercial potential of AI technology. With 630 teams participating nationwide, the AutoPhone Team claimed the top honor and will receive 3 billion Korean won in research and development funding.
The technology developed by the AutoPhone Team, 'FluidGPT,' is a fully autonomous AI agent that understands a user's voice command and enables the smartphone to independently run apps, click, input, and even complete payments.
For example, when a user says, "Book an SRT ticket from Seoul Station to Busan," or "Call a taxi," FluidGPT opens the actual app and sequentially performs the necessary steps to complete the request.
The core of this technology is its 'Non-Invasive (API-Free)' structure. Previously, calling a taxi using an app required directly connecting to the app's internal system (API communication) through the taxi app's API. In contrast, this technology does not modify the existing app's code or link an API. Instead, the AI directly recognizes and operates the screen (UI), acquiring the ability to use the smartphone just like a human.
As a result, FluidGPT presents a new paradigm—"AI that sees, judges, and moves a hand on behalf of a person"—and is evaluated as a core technology that will usher in the 'AI Phone Era.'
FluidGPT moves beyond simple voice assistance to implement the concept of 'Agentic AI' (Action-Oriented Artificial Intelligence), where the AI directly views the screen, makes decisions, and takes action. As a fully action-oriented system, the AI clicks app buttons, fills in input fields, and references data to autonomously achieve the user's objective, foreshadowing an innovation in how smartphones are used.
Professor In-sik Shin of the School of Computing shared his thoughts, stating, "AI is now evolving from conversation to action. FluidGPT is a technology that understands the user's words and autonomously executes actual apps, and it will be the starting point of the 'AI Phone Era.' The AutoPhone Team possesses world-class research capabilities, and we will contribute to the widespread adoption of AI services that everyone can easily use."
KAIST President Kwang Hyung Lee remarked, "This achievement is a representative example that demonstrates KAIST's vision for AI convergence," adding, "AI technology is entering the daily lives of citizens and leading a new wave of innovation." He further added, "KAIST will continue to lead research in future core technologies such as AI and semiconductors to bolster national competitiveness."
KAIST, Dancing Like 'Navillera'... AI Understands and Renders Garment Motions of Avatars
<(From Left)Ph.D candidate Jihyun Lee, Professor Tae-Kyun Kim, M.S candidate Changmin Lee>
The era has begun where AI moves beyond merely 'plausibly drawing' to understanding even why clothes flutter and wrinkles form. A KAIST research team has developed a new generative AI that learns movement and interaction in 3D space following physical laws. This technology, which overcomes the limitations of existing 2D-based video AI, is expected to enhance the realism of avatars in films, the metaverse, and games, and significantly reduce the need for motion capture or manual 3D graphics work.
KAIST (President Kwang Hyung Lee) announced on the 22nd that the research team of Professor Tae-Kyun (T-K) Kim from the School of Computing has developed 'MPMAvatar,' a spatial and physics-based generative AI model that overcomes the limitations of existing 2D pixel-based video generation technology.
To solve the problems of conventional 2D technology, the research team proposed a new method that reconstructs multi-view images into 3D space using Gaussian Splatting and combines it with the Material Point Method (MPM), a physics simulation technique.
In other words, the AI was trained to learn physical laws on its own by stereoscopically reconstructing videos taken from multiple viewpoints and allowing objects within that space to move and interact as if they were in real physical world.
This enables the AI to compute the movement based on objects' material, shape, and external forces, and then learn the physical laws by comparing the results with actual videos.
The research team represented the 3D space using point-units, and by applying both Gaussian and MPM to each point, they simultaneously achieved physically natural movement and realistic video rendering.
That is, they divided the 3D space into numerous small points, making each point move and deform like a real object, thereby realizing natural video that is nearly indistinguishable from reality.
In particular, to precisely express the interaction of thin and complex objects like clothing, they calculated both the object's surface (mesh) and its particle-unit structure (point), and utilized the Material Point Method (MPM), which calculates the object's movement and deformation in 3D space according to physical laws.
Furthermore, they developed a new collision handling technology to realistically reproduce scenes where clothes or objects move and collide with each other in multiple spots and complex manner.
The generative AI model MPMAvatar, to which this technology is applied, successfully reproduced the realistic movement and interaction of a person wearing loose clothing, and also succeeded in 'Zero-shot' generation, where the AI processes data it has never seen during the learning process by inferring on its own.
<Figure 1. Modeling new human poses and clothing dynamics from multi-view video input, and zero-shot generation of novel physical interactions.>
The proposed method is applicable to various physical properties, such as rigid bodies, deformable objects, and fluids, allowing it to be used not only for avatars but also for the generation of general complex scenes.
<“Figure 2. Depiction of graceful dance movements and soft clothing folds, like Navillera.>
Professor Tae-Kyun (T-K) Kim explained, "This technology goes beyond AI simply drawing a picture; it makes the AI understand 'why' the world in front of it looks the way it does. This research demonstrates the potential of 'Physical AI' that understands and predicts physical laws, marking an important turning point toward AGI (Artificial General Intelligence)." He added, "It is expected to be practically applied across the broaden immersive content industry, including virtual production, films, short-form contents, and adverts, creating significant change."
The research team is currently expanding this technology to develop a model that can generate physically consistent 3D videos simply from a user's text input.
This research involved Changmin Lee, a Master's student at the KAIST Graduate School of AI, as the first author, and Jihyun Lee, a Ph.D. student at the KAIST School of Computing, as a co-author. The research results will be presented at NeurIPS, the most prestigious international academic conference in the field of AI, on December 2nd, and the program code is to be fully released.
· Paper: C. Lee, J. Lee, T-K. Kim, MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics, Proc. of Thirty-Ninth Annual Conf. on Neural Information Processing Systems (NeurIPS), San Diego, US, 2025
· arXiv version: https://arxiv.org/abs/2510.01619
· Related Project Site: https://kaistchangmin.github.io/MPMAvatar/
· Related video links showing the 'Navillera'-like dancing drawn by AI:
o https://www.youtube.com/shorts/ZE2KoRvUF5c
o https://youtu.be/ytrKDNqACqM
This work was supported by the Institute of Information & Communications Technology Planning & Evaluation(IITP) grant funded by the Korea government(MSIT) through the Human-Oriented Next-Generation Challenging AGI Technology Project (RS-2025-25443318) and the Professional AI Talent Development Program for Multimodal AI Agents (RS-2025-25441313).
Refrigerator Use Increases with Stress, IoT Sensors Read Mental Health
<(From Left) Ph.D candidate Chanhee Lee, Professor Uichin Lee, Professor Hyunsoo Lee, Ph.D candidate Youngji Koh from School of Computing>
The number of single-person households in South Korea has exceeded 8 million, accounting for 36% of the total, marking an all-time high. A Seoul Metropolitan Government survey found that 62% of single-person households experience 'loneliness', deepening feelings of isolation and mental health issues. KAIST researchers have gone beyond the limitations of smartphones and wearables, utilizing in-home IoT data to reveal that a disruption in daily rhythm is a key indicator of worsening mental health. This research is expected to lay the foundation for developing personalized mental healthcare management systems.
KAIST (President Kwang Hyung Lee) announced on the 21st of October that a research team led by Professor Uichin Lee from the School of Computing has demonstrated the possibility of accurately tracking an individual's mental health status using in-home Internet of Things (IoT) sensor data.
Consistent self-monitoring is important for mental health management, but existing smartphone- or wearable-based tracking methods have the limitation of data loss when the user is not wearing or carrying the device inside the home.
The research team therefore focused on in-home environmental data. A 4-week pilot study was conducted on 20 young single-person households, installing appliances, sleep mats, motion sensors, and other devices to collect IoT data, which was then analyzed along with smartphone and wearable data.
The results confirmed that utilizing IoT data alongside existing methods allows for a significantly more accurate capture of changes in mental health. For instance, reduced sleep time was closely linked to increased levels of depression, anxiety, and stress, and increased indoor temperature also showed a correlation with anxiety and depression.
<Picture1. Heatmap of the Correlation Between Each User’s Mental Health Status and Sensor Data>
Participants' behavioral patterns varied, including a 'binge-eating type' with increased refrigerator use during stress and a 'lethargic type' with a sharp decrease in activity. However, a common trend clearly emerged: mental health deteriorated as daily routines became more irregular.
Variability in daily patterns was confirmed to be a more important factor than the frequency of specific behaviors, suggesting that a regular routine is essential for maintaining mental health.
When research participants viewed their life data through visualization software, they generally perceived the data as being genuinely helpful in understanding their mental health, rather than expressing concern about privacy invasion. This significantly enhanced the research acceptance and satisfaction with participation.
<Figure 2. Comparison of Average Mental Health Status Between the High Irregularity Group (Red) and the Low Irregularity Group (Blue)>
Professor Uichin Lee stated, "This research demonstrates that in-home IoT data can serve as an important clue for understanding mental health within the context of an individual's daily life," and added, "We plan to further develop this into a remote healthcare system that can predict individual lifestyle patterns and provide personalized coaching using AI."
Youngji Koh, a Ph.D candidate, participated as the first author in this research. The findings were published in the September issue of the Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, a prominent international journal in the field of human-computer interaction (HCI). ※ Harnessing Home IoT for Self-tracking Emotional Wellbeing: Behavioral Patterns, Self-Reflection, and Privacy Concerns DOI: https://dl.acm.org/doi/10.1145/3749485 ※ Youngji Koh (KAIST, 1st author), Chanhee Lee (KAIST, 2nd author), Eunki Joung (KAIST, 3rd author), Hyunsoo Lee (KAIST, corresponding author), Uichin Lee (KAIST, corresponding author)
This research was conducted with support from the LG Electronics-KAIST Digital Healthcare Research Center and the National Research Foundation of Korea, funded by the government (Ministry of Science and ICT).
KAIST Develops an AI Semiconductor Brain Combining Transformer's Intelligence and Mamba's Efficiency
<(From Left) Ph.D candidate Seongryong Oh, Ph.D candidate Yoonsung Kim, Ph.D candidate Wonung Kim, Ph.D candidate Yubin Lee, M.S candidate Jiyong Jung, Professor Jongse Park, Professor Divya Mahajan, Professor Chang Hyun Park>
As recent Artificial Intelligence (AI) models’ capacity to understand and process long, complex sentences grows, the necessity for new semiconductor technologies that can simultaneously boost computation speed and memory efficiency is increasing. Amidst this, a joint research team featuring KAIST researchers and international collaborators has successfully developed a core AI semiconductor 'brain' technology based on a hybrid Transformer and Mamba structure, which was implemented for the first time in the world in a form capable of direct computation inside the memory, resulting in a four-fold increase in the inference speed of Large Language Models (LLMs) and a 2.2-fold reduction in power consumption.
KAIST (President Kwang Hyung Lee) announced on the 17th of October that the research team led by Professor Jongse Park from KAIST School of Computing, in collaboration with Georgia Institute of Technology in the United States and Uppsala University in Sweden, developed 'PIMBA,' a core technology based on 'AI Memory Semiconductor (PIM, Processing-in-Memory),' which acts as the brain for next-generation AI models.
Currently, LLMs such as ChatGPT, GPT-4, Claude, Gemini, and Llama operate based on the 'Transformer' brain structure, which sees all of the words simultaneously. Consequently, as the AI model grows and the processed sentences become longer, the computational load and memory requirements surge, leading to speed reductions and high energy consumption as major issues.
To overcome these problems with Transformer, the recently proposed sequential memory-based 'Mamba' structure introduced a method for processing information over time, increasing efficiency. However, memory bottlenecks and power consumption limits still remained.
Professor Park Jongse's research team designed 'PIMBA,' a new semiconductor structure that directly performs computations inside the memory in order to maximize the performance of the 'Transformer–Mamba Hybrid Model,' which combines the advantages of both Transformer and Mamba.
While existing GPU-based systems move data out of the memory to perform computations, PIMBA performs calculations directly within the storage device without moving the data. This minimizes data movement time and significantly reduces power consumption.
<Analysis of Post-Transformer Models and Proposal of a Problem-Solving Acceleration System>
As a result, PIMBA showed up to a 4.1-fold improvement in processing performance and an average 2.2-fold decrease in energy consumption compared to existing GPU systems.
The research outcome is scheduled to be presented on October 20th at the '58th International Symposium on Microarchitecture (MICRO 2025),' a globally renowned computer architecture conference that will be held in Seoul. It was previously recognized for its excellence by winning the Gold Prize at the '31st Samsung Humantech Paper Award.' ※Paper Title: Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving, DOI: 10.1145/3725843.3756121
This research was supported by the Institute for Information & Communications Technology Planning & Evaluation (IITP), the AI Semiconductor Graduate School Support Project, and the ICT R&D Program of the Ministry of Science and ICT and the IITP, with assistance from the Electronics and Telecommunications Research Institute (ETRI). The EDA tools were supported by IDEC (the IC Design Education Center).
KAIST Develops AI Crowd Prediction Technology to Prevent Disasters like the Itaewon Tragedy
<(From Left) Ph.D candidate Youngeun Nam from KAIST, Professor Jae-Gil Lee from KAIST, Ji-Hye Na from KAIST, (Top right, from left) Professor Soo-Sik Yoon from Korea University, Professor HwanJun Song from KAIST>
To prevent crowd crush incidents like the Itaewon tragedy, it's crucial to go beyond simply counting people and to instead have a technology that can detect the real-
inflow and movement patterns of crowds. A KAIST research team has successfully developed new AI crowd prediction technology that can be used not only for managing large-scale events and mitigating urban traffic congestion but also for responding to infectious disease outbreaks.
On the 17th, KAIST (President Kwang Hyung Lee) announced that a research team led by Professor Jae-Gil Lee from the School of Computing has developed a new AI technology that can more accurately predict crowd density.
The dynamics of crowd gathering cannot be explained by a simple increase or decrease in the number of people. Even with the same number of people, the level of risk changes depending on where they are coming from and which direction they are heading.
Professor Lee's team expressed this movement using the concept of a "time-varying graph." This means that accurate prediction is only possible by simultaneously analyzing two types of information: "node information" (how many people are in a specific area) and "edge information" (the flow of people between areas).
In contrast, most previous studies focused on only one of these factors, either concentrating on "how many people are gathered right now" or "which paths are people moving along." However, the research team emphasized that combining both is necessary to truly capture a dangerous situation.
For example, a sudden increase in density in a specific alleyway, such as Alley A, is difficult to predict with just "current population" data. But by also considering the flow of people continuously moving from a nearby area, Area B, towards Area A (edge information), it's possible to pre-emptively identify the signal that "Area A will soon become dangerous."
To achieve this, the team developed a "bi-modal learning" method. This technology simultaneously considers population counts (node information) and population flow (edge information), while also learning spatial relationships (which areas are connected) and temporal changes (when and how movement occurs).
Specifically, the team introduced a 3D contrastive learning technique. This allows the AI to learn not only 2D spatial (geographical) information but also temporal information, creating a 3D relationship. As a result, the AI can understand not just whether the population is "large or small right now," but "what pattern the crowd is developing into over time." This allows for a much more accurate prediction of the time and place where congestion will occur than previous methods.
<Figure 1. Workflow of the bi-modal learning-based crowd congestion risk prediction developed by the research team.
The research team developed a crowd congestion risk prediction model based on bi-modal learning. The vertex-based time series represents indicator changes in a specific area (e.g., increases or decreases in crowd density), while the edge-based time series captures the flow of population movement between areas over time. Although these two types of data are collected from different sources, they are mapped onto the same network structure and provided together as input to the AI model. During training, the model simultaneously leverages both vertex and edge information based on a shared network, allowing it to capture complex movement patterns that might be overlooked when relying on only a single type of data. For example, a sudden increase in crowd density in a particular area may be difficult to predict using vertex information alone, but by additionally considering the steady inflow of people from adjacent areas (edge information), the prediction becomes more accurate. In this way, the model can precisely identify future changes based on past and present information, ultimately predicting high-risk crowd congestion areas in advance.>
The research team built and publicly released six real-world datasets for their study, which were compiled from sources such as Seoul, Busan, and Daegu subway data, New York City transit data, and COVID-19 confirmed case data from South Korea and New York.
The proposed technology achieved up to a 76.1% improvement in prediction accuracy over recent state-of-the-art methods, demonstrating strong perf
Professor Jae-Gil Lee stated, "It is important to develop technologies that can have a significant social impact," adding, "I hope this technology will greatly contribute to protecting public safety in daily life, such as in crowd management for large events, easing urban traffic congestion, and curbing the spread of infectious diseases."
Youngeun Nam, a Ph.D candidate in the KAIST School of Computing, was the first author of the study, and Jihye Na, another Ph.D candidate, was a co-author. The research findings were presented at the Knowledge Discovery and Data Mining (KDD) 2025 conference, a top international conference in the field of data mining, this past August.
※ Paper Title: Bi-Modal Learning for Networked Time Series ※ DOI: https://doi.org/10.1145/3711896.3736856
This technology is the result of research projects including the "Mid-Career Researcher Project" (RS-2023-NR077002, Core Technology Research for Crowd Management Systems Based on AI and Mobility Big Data) and the "Human-Centered AI Core Technology Development Project" (RS-2022-II220157, Robust, Fair, and Scalable Data-Centric Continuous Learning).
Making Truly Smart AI Agents a Reality with the World's Best DB Integration Technology
<(From Left) Engineer Jeongho Park from GraphAI, Ph.D candidate Geonho Lee, Prof. Min-Soo Kim from KAIST>
For a long time, companies have been using relational databases (DB) to manage data. However, with the increasing use of large AI models, integration with graph databases is now required. This process, however, reveals limitations such as cost burden, data inconsistency, and the difficulty of processing complex queries.
Our research team has succeeded in developing a next-generation graph-relational DB system that can solve these problems at once, and it is expected to be applied to industrial sites immediately. When this technology is applied, AI will be able to reason about complex relationships in real time, going beyond simple searches, making it possible to implement a smarter AI service.
The research team led by Professor Min-Soo Kim announced on the 8th of September that the team has developed a new DB system named 'Chimera' that fully integrates relational DB and graph DB to efficiently execute graph-relational queries. Chimera has proven its world-class performance by processing queries at least 4 times and up to 280 times faster than existing systems in international performance standard benchmarks.
Unlike existing relational DBs, graph DBs have a structure that represents data as vertices (nodes) and edges (connections), which gives them a strong advantage in analyzing and reasoning about complexly intertwined information like people, events, places, and time. Thanks to this feature, its use is rapidly spreading in various fields such as AI agents, SNS, finance, and e-commerce.
With the growing demand for complex query processing between relational DBs and graph DBs, a new standard language, 'SQL/PGQ,' which extends relational query language (SQL) with graph query functions, has also been proposed.
SQL/PGQ is a new standard language that adds graph traversal capabilities to the existing database language (SQL) and is designed to query both table-like data and connected information such as people, events, and places at once. Using this, complex relationships such as 'which company does my friend's friend work for?' can be searched much more simply than before.
<Diagram (a): This diagram shows the typical architecture of a graph query processing system based on a traditional RDBMS. It has separate dedicated operators for graph traversal and an in-memory graph structure, while attribute joins are handled by relational operators. However, this structure makes it difficult to optimize execution plans for hybrid queries because traversal and joins are performed in different pipelines. Additionally, for large-scale graphs, the in-memory structure creates memory constraints, and the method of extracting graph data from relational data limits data freshness. Diagram (b): This diagram shows Chimera's integrated architecture. Chimera introduces new components to the existing RDBMS architecture: a traversal-join operator that combines graph traversal and joins, a disk-based graph storage, and a dedicated graph access layer. This allows it to process both graph and relational data within a single execution flow. Furthermore, a hybrid query planner integrally optimizes both graph and relational operations. Its shared transaction management and disk-based storage structure enable it to handle large-scale graph databases without memory constraints while maintaining data freshness. This architecture removes the bottlenecks of existing systems by flexibly combining traversal, joins, and mappings in a single execution plan, thereby simultaneously improving performance and scalability.>
The problem is that existing approaches have relied on either trying to mimic graph traversal with join operations or processing by pre-building a graph view in memory. In the former case, performance drops sharply as the traversal depth increases, and in the latter case, execution fails due to insufficient memory even if the data size increases slightly. Furthermore, changes to the original data are not immediately reflected in the view, resulting in poor data freshness and the inefficiency of having to combine relational and graph results separately.
KAIST research team's 'Chimera' fundamentally solves these limitations. The research team redesigned both the storage layer and the query processing layer of the database.
First, the research team introduced a 'dual-store structure' that operates a graph-specific storage and a relational data storage together. They then applied a 'traversal-join operator' that processes graph traversal and relational operations simultaneously, allowing complex operations to be executed efficiently in a single system. Thanks to this, Chimera has established itself as the world's first graph-relational DB system that integrates the entire process from data storage to query processing into one.
As a result, it recorded world-class performance on the international performance standard benchmark 'LDBC Social Network Benchmark (SNB),' being at least 4 times and up to 280 times faster than existing systems.
Query failure due to insufficient memory does not occur no matter how large the graph data becomes, and since it does not use views, there is no delay problem in terms of data freshness.
Professor Min-Soo Kim stated, "As the connections between data become more complex, the need for integrated technology that encompasses both graph and relational DBs is increasing. Chimera is a technology that fundamentally solves this problem, and we expect it to be widely used in various industries such as AI agents, finance, and e-commerce."
The study was co-authored by Geonho Lee, a Ph.D. student in KAIST School of Computing, as the first author, and Jeongho Park, an engineer at Professor Kim's startup GraphAI Co., Ltd., as the second author, with Professor Kim as the corresponding author.
The research results were presented on September 1st at VLDB, a world-renowned international academic conference in the field of databases. In particular, the newly developed Chimera technology is expected to have an immediate industrial impact as a core technology for implementing 'high-performance AI agents based on RAG (a smart AI assistant with search capabilities),' which will be applied to 'AkasicDB,' a vector-graph-relational DB system scheduled to be released by GraphAI Co., Ltd.
*Paper title: Chimera: A System Design of Dual Storage and Traversal-Join Unified Query Processing for SQL/PGQ *DOI: https://dl.acm.org/doi/10.14778/3705829.3705845
This research was supported by the Ministry of Science and ICT's IITP SW Star Lab and the National Research Foundation of Korea's Mid-Career Researcher Program.
KAIST Develops AI that Automatically Detects Defects in Smart Factory Manufacturing Processes Even When Conditions Change
Recently, defect detection systems using artificial intelligence (AI) sensor data have been installed in smart factory manufacturing sites. However, when the manufacturing process changes due to machine replacement or variations in temperature, pressure, or speed, existing AI models fail to properly understand the new situation and their performance drops sharply. KAIST researchers have developed AI technology that can accurately detect defects even in such situations without retraining, achieving performance improvements up to 9.42%. This achievement is expected to contribute to reducing AI operating costs and expanding applicability in various fields such as smart factories, healthcare devices, and smart cities.
KAIST (President Kwang Hyung Lee) announced on the 26th of August that a research team led by Professor Jae-Gil Lee from the School of Computing has developed a new “time-series domain adaptation” technology that allows existing AI models to be utilized without additional defect labeling, even when manufacturing processes or equipment change.
Time-series domain adaptation technology enables AI models that handle time-varying data (e.g., temperature changes, machine vibrations, power usage, sensor signals) to maintain stable performance without additional training, even when the training environment (domain) and the actual application environment differ.
Professor Lee’s team paid attention to the fact that the core problem of AI models becoming confused by environmental (domain) changes lies not only in differences in data distribution but also in changes in defect occurrence patterns (label distribution) themselves. For example, in semiconductor wafer processes, the ratio of ring-shaped defects and scratch defects may change due to equipment modifications.
The research team developed a method for decomposing new process sensor data into three components—trends, non-trends, and frequencies—to analyze their characteristics individually. Just as humans detect anomalies by combining pitch, vibration patterns, and periodic changes in machine sounds, AI was enabled to analyze data from multiple perspectives.
In other words, the team developed TA4LS (Time-series domain Adaptation for mitigating Label Shifts) technology, which applies a method of automatically correcting predictions by comparing the results predicted by the existing model with the clustering information of the new process data. Through this, predictions biased toward the defect occurrence patterns of the existing process can be precisely adjusted to match the new process.
In particular, this technology is highly practical because it can be easily combined like an additional plug-in module inserted into existing AI systems without requiring separate complex development. That is, regardless of the AI technology currently being used, it can be applied immediately with only simple additional procedures.
In experiments using four benchmark datasets of time-series domain adaptation (i.e., four types of sensor data in which changes had occurred), the research team achieved up to 9.42% improvement in accuracy compared to existing methods.[TT1]
Especially when process changes caused large differences in label distribution (e.g., defect occurrence patterns), the AI demonstrated remarkable performance improvement by autonomously correcting and distinguishing such differences. These results proved that the technology can be used more effectively without defects in environments that produce small batches of various products, one of the main advantages of smart factories.
Professor Jae-Gil Lee, who supervised the research, said, “This technology solves the retraining problem, which has been the biggest obstacle to the introduction of artificial intelligence in manufacturing. Once commercialized, it will greatly contribute to the spread of smart factories by reducing maintenance costs and improving defect detection rates.”
This research was carried out with Jihye Na, a Ph.D. student at KAIST, as the first author, with Youngeun Nam, a Ph.D. student, and Junhyeok Kang, a researcher at LG AI Research, as co-authors. The research results were presented in August 2025 at KDD (the ACM SIGKDD Conference on Knowledge Discovery and Data Mining), the world’s top academic conference in artificial intelligence and data.
※Paper Title: “Mitigating Source Label Dependency in Time-Series Domain Adaptation under Label Shifts”
※DOI: https://doi.org/10.1145/3711896.3737050
This technology was developed as part of the research outcome of the SW Computing Industry Original Technology Development Program’s SW StarLab project (RS-2020-II200862, DB4DL: Development of Highly Available and High-Performance Distributed In-Memory DBMS for Deep Learning), supported by the Ministry of Science and ICT and the Institute for Information & Communications Technology Planning & Evaluation (IITP).