
<(From Left) Distinguisehd Professor Sang Yup Lee, Dr. Gi Bae Kim, Professor Bernhard O. Palsson>
“We know the genes, but not their functions.” To resolve this long-standing bottleneck in microbial research, a joint research team has proposed a cutting-edge research strategy that leverages Artificial Intelligence (AI) to drastically accelerate the discovery of microbial gene functions.
KAIST announced on January 12th that a research team led by Distinguished Professor Sang Yup Lee from the Department of Chemical and Biomolecular Engineering, in collaboration with Professor Bernhard Palsson from the Department of Bioengineering at UCSD, has published a comprehensive review paper. The study systematically analyzes and organizes the latest AI-based research approaches aimed at revolutionizing the speed of gene function discovery.
Since the early 2000s, when whole-genome sequencing became a reality, there were high expectations that the genetic blueprint of life would be fully decoded. However, even twenty years later, the roles of a significant portion of genes within microbial genomes remain unknown.
While various experimental methods—such as gene deletion, analysis of gene expression profiles, and in vitro activity assays—have been employed, discovering gene functions remains a time-consuming and costly endeavor. This is primarily due to the limitations of large-scale experimentation, complex biological interactions, and the discrepancy between laboratory results and actual in vivo responses.
To overcome these hurdles, the research team emphasized that an AI-driven approach combining computational biology with experimental biology is essential.
In this paper, the team provides a comprehensive overview of computational biology approaches that have facilitated gene function discovery, ranging from traditional sequence similarity analysis to the latest deep-learning-based AI models.
Notably, 3D protein structure prediction technologies such as AlphaFold (developed by Google DeepMind) and RoseTTAFold (developed by the University of Washington) have opened new doors. These tools go beyond simple functional estimation, offering the potential to understand the underlying mechanisms of how gene functions operate. Furthermore, generative AI is now extending research boundaries toward designing proteins with specifically desired functions.
Focusing on transcription factors (proteins that act as genetic switches) and enzymes (proteins that catalyze chemical reactions), the team presented various application cases and future research directions that integrate gene sequence analysis, protein structure prediction, and diverse metagenomic analyses.

<Schematic illustration of computational biology methods for enzyme function prediction>
To overcome the biases and limitations inherent in traditional gene discovery, the researchers highlighted the need for an “Active Learning” framework where AI guides the experimental process.
Active Learning is a method where an AI model identifies predictions with high uncertainty and suggests specific experiments to resolve them. The results are then fed back into the model to improve its accuracy. This iterative loop allows researchers to efficiently validate the most critical gene functions first.
The team stressed that this approach requires tight integration with automated experimental platforms and shared research infrastructures, such as biofoundries. They also noted that “failed data”—experiments that did not yield the expected results—must be shared as vital learning assets for future research.
“While deep learning-based prediction performance has improved significantly, developing ‘Explainable AI’ models that can provide biological justifications for their results remains a critical challenge,” said Dr. Gi Bae Kim of KAIST, a co-author of the study.
Distinguished Professor Sang Yup Lee emphasized, “The key to surpassing the limits of gene function discovery lies in combining a systematic, AI-guided experimental framework with an automated research infrastructure under the direction of human researchers. Establishing a research ecosystem where prediction and validation are repeatedly linked is essential.”
The paper was published on January 7th in Nature Microbiology, a prestigious journal in the field of biotechnology published by Nature.
Publication Information
< Professor Kyung-Jin Lee at the ceremony > KAIST announced on February 12th that it has selected Professor Kyung-Jin Lee from the Department of Physics as the recipient of the ‘KAISTian of the Year’ award in celebration of the university's 55th anniversary. Established in 2001, the ‘KAISTian of the Year’ award is the university’s highest honor, presented to members who have significantly enhanced KAIST's global prestige through exceptional academic and r
2026-02-12<(From Left) Dr. Sukkyung Kang, Professor Sanha Kim from Department of Mechanical Engineering> The performance and stability of smartphones and artificial intelligence (AI) services depend on how uniformly and precisely semiconductor surfaces are processed. KAIST researchers have expanded the concept of everyday “sandpaper” into the realm of nanotechnology, developing a new technique capable of processing semiconductor surfaces uniformly down to the atomic level. This techn
2026-02-11<(From top left) Professor Woo Youn Kim (KAIST), Dr. Jeheon Woo (KISTI), Dr. Seonghwan Kim (KAIST), and Jun Hyeong Kim (PhD candidate)> Whether a smartphone battery lasts longer or a new drug can be developed to treat incurable diseases depends on how stably the atoms constituting the material are bonded. The core of 'molecular design' lies in finding how to arrange these countless atoms to form the most stable molecule. Until now, this process has been as difficult as finding the lowes
2026-02-10< KAIST Professor Kyung Ryul Park delivering a keynote speech > KAIST announced on February 9th that the KAIST-NYU AI and Digital Governance Summit, co-hosted with New York University (NYU), was held at NYU in New York from February 6 to 7 (local time). Amid the rapidly expanding impact of Artificial Intelligence (AI) across society, this summit was designed to combine private consensus meetings with public discussions to seek practical AI governance solutions that harmonize technolog
2026-02-09< (From left) Ph.D candidate Changhwan Kim, Ph.D candidate Seunghwan Kim , Ph.D candidate Namwook Hur, Professor Joonki Suh, Ph. D candidate Youngseok Cho> As artificial intelligence advances, computers demand faster and more efficient memory. The key to ultra-high-speed, low-power semiconductors lies in the "switching" principle—the mechanism by which memory materials turn electricity on and off. A South Korean research team has successfully captured the elusive moment of switchi
2026-02-09