ENGINEERING
UT Southwestern prof uses AI to predict protein interactions for a wealth of drug targets
- Written by: Tyler O'Neal, Staff Editor
- Category: ENGINEERING
UT Southwestern and University of Washington researchers led an international team that used artificial intelligence (AI) and evolutionary analysis to produce 3D models of eukaryotic protein interactions. The study, published in Science, identified more than 100 probable protein complexes for the first time and provided structural models for more than 700 previously uncharacterized ones. Insights into the ways pairs or groups of proteins fit together to carry out cellular processes could lead to a wealth of new drug targets.
“Our results represent a significant advance in the new era in structural biology in which computation plays a fundamental role,” said Qian Cong, Ph.D., Assistant Professor in the Eugene McDermott Center for Human Growth and Development with a secondary appointment in Biophysics.
Dr. Cong led the study with David Baker, Ph.D., Professor of Biochemistry and Dr. Cong’s postdoctoral mentor at the University of Washington before her recruitment to UT Southwestern. The study has four co-lead authors, including UT Southwestern Computational Biologist Jimin Pei, Ph.D.
Proteins often operate in pairs or groups known as complexes to accomplish every task needed to keep an organism alive, Dr. Cong explained. While some of these interactions are well studied, many remain a mystery. Constructing comprehensive interactomes – or descriptions of the complete set of molecular interactions in a cell – would shed light on many fundamental aspects of biology and give researchers a new starting point on developing drugs that encourage or discourage these interactions. Dr. Cong works in the emerging field of interactomics, which combines bioinformatics and biology.
Until recently, a major barrier for constructing an interactome was uncertainty over the structures of many proteins, a problem scientists have been trying to solve for half a century. In 2020 and 2021, a company called DeepMind and Dr. Baker’s lab independently released two AI technologies called AlphaFold (AF) and RoseTTAFold (RF) that use different strategies to predict protein structures based on the sequences of the genes that produce them.
In the current study, Dr. Cong, Dr. Baker, and their colleagues expanded on those AI structure-prediction tools by modeling many yeast protein complexes. Yeast is a common model organism for fundamental biological studies. To find proteins that were likely to interact, the scientists first searched the genomes of related fungi for genes that acquired mutations in a linked fashion. They then used the two AI technologies to determine whether these proteins could be fit together in 3D structures.
Their work identified 1,505 probable protein complexes. Of these, 699 had already been structurally characterized, verifying the utility of their method. However, there was only limited experimental data supporting 700 of the predicted interactions, and another 106 had never been described.
To better understand these poorly characterized or unknown complexes, the University of Washington and UT Southwestern teams worked with colleagues around the world who were already studying these or similar proteins. By combining the 3D models the scientists in the current study had generated with information from collaborators, the teams were able to gain new insights into protein complexes involved in the maintenance and processing of genetic information, cellular construction and transport systems, metabolism, DNA repair, and other areas. They also identified roles for proteins whose functions were previously unknown based on their newly identified interactions with other well-characterized proteins.
“The work described in our new paper sets the stage for similar studies of the human interactome and could eventually help in developing new treatments for human disease,” Dr. Cong added.
Dr. Cong noted that the predicted protein complex structures generated in this study are available to download from ModelArchive. These structures and others generated using this technology in future studies will be a rich source of research questions for years to come, she said.