Improving AlphaFold to predict very large proteins

04 November 2024

Anders Törneholm

The AI tool AlphaFold has been improved so that it can now predict the shape of very large and complex protein structures. Linköping University researchers have also succeeded in integrating experimental data into the tool. The results, published in Nature Communications, are a step toward more efficient development of new proteins for, among other things, medical drugs.

Claudio Mirabello and Björn Wallner have developed AlphaFold further. It can now take in information from experiments and partial data as well as predict very large and complex protein structures. Photographer: Thor Balkhed

In all living organisms, there is a huge variety of proteins that regulate cell functions. Basically, everything that happens in the body, from controlling muscles and forming hair to transporting oxygen into the blood and digesting food, involves proteins. But proteins are also found outside the body in, for example, detergents and medical drugs.

Proteins are large molecules consisting of 20 different amino acids that stick together in long rows, much like beads in a necklace. The sequences, or chains, can be anything from 50 up to a few thousand amino acids long. This gives rise to an almost limitless number of possible combinations, and the sequence in turn determines the three-dimensional shape of the protein. Depending on the shape of the protein chain, that is, the way it is folded, the protein has completely different functions.

For over 50 years, researchers have been trying to both predict and design different protein structures to gain a deeper understanding of the body’s mechanisms, various diseases, and to develop new types of medical drugs. This has been a laborious and expensive task involving a lot of manual handling.

Breakthrough with AI

But in 2020, the company DeepMind released open source software called AlphaFold. It is an artificial intelligence, based on so-called neural networks, that can predict with great accuracy how proteins will fold, and thus what functions they will have. This was a breakthrough that also resulted in the Nobel Prize in Chemistry 2024.

Claudio Mirabello and Björn Wallner are researchers at the Department of Physics, Chemistry and Biology (IFM). Photographer: Thor Balkhed

However, the programme has had its limitations. Among other things, it has been unable to predict very large protein compounds or to draw conclusions from experimental or incomplete data.

Researchers at Linköping University have now developed AlphaFold further to overcome these shortcomings. The tool, which they call AF_unmasked, can now take in information from experiments and partial data as well as predict very large and complex protein structures.

“We’re giving a new type of input to AlphaFold. The idea is to get the whole picture, both from experiments and neural networks, making it possible to build larger structures. But you can also have a draft of a structure that you feed into AlphaFold and get a relatively accurate result,” says Claudio Mirabello, docent at the Department of Physics, Chemistry and Biology at Linköping University.

Refine experiments

The idea behind AF_unmasked is to help researchers refine their experiments by providing guidance on how the protein could be designed. This is a step toward an even better understanding of the functions of proteins and toward designing new types of protein drugs.

The AlphaFold breakthrough was made possible by researchers around the world collecting data since the 1970s on the structure of approximately 200,000 different proteins in a database. This database provided training data for AlphaFold. What finally made it work on a large scale was the technological development of supercomputers that use GPUs for heavy calculations.

Björn Wallner leads a research group working on structural bioinformatics, which is part of the large research field of data-driven life science. Photographer: Thor Balkhed

Björn Wallner is a professor of bioinformatics at Linköping University and has worked with one of the three Nobel Prize winners.

“The possibilities for protein design are endless, only the imagination sets limits. It’s possible to develop proteins for use both inside and outside the body. You always have to find new, more difficult problems when you have solved the old ones. And within our field, finding problems is no problem,” says Björn Wallner.

An idea from LiU

Together with Claudio Mirabello, he developed a precursor to AlphaFold that also inspired DeepMind in developing the tool. Thanks to the resources of the Google-owned company, they were then able to develop what is now an indispensable tool for the world’s protein scientists.

“AlphaFold wasn’t the first tool to use deep neural networks to solve the problem. In fact, one of the most important characteristics of AlphaFold is that it encodes the evolutionary history of a protein inside the neural network, an idea that actually originated here at LiU and was published by Björn and me in 2019. So, you could say that AlphaFold was based on our idea, and now we are building on AlphaFold,” says Claudio Mirabello.

The study was funded mainly by SciLifeLab, the Knut and Alice Wallenberg Foundation, and the Swedish Foundation for Strategic Research. The calculations were performed on the supercomputers Tetralith and Berzelius at the National Supercomputer Centre at Linköping University.

Article: Claudio Mirabello, Björn Wallner, Björn Nystedt, Stavros Azinas & Marta Carroni, Nature Communications 15, 8724 (2024), published online 9 October 2024. DOI: 10.1038/s41467-024-52951-w
