Why AlphaFold 3 needs to be open source
Imagine a world where, in a matter of minutes, scientists could identify drugs to treat incurable diseases, design chemicals that could break down plastics to clean up pollution, and develop new materials that can suck carbon dioxide out of the air to help address climate change. This is the promise of new biology- and chemistry-based models that use artificial intelligence, or AI, to perform traditionally time-consuming tasks such as determining the structures of proteins.
Google DeepMind, a private research subsidiary of Google, released the highly anticipated AlphaFold 3 last month as a paper in Nature. This model claims to be an improvement over its earlier version, AlphaFold 2, because it can predict not just protein structures, but also how they interact with RNA, DNA and, most importantly, drugs. DeepMind said that it hopes AlphaFold 3 will “transform our understanding of the biological world and drug discovery.”
However, it’s unlikely to change how computer scientists such as myself understand biology anytime soon, because Nature, the highly competitive journal that states its mission is to “serve scientists,” allowed DeepMind to keep the software’s code unavailable, despite its own policy requiring authors “to make materials, data, code, and associated protocols promptly available to readers without undue qualifications.”
In an interview with Nature reporter Ewen Callaway, DeepMind cited its own commercial interests as a reason to restrict access, in particular through its spinoff company Isomorphic Labs. “We have to strike a balance between making sure that this is accessible and has the impact in the scientific community as well as not compromising Isomorphic’s ability to pursue commercial drug discovery,” said Pushmeet Kohli, DeepMind’s head of AI science and vice president of research.
Since DeepMind produced the software, it’s understandable that the company should be the one to determine how AlphaFold 3 gets released. DeepMind will just have to accept the consequence that its software may not be as popular among researchers.
Google CEO Sundar Pichai wrote that many researchers have used previous versions of AlphaFold, most notably AlphaFold 2, the earth-shatteringly powerful technology released by DeepMind in 2021. A large part of its popularity came because it was verified by hundreds of academic groups, for example during the 2020 Critical Assessment of Structure Prediction (CASP), a global challenge held every two years where teams predict the structures of proteins that have never been seen before.
AlphaFold 3 has no third-party verification of the results it describes in the paper, leaving researchers no recourse but to believe that the model’s results are correct, presumably because they came from the creators of the highly successful AlphaFold 2.
“The amount of disclosure in the AlphaFold3 publication is appropriate for an announcement on a company website,” stated 10 scientists in a letter submitted to the editors of Nature, “but it fails to meet the scientific community’s standards of being usable, scalable, and transparent.” As of May 28, the letter had accumulated more than 1,000 signatures.
In response to the letter, Kohli quickly came out on X stating that the model will be downloadable for academic use in the next six months. I applaud Kohli and DeepMind on this statement; however, concerns remain. A post on X is not a binding agreement between DeepMind and Nature; it contains vague release details with a deadline far in the future.
In an editorial published on May 22, Nature claimed that by allowing peer-reviewed publications from the private sector, it “promotes the sharing of knowledge, verification of the research and the reproducibility researchers strive for” and that its policy states that the editors reserve the right to decide if all code needs to be released. However, it’s unclear to me how one can verify research without having the tools available to do so.
Popular journals such as Nature need to employ equal standards for all groups, not make exceptions for large for-profit industries. Instead, AlphaFold 3 should have been posted as a preprint (a non-peer-reviewed article) on a widely accepted preprint database until all materials needed to reproduce the results were released. It could even have been just a blog post, similar to how OpenAI's text-to-video model, Sora, was released.
Following widespread criticism in many academic circles, Nature Editor-in-Chief Magdalena Skipper appeared to suggest in comments to two outlets that biosecurity and ethical concerns were the reason to publish AlphaFold 3 without open-access code. This concern is understandable given that in March, leaders in the biotechnology community released a statement expressing the need to self-regulate AI.
However, DeepMind never explicitly stated that biosecurity was a reason for limiting access. I was only able to find a semi-relevant statement in the press release, which says that DeepMind worked with 50 domain experts “to understand the capabilities of successive AlphaFold models and any potential risks.”
Even if DeepMind were concerned with biosecurity, the restricted release doesn’t follow the precedent set by DeepMind itself for publishing models that could be used for unethical purposes. For example, in September, DeepMind released a model to help understand rare genetic diseases, AlphaMissense, in the journal Science, along with the code to reproduce the model.
The paper notes that the source code can be downloaded, but parts of the model were not shared to “prevent use in potentially unsafe applications.” The decision was reportedly assessed by DeepMind’s responsible AI team and an anonymous “outside biosafety expert” in order to reduce misuse of the model by bad actors. This is like giving someone the recipe to bake a cake, instead of handing them one fresh out of the oven.
Under this type of release, researchers who want to replicate the results must start over, implementing the model from scratch, which is a long and expensive process but doable with enough effort. That way, everyone wins: The model’s abilities can be assessed fairly — including identifying any unknown security concerns — but it can’t be quickly reproduced by bad actors.
If DeepMind were truly concerned about the biosecurity implications of AlphaFold 3, it should have stated that concern directly, and Nature should have demanded a code release similar to that of AlphaMissense.
Perhaps by upholding open-access standards, we will be able to achieve a perfect future, one in which all diseases can be cured, plastic pollution is cleaned up, and climate change is mitigated. However, we won’t have a chance to get there if the rules for academic publication are not applied equally.
This article was originally published elsewhere.