Drug Discovery and Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE

Helix and Recursion join forces in quest to crack the AI-driven drug discovery code

By Brian Buntz | June 27, 2024

A display of DNA sequencing output from an automated ABI 373 sequencer. [National Human Genome Research Institute (NHGRI)]

A display of DNA sequencing output from an automated ABI 373 sequencer. [National Human Genome Research Institute (NHGRI)]

The UK Biobank, with genetic data on half a million individuals, set a benchmark for scale in medical research. It contains whole genome sequencing data for all 500,000 participants, making it the world’s largest genetic project to date.

A recent collaboration between the San Diego–headquartered clinico-genomics company Helix and the Salt Lake City–based biotech Recursion Pharmaceuticals aims to go even further. “We could surpass [UK Biobank] probably by the end of 2025,” says Hylton Kalvaria of Helix.

The partnership will combine Helix’s growing clinico-genomic dataset with more than 25 petabytes of Recursion’s proprietary biological and chemical data.

The world is awash in data. To cite one example, the latest CommonCrawl dataset, encompassing 17 years of website data, totals 377 tebibytes. That’s equivalent to the storage capacity of more than 3 million 128-gigabyte USB flash drives.

“There are larger data sources out there,” Kalvaria acknowledged. “But [the combined Helix-Recursion dataset] is still a heck of a lot of data and, importantly, it adds to itself over time.”

The “sequence once, query often” model: Unlocking a lifetime of genetic insights

Helix researchers in the lab

Helix researchers in the lab [Helix]

Helix isn’t focused on building a massive genomic database alone. The company follows a “sequence once, query often” approach to genomic data management. “Your genetics are the one thing about your health that never changes over time,” Kalvaria said. Having that information available upfront can offer a range of advantages whether in routine care or disease diagnostics. “There are many reasons why you might come back to that [genomic] information over time,” Kalvaria added.

For patients, a single DNA sequencing can provide insights for years to come, eliminating the need for further repeated genetic tests. For physicians, access to a patient’s genomic information can support quicker, more informed decisions. Kalvaria notes, “If a patient has already been sequenced, results can be obtained in minutes rather than weeks.” In the longer run, doctors could potentially confirm or rule out genetic conditions during an office visit using such data in the context of patient care. As scientists learn more about how genes relate to diseases, such readily available genetic data could help identify patients who might benefit from specific treatments or preventive measures.

The advantages also extend to healthcare systems: For instance, reducing duplicate testing can lead to cost savings and more efficient use of resources. The potential impact of this method goes beyond just saving time and money.

BioHive-2: The supercomputer informing drug hunting

While the potential of mining genomic data to improve patient care and more targeted drug discovery are not new, decoding such data is not necessarily straightforward. To help unearth the hidden connections between genes, diseases, and potential treatments, Helix is turning to Recursion Pharmaceuticals and its secret weapon: BioHive-2.

This isn’t your run-of-the-mill supercomputer. Powered by 504 of NVIDIA’s latest H100 Tensor Core GPUs, BioHive-2 is purpose-built for AI-driven drug discovery. Recursion estimates that the computer, four times faster than its predecessor, is currently the fastest, wholly-owned supercomputer in the pharmaceutical industry globally.

Hylton Kalvaria

Hylton Kalvaria

Over the past decade, Recursion has also amassed one of the world’s largest collections of biological and chemical data. Now that its supercomputer, BioHive-2, is operational, it plans to use the horsepower to create more advanced AI models that can help streamline the drug discovery process.

“We’re really excited to work with [Recursion] and make those new discoveries that will bring medicines to patients faster,” Kalvaria said.

Potential drug development lifecycle advantages extend from lab to market

While it is difficult to quantify how the partnership will impact drug development workflows, the efficiency gains could be significant. “Ultimately, that’s how we see our role in this drug development lifecycle,” Kalvaria noted. “Help people to screen through more things, to interrogate more drug targets upfront.”

The technological partnership could also inform the commercialization of drugs once they hit the market. “We have this footprint with health systems where we can actually serve patients who could be good candidates for a new drug that comes onto the market,” Kalvaria said.


Filed Under: Drug Delivery, Genomics/Proteomics, machine learning and AI, Omics/sequencing
Tagged With: AI models, BioHive-2 supercomputer, clinico-genomic data, drug target screening, Helix, precision medicine, Recursion Pharmaceuticals
 

About The Author

Brian Buntz

As the pharma and biotech editor at WTWH Media, Brian has almost two decades of experience in B2B media, with a focus on healthcare and technology. While he has long maintained a keen interest in AI, more recently Brian has made making data analysis a central focus, and is exploring tools ranging from NLP and clustering to predictive analytics.

Throughout his 18-year tenure, Brian has covered an array of life science topics, including clinical trials, medical devices, and drug discovery and development. Prior to WTWH, he held the title of content director at Informa, where he focused on topics such as connected devices, cybersecurity, AI and Industry 4.0. A dedicated decade at UBM saw Brian providing in-depth coverage of the medical device sector. Engage with Brian on LinkedIn or drop him an email at bbuntz@wtwhmedia.com.

Related Articles Read More >

Sai Life Sciences exec: GLP-1 boom has ‘exploded the peptide field’ as firm opens new center
Ypsomed Ypsodose patch injector
ten23 health to develop large-volume injectables for Ypsomed’s YpsoDose
Shasqi
Click chemistry breakthroughs drive Shasqi and J&J cancer alliance
Shasqi
Nobel-connected startup Shasqi deepens J&J partnership on CAPAC platform
“ddd
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest news and trends happening now in the drug discovery and development industry.

MEDTECH 100 INDEX

Medtech 100 logo
Market Summary > Current Price
The MedTech 100 is a financial index calculated using the BIG100 companies covered in Medical Design and Outsourcing.
Drug Discovery and Development
  • MassDevice
  • DeviceTalks
  • Medtech100 Index
  • Medical Design Sourcing
  • Medical Design & Outsourcing
  • Medical Tubing + Extrusion
  • Subscribe to our E-Newsletter
  • Contact Us
  • About Us
  • R&D World
  • Drug Delivery Business News
  • Pharmaceutical Processing World

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search Drug Discovery & Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE