Drug Discovery and Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE

Cutting through the noise of machine learning for drug discovery

By Brian Buntz | February 5, 2022

Variational AIWhile the topic of AI in drug discovery has received considerable attention in recent years, mature deployments of techniques such as machine learning in the industry remain rare. 

“The chemistry domain is qualitatively different from any other problem that machine learning has exhibited real success in,” said Jason Rolfe, CTO of Variational AI (Vancouver, Canada). 

A dataset involving FDA-approved drugs that have been tested in humans would be orders of magnitude smaller than the sort of datasets that underlie Generative Pre-trained Transformer 3 (GPT-3), a language model from OpenAI, an AI research company co-founded by Elon Musk.   

Jason Rolfe

Jason Rolfe

High-throughput screening can generate substantially larger datasets. The PubChem database is billed by NIH as the “largest collection of freely accessible chemical information,” but the data can be noisy. Many of the apparent active compounds likely won’t be validated in a secondary screen due to factors such as aggregation, contamination of samples and assay interference. 

“These datasets are intrinsically more difficult to work with than something like ImageNet, which has been the workhorse for much of the architectural development in machine learning,” Rolfe said. 

With ImageNet, a visual database containing more than 14 million annotated images, the data are relatively clean. “Some images are misclassified, but it’ll be something like a Shih Tzu classified as a Pomeranian,” Rolfe said. 

By contrast, noise is “rampant in pharmacological data,” Rolfe said. 

Drug discovery is “a very challenging domain to work in, but it has outsized promise,” Rolfe said. 

With the cost of developing a new drug frequently hitting billions of dollars and the failure rate high, “anything that can reduce that by even a fraction would be of extreme value to society,” Rolfe noted.

Handol Kim

Handol Kim

Variational AI focuses on machine learning for drug discovery to generate small molecules that become assets licensed to biopharma companies. 

The biopharma industry is in the “first or second inning” in terms of adopting techniques such as machine learning for drug discovery, said Handol Kim, CEO of Variational AI. 

Skepticism about AI’s promise in drug discovery has begun to fade during the pandemic, Kim said. “Pharma companies are realizing this could be a new potential modality not unlike biotech in the 1970s or 1980s,” he added.

In addition, “a lot of pharma companies are now investing in hiring people to specifically work on AI for drug discovery companies,” Kim said.


Filed Under: Data science, Drug Discovery, machine learning and AI
Tagged With: drug discovery, machine learning, Variational AI
 

About The Author

Brian Buntz

As the pharma and biotech editor at WTWH Media, Brian has almost two decades of experience in B2B media, with a focus on healthcare and technology. While he has long maintained a keen interest in AI, more recently Brian has made making data analysis a central focus, and is exploring tools ranging from NLP and clustering to predictive analytics.

Throughout his 18-year tenure, Brian has covered an array of life science topics, including clinical trials, medical devices, and drug discovery and development. Prior to WTWH, he held the title of content director at Informa, where he focused on topics such as connected devices, cybersecurity, AI and Industry 4.0. A dedicated decade at UBM saw Brian providing in-depth coverage of the medical device sector. Engage with Brian on LinkedIn or drop him an email at bbuntz@wtwhmedia.com.

Related Articles Read More >

Abstract neural network
Inside IQVIA’s quest to build a multi-agent AI ‘dream team’ to transform clinical trials
Xaira and Verily co-founder ponders low-hanging fruit and blue-sky potential in FDA’s genAI rollout
Capgemini’s life-sciences lead says ROI and data security, not algorithms, will decide pharma’s AI future
Portrait of happy smiling mature middle aged professional business woman investor manager executive or lawyer attorney looking at camera at workplace working on laptop computer in office.
As FDA pushes agency-wide generative AI, pharma experience show similar tools can cut clinical study-report drafting time by 30% or more
“ddd
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest news and trends happening now in the drug discovery and development industry.

MEDTECH 100 INDEX

Medtech 100 logo
Market Summary > Current Price
The MedTech 100 is a financial index calculated using the BIG100 companies covered in Medical Design and Outsourcing.
Drug Discovery and Development
  • MassDevice
  • DeviceTalks
  • Medtech100 Index
  • Medical Design Sourcing
  • Medical Design & Outsourcing
  • Medical Tubing + Extrusion
  • Subscribe to our E-Newsletter
  • Contact Us
  • About Us
  • R&D World
  • Drug Delivery Business News
  • Pharmaceutical Processing World

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search Drug Discovery & Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE