Drug Discovery and Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE

New AI tool InClinico predicts clinical trial outcomes with 79% accuracy: A closer look with InSilico CEO, Alex Zhavoronkov

By Brian Buntz | August 3, 2023

Artistic ollustration of InClinico

Artistic ollustration of InClinico, an AI tool for predicting clinical trial outcomes

Roughly nine out of ten clinical trials fail as a result of factors such as lack of efficacy and unmanageable toxicity, as the journal Pharma Excipients has noted.

After seven years of development, Insilico Medicine has scored a breakthrough with its generative AI tool, inClinico. In particular, the tool demonstrated 79% accuracy in predicting the outcomes of real-world phase 2 and 3 trials.

While predicting clinical trial performance has been a sort of Holy Grail in drug development, traditionally, it has been elusive. Several factors have constrained adoption: limited peer-reviewed evidence backing the use of ML models, a “black box” dilemma making clinicians wary of ML predictions, logistical hurdles in integrating ML into clinical workflows and EHR systems and a need for ample high-quality training data. Also, regulatory concerns over ML models for trials and philosophical qualms about favoring ML over human judgment act as barriers.

InSilico is thus believes the accuracy of InClinico in predicting clinical trial outcomes could be a game-changer in drug development.

The company also recently announced that one of its AI-discovered drugs has moved to phase 2.

How InClinico can predict clinical trial outcomes

Incorporating transformer-based AI models, the AI predictive tool inClinico is based on a collection of models using generative AI and multimodal data sources including text, omics, trial design and drug properties. The developers of tool trained it on more than 55,600 phase 2 clinical trials.

The peer-reviewed journal Clinical Pharmacology and Therapeutics journal published the research. InSilico also notes that pharmaceutical companies and financial institutions have validated InClinical through retrospective, quasi-prospective and prospective validation studies.

The platform achieved a score of 0.88 on the Receiver Operating Characteristic Area Under the Curve (ROC AUC) in predicting the transition from phase 2 to 3. ROC AUC stands is a statistical measure used in machine learning and data analytics to gauge the performance of a binary classifier system. AUC values lie between 0 and 1. Higher values indicate that the model is better at predicting 0s as 0s and 1s as 1s. An AUC value of 0.5 indicates that the model fares no better than random chance.

Insilico touted the tool’s usefulness to investors, stating that it supported a 35% 9-month return on investment in a virtual trading portfolio.

InSilico CEO dishes on the power of the AI predictive tool in clinical trials

To learn more about InClinico, we reached out to Alex Zhavoronkov, founder and CEO and co-author on the paper.

How does Insilico’s generative AI, inClinico, use multimodal data to predict the outcome of clinical trials?

AI-powered autonomous robotics lab called Life Star in Suzhou that performs target discovery, compound screening, precision medicine development and translational research.

Alex Zhavoronkov in an AI-powered autonomous robotics lab called Life Star in Suzhou that performs target discovery, compound screening, precision medicine development and translational research. [Insilico Medicine]

Zhavoronkov: InClinico relies on various data — including omics data, clinical trials, patents, publications, grants and 13,000 drugs from phase 1 to launch. Clinical trials are where the vast majority of drugs in development – over 90% – fail. The highest probability of failure occurs in phase 2 clinical trials, where both safety and efficacy are tested in a substantial number of patients. By that point, tens of millions of dollars have already been invested in the development of the drug. Around 60% of phase 2 trials fail. Most often, programs fail at this stage because they do not demonstrate efficacy, but failure can also happen due to toxicity and the complexity of the data.

Using machine learning and AI, we built models based on data points specific to successfully launched and failed drugs, including clinical trial design, biological features of the disease, drug targets and patient criteria. We combined these models into our AI tool. For every evaluated phase 2, inClinico generates a probability of success for proceeding into phase 3.

We have been training inClinico on clinical trials, drugs and diseases since 2014 and as the technology has advanced into multi-modal large language models and other generative AI technologies, we’ve been able to improve its predictive capabilities so that it can serve as a tool to help companies direct their research dollars and expertise toward programs that have the best likelihood of success and to capture and utilize valuable information about the programs that have failed.

The press release describes an accuracy of 79% in predicting the outcomes of real-world trials. How did the company measure this benchmark?

Zhavoronkov: Our researchers collected phase 2 clinical trial data from 55,653 phase 2 trials from clinicaltrials.gov, which come from a variety of public sources, including pharma press releases and publications. The data was curated by both biomedical experts and by a generative large language model. Our AI tool then mapped these trials to drugs and diseases using a natural language processing pipeline based on the state-of-the-art Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer (DILBERT). Using this data, we  built  a  model  for  predicting  the  phase  2  clinical  trial  probability of success which is defined as the transition of drug-condition pair from phase 2 to phase 3.

Validation studies for this paper were based on 17 clinical trials whose phase 2 outcomes were made public by August 2022. Based on these studies, inClinico’s predictive ability was found to be 79% accurate. We determined a phase 2 clinical trial as “successful” based on three factors: the statistical and clinical significance of efficacy and safety endpoints; a company’s decision to transition the drug program to phase 3  and the momentary increase of company’s stock price in response to clinical trial results.

As noted in the paper: “While Machine learning  (ML)  and  artificial  intelligence  (AI)  are  widely  used  in  clinical  trial  operations  by  pharmaceutical companies, there have been no validated reports on the implementation of ML/AI predicted probability of technical and regulatory success (PTRS) systems within pharma research and development (R&D) processes.”

Could you elaborate on the findings that “target choice” has more impact on clinical trial outcome prediction than “trial design”?

Zhavoronkov: The traditional method of finding targets in the pharma industry is a combination of academic research and serendipity. But we know that the vast majority of these hypotheses will ultimately fail when drugs reach clinical trials, suggesting that the complexity of diseases and biological mechanisms makes finding precise targets for disease incredibly difficult.

AI can help us find those critical connections, analyzing massive amounts of diverse data – omics data, grants, clinical trials, compounds and publications – in the context of a disease of interest in order to produce a ranked list of potential targets. That is how our AI target discovery engine PandaOmics works, to not only efficiently, but quickly, identify promising targets for disease – and it has identified targets for 30-plus drugs that we are developing, including three in clinical trials. AI not only has a huge role to play in helping us find new promising targets, but also in designing new molecules to fit these targets and predicting the likelihood of success in bringing them through clinical trials.

What’s next for InClinico? What further advances do you expect in terms of prediction accuracy?

Zhavoronkov: Ultimately, I would like this tool to be deployed broadly as with broader use, it will further improve. We utilize an approach we call Reinforcement Learning from Expert Feedback (RLEF) – the more expert feedback we receive from analysts using the tool for making predictions, the better it becomes. At this time, we can only predict small molecule first-in class single-agent targeted therapeutics. We estimate about 20–25% of the trials to be predictable with our current tool with meaningful accuracy. I hope that with our new laboratory robotics capabilities, we might be able to expand into predicting success rates for combination therapies and help select the best combination for targeted therapies.


Filed Under: clinical trials, Data science, Drug Discovery, Drug Discovery and Development, machine learning and AI, Regulatory affairs
Tagged With: AI in Pharma, Biotech Innovations, clinical trials, drug development, InClinico, pharma R&D, Predictive Modeling
 

About The Author

Brian Buntz

As the pharma and biotech editor at WTWH Media, Brian has almost two decades of experience in B2B media, with a focus on healthcare and technology. While he has long maintained a keen interest in AI, more recently Brian has made making data analysis a central focus, and is exploring tools ranging from NLP and clustering to predictive analytics.

Throughout his 18-year tenure, Brian has covered an array of life science topics, including clinical trials, medical devices, and drug discovery and development. Prior to WTWH, he held the title of content director at Informa, where he focused on topics such as connected devices, cybersecurity, AI and Industry 4.0. A dedicated decade at UBM saw Brian providing in-depth coverage of the medical device sector. Engage with Brian on LinkedIn or drop him an email at [email protected].

Related Articles Read More >

The FDA’s AI ambitions depend on better data practices
Pharma leaders support new Pistoia Alliance data standardization initiative
Why the next breakthrough drug depends on smarter infrastructure
Finding signals in the storm: Automation in biomarker discovery 
“ddd
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest news and trends happening now in the drug discovery and development industry.

MEDTECH 100 INDEX

Medtech 100 logo
Market Summary > Current Price
The MedTech 100 is a financial index calculated using the BIG100 companies covered in Medical Design and Outsourcing.
Drug Discovery and Development
  • MassDevice
  • DeviceTalks
  • Medtech100 Index
  • Medical Design Sourcing
  • Medical Design & Outsourcing
  • Medical Tubing + Extrusion
  • Subscribe to our E-Newsletter
  • Contact Us
  • About Us
  • R&D World
  • Drug Delivery Business News
  • Pharmaceutical Processing World

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search Drug Discovery & Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE