Drug Discovery and Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE

30 promising biotech startups: Scatter plot

By Brian Buntz | August 25, 2023

The preceding visual representation of the biotech startups of 2023 are grouped — clustered — according to their focus areas. Each cluster color corresponds to a specific domain:

  • Orange for advanced molecular techniques (Cluster 0).
  • Blue for cell and gene therapies (Cluster 1).
  • Green for AI-driven drug discovery (Cluster 2).
  • Red for epigenetics and genomic medicine (Cluster 3).

A note on the numbers in the chart

The numbers you see on x- and y-axis of the scatter plot were derived from a technique called Principal Component Analysis (PCA). These numbers reflect the two most significant directions for data variation for the startups. The coordinates are similar to x- and y- coordinates on a map, but instead of representing locations, they capture the characteristics that distinguish each startup in the biotech landscape. Similar to how two cities can be geographically close, two neighboring startups on this plot share more similarities in their data features.

To arrive at the four clusters, we applied a series of data analytics and machine learning techniques. The first step was identifying promising companies based largely on funding milestones and presence of prominent investors. The companies’ novelty of focus also played a role in the analysis. A technique known as K-means clustering helped break the startups into clusters. The “K” in “K-means” refers to the number of clusters. In our case, we chose four distinct clusters. The “means” in “K-means” refers to the method of determining the center of each cluster. Specifically, the algorithm, which is common in unsupervised machine learning, assigns each data point to the nearest cluster center and then recalculates the center as the mean of all of the data points in a given cluster. The algorithm iterates until the cluster assignments stabilize. After finalizing the clusters, the technique grouped the startups based on similarity based on the selected data features. The result is the breakdown above.

More on vectors

The coordinates displayed when hovering over a company correspond to vector positions obtained from their data features. Vectors, which capture direction and magnitude, are useful in machine learning and data science, where they extract patterns, relationships and semantics from data. For example, Google developed Word2Vec to represent words as vectors. That is, a vector in Word2Vec can illuminate semantic relationships between words. Using Word2Vec embeddings, the operation “France” – “Paris” + “Berlin” might yield “Germany.”

Another example of vectors in neural network architecture is FaceNet, which represents facial images as vectors. The technique works even a face is rotated or exhibits different expressions or angles because the features remain close in the vector space.

You’ll find more on the methodology of the biotech startup feature here.


Filed Under: Biologics, Cell & gene therapy, Data science
Tagged With: Biopharma pioneers, biotech startups, Biotech ventures, pharma industry trends, R&D innovation, Startup spotlight
 

About The Author

Brian Buntz

As the pharma and biotech editor at WTWH Media, Brian has almost two decades of experience in B2B media, with a focus on healthcare and technology. While he has long maintained a keen interest in AI, more recently Brian has made making data analysis a central focus, and is exploring tools ranging from NLP and clustering to predictive analytics.

Throughout his 18-year tenure, Brian has covered an array of life science topics, including clinical trials, medical devices, and drug discovery and development. Prior to WTWH, he held the title of content director at Informa, where he focused on topics such as connected devices, cybersecurity, AI and Industry 4.0. A dedicated decade at UBM saw Brian providing in-depth coverage of the medical device sector. Engage with Brian on LinkedIn or drop him an email at bbuntz@wtwhmedia.com.

Related Articles Read More >

Nektar’s Phase 2b atopic dermatitis win triggers 1,746% analyst target surge, but legal tussle with ex-partner Lilly could complicate path forward
GSK sees latest Nucala approval as the first shot in a long-term war to deconstruct and personalize COPD treatment
EVEREST lead investigator on why Dupixent sets a new bar for treating coexisting CRSwNP and asthma
FDA approved ENFLONSIA for the prevention of RSV in Infants
“ddd
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest news and trends happening now in the drug discovery and development industry.

MEDTECH 100 INDEX

Medtech 100 logo
Market Summary > Current Price
The MedTech 100 is a financial index calculated using the BIG100 companies covered in Medical Design and Outsourcing.
Drug Discovery and Development
  • MassDevice
  • DeviceTalks
  • Medtech100 Index
  • Medical Design Sourcing
  • Medical Design & Outsourcing
  • Medical Tubing + Extrusion
  • Subscribe to our E-Newsletter
  • Contact Us
  • About Us
  • R&D World
  • Drug Delivery Business News
  • Pharmaceutical Processing World

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search Drug Discovery & Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE