Drug Discovery and Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE

Tool Reveals Molecular Causes of Disease, Including Infant Cancer

By Princeton University, Engineering School | March 4, 2019

In a demonstration of a tool for revealing molecular differences between diseases, researchers discovered four genes associated with a rare pediatric cancer. The image on the left shows a normal cell while the one on the right highlights one of the discovered genes in neuroblastoma, which afflicts babies and young children. Credit: Troyanskaya Lab

Princeton University researchers are gaining new insights into the causes and characteristics of diseases by harnessing machine learning to analyze molecular patterns across hundreds of diseases simultaneously. Demonstrating a new tool now available to researchers worldwide, the team of computer scientists and biologists has already uncovered and experimentally confirmed previously unknown contributions of four genes to a rare form of cancer that primarily affects babies and young children.

The team, which includes collaborators at Michigan State University and the University of Oslo, introduced the system and demonstrated its abilities in a paper published in the Feb. 23 issue of the journal Cell Systems.

While previous approaches focused on genes associated with specific diseases or types of cancer, the new technique uses machine learning to find unique patterns of gene activity by looking at more than 300 different diseases simultaneously, including cancers, heart disease, metabolic disorders and many others. In doing so, it reveals distinctions between diseases and tissue types, including fine-tuned differences between related diseases that were not possible to discern with other techniques.

The researchers believe that, with further development, the tool will be useful to clinicians in diagnosing disease, tailoring and tracking the effectiveness of therapies, and finding new treatment approaches.

The system, called Unveiling RNA Sample Annotation for Human Diseases, or URSA(HD), incorporates information about the activity of genes from publicly available records of about 8,000 biopsies taken from healthy and diseased tissues of thousands of patients. Going forward, researchers may submit new samples to the tool, via a web interface, and receive an analysis of possible associations with diseases and tissue types.

“The real innovation is comparing all samples to every other sample,” said Chandra Theesfeld, one of the lead researchers along with Young-Suk Lee, who earned his Ph.D. at Princeton in 2016.

Theesfeld likened the idea to humans’ ability to recognize nuanced differences between behaviors based on having seen a wide variety of examples. Watching soccer players, for example, might reveal the characteristics of a kicking action, but watching soccer players and ballet dancers at the same time reveals details and context for a similar action with a very different style and purpose.

“Studying them together provides a way to distinguish unique aspects,” said Theesfeld, a research scientist in the lab of Olga Troyanskaya of Princeton and the Simons Foundation, who led the team. That viewpoint provides an unbiased way “to learn new things about disease that aren’t possible to find with the one-disease-at-a-time approach — and potentially identify new targets for therapies or even discover new aspects of disease that weren’t appreciated.”

In making its comparisons, the algorithm gives more weight to differences in gene activity that uniquely define the distinct tissues and diseases. It de-emphasizes information about gene activity common to related diseases, much of which already is well studied. In the soccer-dancing analogy, it’s like setting aside the large-scale action of lifting a leg in a kick and finding many details, such the angle of a foot, that taken together constitute a signature set of characteristics that reliably identifies one action or the other.

“Our method is driven by the disease information in the patient sample, so it’s not biased toward the popular disease genes that always get studied,” Theesfeld said. “We can track patterns of changes in data without knowing exactly what each change means.”

Theesfeld noted that 90 percent of studies of genes look at just 10 percent of human genes. URSA(HD) looks at the entire human genome and creates a genome-wide model or signature for each disease.

This approach could be particularly powerful for rare diseases, for which the researchers can now create a model with just a few samples. In the case of neuroblastoma, the pediatric cancer, the researchers found four genes that particularly contributed to the disease and for which there was no previous information in the scientific literature. To confirm the findings, Theesfeld performed laboratory tests on human cells, manipulating the gene activity and observing their effects on cancer-related processes in the cells.

Rather than looking at DNA itself, URSA(HD) looks at RNA, the product that cells create as they transcribe the information in DNA into working molecules that build and run cells and transmit signals from cell to cell. In this way, the system looks beyond mutations (scrambling in the genes themselves) and instead focuses on the downstream transcription products, which can be dysregulated in ways that cause problems even if the original gene is normal.

The research is part of longstanding work in Troyanskaya’s lab to integrate massive collections of dissimilar datasets to extract information necessary to make precise biological predictions and to direct laboratory experiments to accelerate discovery. A wide range of data science at Princeton brings together computing and biology to develop foundational tools and insights with potential to have a broad impact for health and humanity.

“Interdisciplinary approaches that merge sophisticated data science with deep knowledge of biology are key to deciphering biomedical puzzles necessary to realize the promise of precision medicine,” said Troyanskaya.


Filed Under: Oncology

 

Related Articles Read More >

New treatment paradigms in oncology: Highlights from the ASCO 2025 annual meeting
Korean team reports all-in-one cancer nanomedicine in pre-clinical studies
As some biotechs cut, Genmab unveils striking new site near Princeton
OS Therapies announces favorable FDA feedback on external control arm strategy for osteosarcoma drug
“ddd
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest news and trends happening now in the drug discovery and development industry.

MEDTECH 100 INDEX

Medtech 100 logo
Market Summary > Current Price
The MedTech 100 is a financial index calculated using the BIG100 companies covered in Medical Design and Outsourcing.
Drug Discovery and Development
  • MassDevice
  • DeviceTalks
  • Medtech100 Index
  • Medical Design Sourcing
  • Medical Design & Outsourcing
  • Medical Tubing + Extrusion
  • Subscribe to our E-Newsletter
  • Contact Us
  • About Us
  • R&D World
  • Drug Delivery Business News
  • Pharmaceutical Processing World

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search Drug Discovery & Development

  • Home Drug Discovery and Development
  • Drug Discovery
  • Women in Pharma and Biotech
  • Oncology
  • Neurological Disease
  • Infectious Disease
  • Resources
    • Video features
    • Podcast
    • Voices
    • Views
    • Webinars
  • Pharma 50
    • 2025 Pharma 50
    • 2024 Pharma 50
    • 2023 Pharma 50
    • 2022 Pharma 50
    • 2021 Pharma 50
  • Advertise
  • SUBSCRIBE