Childhood Cancer Data Initiative–Funded Projects

CCDI awarded administrative supplements to NCI-Designated Cancer Centers to investigate how data within the CCDI Data Ecosystem can be used to drive innovative discoveries and foster collaborative research in childhood cancer. Research conducted using this funding could help identify critical scientific questions and determine the analytical tools that need to be developed.

The projects are summarized below, grouped by fiscal year awarded and listed alphabetically by institution within each year.

Project Name and Institution | Project Team | Fiscal Year Awarded | Summary

Pilot Study to Link Clinical and Imaging Data from Electronic Medical Records (EMRs), Children’s Brain Tumor Network (CBTN), and the Molecular Characterization Initiative (MCI)

Fred Hutchinson Cancer Center, University of Washington

Sarah Leary, M.D.

Daksha Ranade, M.P.H., M.B.A.

Jeffrey Stevens, B.S.

2024

CCDI’s MCI provides unprecedented genomics data on children with newly diagnosed cancers, including over 2,000 participants with primary tumors of the central nervous system (CNS). The CBTN is a consortium that has collected longitudinal clinical and biological data on over 5,000 CNS tumor participants over the past decade. This project aims to: 1) work with Amazon Web Services teams to develop a pipeline that automates data extraction from EMRs; 2) increase the imaging data available through CBTN and CCDI; and 3) connect CBTN and MCI data at the participant level. The tools, processes, and pipelines developed may be expanded to other institutions, enhancing CCDI as a resource for the childhood cancer research community.

GAIPO: Graph Artificial Intelligence for Pediatric Oncology

Melvin and Bren Simon Comprehensive Cancer Center, Indiana University 

Jing Su, Ph.D.

Kun Huang, Ph.D.

Waqas Amin, M.D.

Karen Pollok, Ph.D.

2024

CCDI is building an ecosystem to bring data and computational tools together to improve prevention, treatment, quality of life, and survivorship. This project proposes to create a scalable AI platform, GAIPO, to overcome the challenge of integrating various data modalities, from -omics data to clinical features, treatments, outcomes, and pathological imaging. GAIPO will be a ready-to-use tool in the CCDI Data Ecosystem and will be shared via GitHub, container-based images, and cloud workflows.

Clinical Impact of Pediatric Tumor Sequencing in Management of Cancer in Pediatric and AYA Patients

Rogel Cancer Center, University of Michigan

Rajen Mody, M.D.

Carl Koschmann, M.D.

Chandan Kumar, Ph.D.

Joshua Goldman, M.D.

David Hanauer, M.D.

Arul Chinnaiyan, M.D., Ph.D.

2024

The completion of the Human Genome Project and advances in low-cost, high-throughput genomic sequencing have marked a new era of precision oncology. This project aims to enhance the impact of next-generation sequencing in pediatric and adolescent oncology by integrating molecular sequencing data with detailed clinical information from the Michigan Oncology Sequencing Center study’s CCDI cohort. By extracting clinical data from electronic health records using EMERSE, the project seeks to develop a standardized clinico-genomic database template in REDCap. The study will focus on translating actionable molecular findings from the 1,002-patient CCDI cohort into treatments matched to the identified molecular alterations. This work could support targeted treatments and immunotherapy evaluation and provide a uniform resource for data harmonization across the CCDI Data Ecosystem.

Integrating Brain Development Data with Pediatric and AYA Cancer Profiles to Develop New Therapies

UCLA Health Jonsson Comprehensive Cancer Center, University of California, Los Angeles

Aparna Bhaduri, Ph.D.

Harley Ian Kornblum, M.D., Ph.D.

Riki Kawaguchi, Ph.D.

2024

Pediatric, adolescent, and young adult (AYA) brain cancer has limited treatment options. Through national efforts like CCDI's MCI and other projects, extensive data on molecular profiles and outcomes of these cancers are available through the CCDI Data Ecosystem. Parallel efforts through the BRAIN Initiative and other atlas-scale consortia have similarly characterized brain cell types, focusing on human brain development. This project proposes to integrate data from developing human brains with pediatric and AYA brain cancer data using network-based methods and novel informatics tools. This integration aims to identify similarities and differences in cell types, gene programs leveraged by cancers, and interactions between cancer and normal brain cells, ultimately enabling the development of new therapeutic options.

Creating the Childhood Cancer Isoform Atlas: Informatics Tools and Multi-Omics Insights for Immunotherapy Targets

Abramson Cancer Center, University of Pennsylvania

Yi Xing, Ph.D.

Richard Aplenc, M.D., Ph.D.

2023

Alternative splicing is a cellular process that allows a gene to code for many different proteins, but this process is often disrupted in cancer. A multidisciplinary team at the University of Pennsylvania proposes to create informatics tools that enable the use of CCDI data to map the various proteins, or isoforms, that result from alternative splicing. This project aims to: 1) map alternative isoforms into a new resource called the Childhood Cancer Isoform Atlas, 2) identify isoforms that could be immunotherapy targets, and 3) integrate and visualize data on isoforms and targets in childhood cancers. All software developed will be open source and accessible to the research community, facilitating the discovery of immunotherapy targets for hard-to-treat childhood cancers.

Real-World Molecularly Targeted Treatment Registry (MaTTeR): A Pilot Study to Enrich CCDI Data Utilizing Directed Electronic Medical Record Extraction

Boston Children’s Hospital and Dana-Farber Cancer Institute

Yana Pikman, M.D.

Suzanne Forrest, M.D.

Kee Yeo, M.D.

Katherine Janeway, M.D.

2023

Doctors are increasingly using therapies that target specific gene changes in cancer cells, known as molecularly targeted therapies (MTT). Collecting and sharing data on how well these therapies work (their efficacy and toxicity), as well as on specific doses and drug combinations related to MTTs, is critical, especially when they are given outside of clinical trials. This project aims to implement an “Electronic Medical Record Search Engine” to identify patients who received MTTs outside of clinical trials. Investigators will then create a Real-World Molecularly Targeted Treatment Registry (MaTTeR) within the CCDI framework, using genomic data from the Dana-Farber Cancer Institute and the CCDI Data Ecosystem. They also propose to launch a data visualization platform in the ecosystem that doctors and researchers can use to explore and apply MaTTeR in their clinical care and research projects.

Enhancing Precision of Pediatric Cancer Molecular Targets by Aggregating CCDI Genomic Data to Pediatric Cancer Knowledgebase

Comprehensive Cancer Center, St. Jude Children’s Research Hospital

Jinghui Zhang, Ph.D.

Xiaotu Ma, Ph.D.

Clay Mcleod, M.S.

Michael Rusch, B.A.

2023

In recent years, doctors and researchers have learned a lot about how genetic changes drive childhood and young adult cancers. Pediatric Cancer Knowledgebase version 2 provides dynamic visualizations of genetic changes in 300 molecular subtypes of childhood cancer. This project’s goal is to enhance the FDA’s Relevant Molecular Target List, characterized in CCDI’s Molecular Targets Platform, to support cancer care. This involves developing an application programming interface for summarizing statistics and patterns related to genetic changes in childhood cancers and for integrating CCDI data sets.

Machine Learning Framework for Accurate Childhood Acute Myeloid Leukemia Subtype Identification

Fred & Pamela Buffett Cancer Center, University of Nebraska

Shibiao Wan, Ph.D.

Joseph Khoury, M.D.

Jieqiong Wang, Ph.D.

2023

Acute myeloid leukemia (AML) in children has many different subtypes, each characterized by different genetic alterations. This project aims to improve the identification of subtypes by integrating multi-omics data, including genomics, transcriptomics, and epigenetics. The team proposes to develop a machine learning framework that would refine risk stratification, diagnosis, and treatment selection for children with AML. This approach also holds promise for identifying subtypes of other childhood and young adult cancers, including ultra-rare tumors.

Unlocking the Potential of Extrachromosomal Circular DNA (eccDNA) as Prognostic Markers in Childhood and AYA Cancers

Sanford Burnham Prebys Medical Discovery Institute

Lukas Chavez, Ph.D.

Yuk-Lap (Kevin) Yip, Ph.D.

2023

Extrachromosomal circular DNA (eccDNA) is a type of DNA that plays a role in the amplification of oncogenes in cancer. This project aims to increase understanding of how eccDNA affects the development, spread, and prognosis of childhood, adolescent, and young adult cancers. Investigators will use a computational pipeline to identify eccDNAs from whole-genome sequencing data from more than 3,500 tumor samples. These findings will be made available to the public through the CCDI Data Ecosystem.

Automated Classification of Pediatric Soft Tissue Sarcoma from Histopathology Images

The Jackson Laboratory

Jill Rubinstein, M.D., Ph.D.

Jeffrey Chuang, Ph.D.

Carol Bult, Ph.D.

2023

Soft tissue sarcomas, while rare in children and young adults, have a range of subtypes with varying prognoses and clinical characteristics. The Jackson Laboratory investigators have expertise in gathering, integrating, and analyzing data from diverse sources and in computational oncology—using computers to model tumor characteristics, responses to therapies, and more. The aim of this project is to expand the collection of digitized whole-slide images of pediatric soft tissue sarcomas and to use computational techniques to classify and diagnose soft tissue sarcomas more accurately.

Enhancing Pediatric Cancer Research with AI-Driven Diagnostics

USC Norris Comprehensive Cancer Center, University of Southern California and Children’s Hospital Los Angeles

James Amatruda, M.D.

Jaclyn Biegel, Ph.D.

Xiaowu Gai, Ph.D.

Bruce Pawel, M.D.

Jennifer Cotter, M.D.

Mikako Warren, M.D.

Fariba Navid, M.D.

2023

The USC Norris Comprehensive Cancer Center (NCCC), in collaboration with Children’s Hospital Los Angeles (CHLA), proposes to develop an online diagnostic resource powered by augmented artificial intelligence (AI). The team will use this technology to build a classifier, called “Multi-Modal AI-Based Diagnosis for Pediatric Oncology,” that can sort vast amounts of imaging and molecular data to help determine a specific diagnosis for central nervous system (CNS) tumors, sarcomas, and ultimately all childhood and young adult cancers. Additionally, NCCC and CHLA will collect whole-slide images from 599 solid tumors and whole-genome methylome data from 200 CNS tumors. These data will be added to the existing “OncoKids - NGS Panel for Pediatric Malignancies” data set within the CCDI Data Ecosystem.

Leveraging ExtractEHR and FHIR Framework for Enhancing Clinical Data Integration

Winship Cancer Institute, Emory University, and Children’s Hospital of Philadelphia

Tamara Miller, M.D.

Allison Heath, Ph.D.

Richard Aplenc, M.D., Ph.D.

2023

ExtractEHR retrieves childhood cancer data, such as hospital encounters, laboratory test results, medications, outcomes, and pathology reports, from electronic health records (EHRs). It then transforms these data into a format that is readable and understandable for health care professionals. Fast Healthcare Interoperability Resources (FHIR) establishes a framework for standardizing EHR data, making them easier and faster to exchange and share. The goal of this project is to use ExtractEHR and FHIR to match patients’ treatment and outcomes data with their molecular data in the CCDI Data Ecosystem. The team proposes to extract clinical data from two large children’s hospitals, Children’s Healthcare of Atlanta and the Children’s Hospital of Philadelphia, and incorporate them into the CCDI Data Ecosystem.

If you would like to reproduce some or all of this content, see Reuse of NCI Information for guidance about copyright and permissions. In the case of permitted digital reproduction, please credit the National Cancer Institute as the source and link to the original NCI product using the original product's title; e.g., “Childhood Cancer Data Initiative–Funded Projects was originally published by the National Cancer Institute.”
