October 27-28, 2025
St. Jude Children's Research Hospital
The St. Jude Knowledge in Data Science Symposium will enrich the collaborative data science culture by sharing and brainstorming ways to gain the most valuable scientific knowledge from biomedical data. It will bring together biologists, clinicians, biostatisticians, computational biologists, bioinformaticians, image analysts, structural biologists, software engineers, and others, as data science is an intrinsically multidisciplinary field.
In invited oral sessions and a contributed poster reception, experts from St. Jude and elsewhere, working at the forefront of their fields, will share the data science challenges in biomedical research that they have encountered and solved, as well as those that remain.
Join developmental workshops, explore solutions to pressing data science challenges from St. Jude Biohackathon teams, and attend lunchtime panels on current topics.
Don’t miss the keynote presentations by Dr. Faisal Mahmood, Associate Professor at Harvard Medical School, and Vivek Natarajan, Research Scientist at Google DeepMind!
Hosts
-
- Chief Data Scientist and Senior Vice President for Data Science
- Director, Center of Excellence for Data Driven Discovery
- George J. Pedersen Endowed Chair in Biological Data Sciences
- Department of Structural Biology
- St. Jude Graduate School of Biomedical Sciences
- Comprehensive Cancer Center
-
- Director, Center for Spatial OMICs
- Department of Developmental Neurobiology
- Department of Cell & Molecular Biology
- Comprehensive Cancer Center
-
- Director of Biostatistics Courses, St. Jude Graduate School of Biomedical Sciences
- Department of Biostatistics
- Comprehensive Cancer Center
- St. Jude Graduate School of Biomedical Sciences
-
- Director, Lead Discovery Informatics
- Department of Chemical Biology & Therapeutics
- Comprehensive Cancer Center
- St. Jude Graduate School of Biomedical Sciences
-
Renato Umeton, PhD
- Vice President of Data Sciences and Chief of Artificial Intelligence
- Office of Data Science
Monday, October 27
Time | Event | Location
8:00 am-9:00 am | Check in & Breakfast | MTC Atrium/Lobby
9:00 am-9:30 am | Welcome: Introduction to the Data Science Initiative | MTC Auditorium
M. Madan Babu, PhD, FRS
9:30 am-10:10 am | Keynote | MTC Auditorium
Vivek Natarajan, Google DeepMind
10:10 am-11:10 am | BioHackathon Highlights | MTC Auditorium
Jared Andrews, PhD and Susanna Downing, PhD
11:10 am-11:45 am | Break | MTC Auditorium
Noon-1:00 pm | Lunchtime Lightning Talk | MTC Auditorium
Stanley Pounds, PhD
Malvika Sharan, PhD
1:00 pm-1:15 pm | Break | MTC Atrium/Lobby
1:15 pm-3:15 pm | Oral Session 1A: AI-Driven Imaging and Quantitative Analysis in Cancer and Cell Biology | MTC Lecture Hall
Moderator: Jasmine Plummer, PhD
- Quantitative Live-Cell Imaging for Comprehensive Analysis of Phenotypic Heterogeneity
  Anand Singh, PhD and Zeliha Kilic, PhD
- Bridging the gap in histopathology image analysis of pediatric solid tumors with unsupervised representation learning
  Ali Foroughi Pour, PhD
- Mapping Nuclear Remodeling in Neuronal Differentiation with Multimodal Imaging and Quantitative Analysis
  Kirby Campbell, PhD
- An AI-Guided Platform for Scalable Expert Annotation of Pediatric Cancer Pathology Images
  Haoran Chen, PhD
1:15 pm-3:15 pm | Oral Session 1B: Mitochondrial Single Cell Omics | MTC Auditorium
Moderator: Julie Waits, PhD
- Mitovolve Models the Evolution of the Prevalence of Mitochondrial Mutations in Pediatric Leukemia
  Stanley Pounds, PhD
- Deciphering the role of mitochondrial DNA somatic mutations in leukemia tumor evolution
  Gang Wu, PhD
- NetBID identifies hidden drivers and pathways associated with pathogenic mitochondrial DNA variants in leukemia
  Qingfei Pan, PhD
- Custom Genomic Pipelines Reveal New Insights into Leukemia Mitochondrial Genetics
  Mondira Kundu, MD, PhD
1:15 pm-3:15 pm | Oral Session 1C: Advancing Predictive Medicine Through Longitudinal Clinical Data Analysis | PRT-E1003
Moderator: Sadie Mirzaei, PhD
- Machine Learning Using High-Dimensional Longitudinal Clinical Data Improves the Prediction of Acute and Chronic Graft-versus-Host Disease after Allogeneic Hematopoietic Cell Transplantation
  Dinesh Keerthi
- Early-Life Prediction of Hospitalization and Emergency Department Visits in Patients with Sickle Cell Disease Using Laboratory and Social Determinants of Health (SDoH) Data
  Chengzhou Wu, PhD
- Temporal Deep Learning for Longitudinal Modeling of Fetal Hemoglobin in Sickle Cell Disease
  Minzhang Zheng
- Turning Data into Action: Real-Time Patient Tracking to Prevent Treatment Abandonment in Pediatric Oncology in El Salvador
  Fuentes-Alabi de Aparicio, MD, MPH
3:15 pm-3:30 pm | Break | MTC Atrium/Lobby
3:30 pm-5:30 pm | Oral Session 2A: Building Reliable and Interpretable AI Systems for Medical Image Analysis | MTC Auditorium
Moderator: Renato Umeton, PhD
- Often Wrong, Never in Doubt: GPT-4oV’s Performance Vs. Confidence and Explanations for Head CT Hemorrhage Interpretation
  Briana Malik, BS
- Region-Aware Reconstruction Strategy for Pretraining fMRI Foundation Model
  Ruthwik Reddy Doodipala, B.Tech, MS Candidate
- Bayesian Joint Mediation Model to Identify Brain Regions Mediating Negative Effects of Radiation Therapy on Neurocognition in Medulloblastoma Patients
  Saikat Nandy, PhD
- AI for Automated Body Composition Analysis on Pediatric and Adult CT Images
  Rafah Mresh, MD
- Resiliency to Suboptimal Training Data in Machine Learning Segmentation Models for Radiology
  James Pierce, BS
3:30 pm-5:30 pm | Oral Session 2B: Single-Cell and Systems Immunology in Pediatric Cancer and Immunotherapy | ARC M1003
Moderator: Jeremy Chase Crawford, PhD
- Single-cell immune profiling reveals long-lasting B cell convergence to distinct broadly reactive epitopes following vaccination with chimeric influenza virus hemagglutinins
  Lei Li, PhD
- Cytotoxic bystander activation of endogenous CAR-negative T cells in response to CD19-CAR T cell therapy
  Jia-Hua Qu
- Evaluating CAR T cell response in pediatric brain tumor patients using CSF samples and multi-omics approaches
  Jorge Andres Ibanez Vega, PhD
- An AI agent applied to a pan-pediatric cancer single cell atlas discovers convergent and tumor specific gene expression programs
  Alessandro Davini, MSc
- Single cell transcriptional analysis reveals leukocyte heterogeneity and impacts of JAK1/2 modulation of cytokine signaling in a mouse model of primary hemophagocytic lymphohistiocytosis (pHLH)
  Aidin Foroutan, PhD
5:30 pm-7:00 pm | Poster Session/Reception | MTC Atrium/Lobby
Tuesday, October 28
Time | Event | Location
8:00 am-9:00 am | Check in & Breakfast | MTC Atrium/Lobby
9:00 am-9:30 am | Poster Awards | MTC Auditorium
Stanley Pounds, PhD
9:30 am-9:35 am | Welcome | MTC Auditorium
Renato Umeton, PhD
9:35 am-10:15 am | Keynote | MTC Auditorium
Faisal Mahmood, PhD, Harvard Medical School
10:15 am-11:30 am | Oral Session 2C: From Genomic Insights to Policy Impact: Advancing Pediatric Precision Medicine | MTC Auditorium
Moderator: Malvika Sharan, PhD
- Applying Data Science to Transform National Cancer Policies for Children: AI x SJCARES Policies
  Desiree Becerra, MPP, MSc
- Large-scale meQTL mapping reveals treatment-related differences in genetic regulation of DNA methylation in survivors of childhood cancer from the St. Jude Lifetime Cohort (SJLIFE)
  Tiffany Eulalio, PhD
- Novel Trajectory-Informed Mendelian Randomization Demonstrates Consistent Causal Protective Effect of Fetal Hemoglobin on Clinical Outcomes in Children with Sickle Cell Disease
  Guolian Kang, PhD
11:30 am-Noon | Break | MTC Atrium/Lobby
Noon-1:00 pm | Lunchtime Lightning Talk - ABDS | MTC Atrium/Lobby
1:00 pm-1:20 pm | Break | MTC Atrium/Lobby
1:30 pm-3:30 pm | Oral Session 3A: From Data to Drugs: Accelerating Drug Discovery Through Data-Driven Methods | MTC Auditorium
Moderators: Anang Shelat, PhD and Christoph Gorgulla, PhD
- Using Bespoke Data Analysis to Improve Throughput, Communication, and Accuracy in Basic Lab Science: Don’t Forget the Small Things
  Mary Ashley Rimmer, PhD
- Harnessing Data Science to Enable N-of-1 Antisense Therapies in Personalized Medicine
  Davneet Kaur, PhD
- Direct-to-Biology Enabled Molecular Glue Discovery
  Uma Neelakantan
- Developmental dynamics of GPCRs provide insights for drug development for fetal, pediatric, and geriatric population
  Katarina Nemec, PhD, PharmD
1:30 pm-3:30 pm | Oral Session 3B: AI/ML Modeling in Computational Systems Biology | ARC M1003
Moderator: Stanley Pounds, PhD
- Deep Learning Meets High-Throughput CRISPR Editing: Utilizing Novel CHANCE-net and CHANCE-seq platforms for Safe Lead Target Identification
  Jacqueline Chyr, PhD
- AI-driven mRNA optimization for cell-type-specific protein expression
  Steve Mathew D A, MTech
- Proteome-wide computational analyses reveal links between protein condensate formation and RNA biology
  Swarnendu Tripathi, PhD
- From Literature to Structured Data: Hybrid Manual–AI Curation of the Human Kinome
  Katalin Parej, PhD
3:30 pm-4:00 pm | Break | MTC Atrium/Lobby
1:30 pm-4:30 pm | Workshop 1A: Introduction to Nextflow: Building Scalable Bioinformatics Pipelines | ARC M1110
1:30 pm-4:30 pm | Workshop 1B: Mathematical modeling of infectious diseases | TRB East O4004
1:30 pm-4:30 pm | Workshop 1C: KIDS25 Research-based Germline Variant Classification Workshop | ARC MP163
5:00 pm-5:30 pm | Closing Remarks | MTC Auditorium
Stanley Pounds, PhD and Renato Umeton, PhD
Abstracts
The St. Jude Knowledge in Data Science Symposium (KIDS25) aims to create a dynamic forum for the dissemination and advancement of scholarly work in data science for biomedical research. Submitted abstracts will be peer-reviewed and considered for one of several presentation formats, including poster presentations, oral presentations, roundtable discussions, skills development workshops, and panel discussions. All individuals planning to attend, regardless of role, discipline, or level of training, are encouraged to submit an abstract. We are especially committed to promoting the participation of multidisciplinary trainees and early-career professionals.
Abstract Categories
Session categories include:
- Imaging and Image Analysis
- AI/ML & Advanced Analytics
- Omics & Genomic Sciences
- Computational & Systems Biology
- Drug Discovery & Translational Therapeutics
- Population Health & Clinical Research
Why Submit an Abstract?
- Contribute to the future of biomedical data science by sharing innovative work that drives the field forward
- Gain recognition and visibility among leaders in data science
- Engage with a diverse community of experts across disciplines
- Receive feedback and build connections with thought leaders in the field
The abstract submission deadline has passed.
We appreciate your interest and enthusiasm in sharing your work with the KIDS25 community.
For questions, please contact: ariel.maclin@stjude.org