The Big Data Problem in Genomic Medicine: Acquisition, Processing, Interpretation, storage and Retrieval
Speaker
Kyle Retterer, MS
Chief Data Science Officer, Geisinger
Moderator
Marc S. Williams, MD, FACMG
Professor and Director Emeritus, Department of Genomic Health, Geisinger
The realization of genomics-first medicine depends on effectively harnessing population-scale genomic data within the clinical setting and integrating it with rich clinical and phenomic information. This talk explores the critical infrastructure required to support such ambitious initiatives, examining the challenges of building scalable systems for secure storage, efficient annotation, and rapid querying of these massive datasets.
Crucially, we will discuss the role of integrating genomic data with detailed phenomic information to unlock clinical utility, enabling real-time insights, as well as the challenges presented by genomics-first implementation, including potentially under-appreciated variability in the penetrance and phenotypic spectrum of genetic conditions. We will also examine the innovative application of Large Language Models (LLMs) to extract granular phenotypic insights from large-scale observational data, enabling a deeper understanding of genotype-phenotype relationships and paving the way for personalized, data-driven healthcare.
Webinar Questions & Answers