False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks

Jasin MacHkour, Michael Muma, Daniel P. Palomar

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

Genomics biobanks are information treasure troves with thousands of phenotypes (e.g., diseases, traits) and millions of single nucleotide polymorphisms (SNPs). The development of methodologies that provide reproducible discoveries is essential for the understanding of complex diseases and precision drug development. Without statistical reproducibility guarantees, valuable efforts are spent on researching false positives. Therefore, scalable multivariate and high-dimensional false discovery rate (FDR)-controlling variable selection methods are urgently needed, especially, for complex polygenic diseases and traits. In this work, we propose the Screen-T-Rex selector, a fast FDR-controlling method based on the recently developed T-Rex selector. The method is tailored to screening large-scale biobanks and it does not require choosing additional parameters (sparsity parameter, target FDR level, etc). Numerical simulations and a real-world HIV-1 drug resistance example demonstrate that the performance of the Screen-T-Rex selector is superior, and its computation time is multiple orders of magnitude lower compared to current benchmark knockoff methods.

Original languageEnglish
Title of host publicationProceedings of the 22nd IEEE Statistical Signal Processing Workshop, SSP 2023
PublisherIEEE Computer Society
Pages666-670
Number of pages5
ISBN (Electronic)9781665452458
DOIs
Publication statusPublished - 2023
Event22nd IEEE Statistical Signal Processing Workshop, SSP 2023 - Hanoi, Viet Nam
Duration: 2 Jul 20235 Jul 2023

Publication series

NameIEEE Workshop on Statistical Signal Processing Proceedings
Volume2023-July

Conference

Conference22nd IEEE Statistical Signal Processing Workshop, SSP 2023
Country/TerritoryViet Nam
CityHanoi
Period2/07/235/07/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • FDR control
  • GWAS
  • HIV-1 drug resistance
  • Screen-T-Rex selector
  • high-dimensional variable selection

Fingerprint

Dive into the research topics of 'False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks'. Together they form a unique fingerprint.

Cite this