Files in this item



application/pdfYAN-DISSERTATION-2021.pdf (11MB)
(no description provided)PDF


Title:Functional analysis of low grade glioma genetic variants using statistics and physics-inspired deep learning methods
Author(s):Yan, Jialu
Director of Research:Song, Jun S
Doctoral Committee Chair(s):Dahmen, Karin A
Doctoral Committee Member(s):Zhao, Sihai Dave; Kim, Sangjin
Department / Program:Physics
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):functional genomics
low-grade glioma
genetic variants
convolutional neural network
tensor train decomposition
Abstract:Large-scale genome-wide association studies (GWAS) have implicated thousands of germline variants in modulating individual's risk of diseases, including cancer. For low grade gliomas (LGGs), at least 25 risk loci have been identified, whose molecular functions, however, remain largely unknown. Understanding how the risk loci function in tumorigenesis poses a major challenge in the field, owing to potential confounding factors and the lack of relevant types of experimental data in the brain. Based on statistical methods and physics-inspired deep learning methods, this work presents a comprehensive computational framework for performing functional analysis of LGG GWAS loci. We hypothesized that GWAS loci contain causal single nucleotide polymorphisms (SNPs) which reside in accessible open chromatin regions and modulate the expression of target genes by perturbing the binding affinity of transcription factors (TFs). We performed an integrative analysis using genomic, epigenomic and transcriptomic data from public repositories and identified the candidate (causal SNP, target gene, TF) triplets that might contribute to oncogenesis. We assessed a candidate causal SNP's potential regulatory role via convolutional neural network (CNN) and simulated-annealing-based interpretation methods. Finally, we applied tensor train decomposition (TT-decomposition) to neural network parameter reduction and demonstrated that the reduced convolutional neural network performed well. This work helps understand the molecular mechanisms underlying genetic risk factors of low grade glioma. The CNN and TT-decomposition-based deep learning approach may benefit future functional genomic studies, where TF chromatin immunoprecipitation followed by sequencing (ChIP-seq) data are not readily available in the brain.
Issue Date:2021-04-21
Rights Information:Copyright 2021 Jialu Yan
Date Available in IDEALS:2021-09-17
Date Deposited:2021-05

This item appears in the following Collection(s)

Item Statistics