Dissecting Protein-RNA Interaction Network in Human Genome

Indiana University-Purdue University Indianapolis (IUPUI) === In eukaryotes, gene regulation is a complex multilevel process comprising of transcriptional, post-transcriptional, and post-translational control. Although the regulation at transcriptional and post-translational levels is gradually be...

Full description

Bibliographic Details
Main Author: Hashemikhabir, Seyedsasan
Other Authors: Liu, Xiaowen
Language:en_US
Published: 2020
Online Access:http://hdl.handle.net/1805/23682
Description
Summary:Indiana University-Purdue University Indianapolis (IUPUI) === In eukaryotes, gene regulation is a complex multilevel process comprising of transcriptional, post-transcriptional, and post-translational control. Although the regulation at transcriptional and post-translational levels is gradually being understood, protein machinery and the mechanisms underlying the post-transcriptional regulation remain to be elucidated. In the first study of this dissertation, I designed and implemented a database of RNA Binding Protein (RBP) Expression and Disease Dynamics (READ-DB: darwin.soic.iupui.edu), a non-redundant, curated database of human RBPs. This RBP knowledge base includes data from different experimental studies providing a one stop portal for understanding the expression, evolutionary trajectories, and disease dynamics of RBPs in the context of post-transcriptional regulatory networks. Despite the existence of several experimental procedures to understand the function of RBPs, a lack of a proper computational method to profile differential occupancy limits the scope of research. In the second study, I built a scalable framework for comparing genome-wide protein occupancy profiles among cell-types data, to uncover alterations in protein-RNA interactomes. diffHunter (github.com/Sasanh/diffHunter), is a window based peak calling and profile comparison method that can efficiently store the base-pair level read information of every given sample in a NoSQL (Not Only SQL) database. It identifies and quantitates the genome-wide binding differences between a pair of samples in two stages: Peak Calling and Differential Binding Identification. Identifying such regions enables us to compare the biologically important regions that differ between two conditions. Finally, I studied A-to- I RNA editing as one of the special functions of an RBPs’ family. ADAR family RBPs are the primary driver in the conversion of adenosine to inosine (A-to-I) within mRNA. I developed a Cancer-specific RNA-editing Identification using Somatic variation Pipeline (CRISP: github.com/Sasanh/CRISP) a computational framework for accurate identification of A-to-I editing events contributing to the prognosis and stratification of glioblastoma subtypes as well as the editing events that can serve as molecular classifiers for therapeutic approaches. I proposed two models that explains the cis-regulatory role of A-to-I editing events in noncoding regions in modulating the post-transcriptional regulation of target transcripts in glioblastoma. === 2022-08-17