Raihanul Bari Tanvir

Ph.D. Candidate


Lecture Information:
  • June 26, 2023
  • 10:00 AM
  • CASE 349 & Zoom
Headshot of Ph.D. Candidate Raihanul Bari Tanvir

Speaker Bio

Raihanul Bari Tanvir is a Ph.D. candidate at the Knight Foundation School of Computing and Information Sciences (KFSCIS) at Florida International University (FIU). He has been working as a Graduate Research Assistant under the supervision of Dr. Ananda M. Mondal and Dr. Giri Narasimhan. He is a part of the Machine Learning and Data Analytics Group (MLDAG), as well as the Bioinformatics Research Group (BioRG). His research interests are at the intersection of Machine Learning, Data Science, and Computational Biology. Raihanul published several papers in journals such as Data, BMC Bioinformatics, International Journal of Molecular Sciences, and conferences such as IEEE BIBM, BIOCOMP, RECOMB-CCB. He holds a B.Sc. in Computer Science and Engineering from Bangladesh University of Engineering and Technology (BUET). Prior to joining FIU, he worked as a software engineer at a REVE Systems in Bangladesh for one and a half years. After graduation, he has agreed to join Boehringer Ingleheim as a bioinformatics research post doc.

Description

Biomarkers are highly significant in cancer research, diagnosis, and treatment as they help to comprehend the biological response mechanisms following an internal or external intervention. The next-generation sequencing technologies have significantly increased the generation of genomic, epigenomic, and transcriptomic data, resulting in data-driven biomarker discovery using both single-omics and multi-omics. The existing statistical approaches in identifying biomarker genes using comparative transcriptomics between cancer patients and healthy controls have two major shortcomings. Shortcoming 1: The existing approaches overlook critical biological phenomena like inter-gene association and groups of genes working together to trigger a particular ailment alongside crosstalk among these groups of genes. Shortcoming 2: These approaches fail to consider individual genetic and epigenetic variations in a tumor, termed intratumor heterogeneity (ITH). The state-of-the-art multi-omics analyses in cancer research use graph neural network-based (GNN-based) approaches for biomarker discovery and cancer subtype prediction. But these approaches have major limitations since they fail to determine the relative significance of the neighboring nodes (in this case, patients) in the graph and identify the most influential omics data when it comes to downstream analyses, such as cancer subtype classification, patient stratification, etc. We propose graph-theoretic, feature selection-based machine learning, and graph attention network-based approaches to overcome the shortcoming 1, shortcoming 2, and the limitations of GNN-based multi-omics analyses, respectively.

Graph-Theoretic Approaches in Biomarker Discovery: We hypothesize that a group of genes work together by forming a clique-like structure, and a bipartite graph can represent the crosstalk between two groups of genes that form clique-like structures. To prove this hypothesis, gene expression data of three cancer types were analyzed separately. The biomarkers identified using the proposed graph-theoretic approaches were prognostically significant. Feature Selection-Based Approach in Evaluating ITH: ITH is defined by the diversity of the tumor cell subpopulations, which is the biggest obstacle in precision medicine. The major limitation of the state-of-the-art method in estimating ITH level using transcription profile is that it uses expression values of all the genes. We hypothesize that a reduced set of important genes (biomarkers) is sufficient to estimate the level of ITH. Our proposed deep learning-based feature selection approach identified a reduced set of genes, effectively estimating ITH levels in different patients. Multi-Omics Integration Using Graph Attention Networks (MOGAT): We propose MOGAT, a novel multi-omics integration approach, leveraging a graph attention network model that incorporates graph-based learning with an attention mechanism. It performs better than other GNN-based approaches in cancer subtype prediction and patient characterization.
Overall, this dissertation is a significant step forward in discovering cancer biomarkers and cancer subtype prediction, which could help physicians select appropriate treatment strategies and thus reduce patients’ suffering.