TY - JOUR
T1 - A Bioinformatics Tool for Identifying Intratumoral Microbes from the ORIEN Dataset
AU - exORIEN Consortium
AU - Wang, Cankun
AU - Ma, Anjun
AU - Li, Yingjie
AU - McNutt, Megan E.
AU - Zhang, Shiqi
AU - Zhu, Jiangjiang
AU - Hoyd, Rebecca
AU - Wheeler, Caroline E.
AU - Robinson, Lary A.
AU - Chan, Carlos H.F.
AU - Zakharia, Yousef
AU - Dodd, Rebecca D.
AU - Ulrich, Cornelia M.
AU - Hardikar, Sheetal
AU - Churchman, Michelle L.
AU - Tarhini, Ahmad A.
AU - Singer, Eric A.
AU - Ikeguchi, Alexandra P.
AU - McCarter, Martin D.
AU - Denko, Nicholas
AU - Tinoco, Gabriel
AU - Husain, Marium
AU - Jin, Ning
AU - Osman, Afaf E.G.
AU - Eljilany, Islam
AU - Tan, Aik Choon
AU - Coleman, Samuel S.
AU - Denko, Louis
AU - Riedlinger, Gregory
AU - Schneider, Bryan P.
AU - Spakowicz, Daniel
AU - Ma, Qin
N1 - Publisher Copyright:
© 2024 The Authors; Published by the American Association for Cancer Research.
PY - 2024/2/5
Y1 - 2024/2/5
N2 - Evidence supports significant interactions among microbes, immune cells, and tumor cells in at least 10%-20% of human cancers, emphasizing the importance of further investigating these complex relationships. However, the implications and significance of tumor-related microbes remain largely unknown. Studies have demonstrated the critical roles of host microbes in cancer prevention and treatment responses. Understanding interactions between host microbes and cancer can drive cancer diagnosis and microbial therapeutics (bugs as drugs). Computational identification of cancer-specific microbes and their associations is still challenging due to the high dimensionality and high sparsity of intratumoral microbiome data, which requires large datasets containing sufficient event observations to identify relationships, and the interactions within microbial communities, the heterogeneity in microbial composition, and other confounding effects that can lead to spurious associations. To solve these issues, we present a bioinformatics tool, microbial graph attention (MEGA), to identify the microbes most strongly associated with 12 cancer types. We demonstrate its utility on a dataset from a consortium of nine cancer centers in the Oncology Research Information Exchange Network. This package has three unique features: species-sample relations are represented in a heterogeneous graph and learned by a graph attention network; it incorporates metabolic and phylogenetic information to reflect intricate relationships within microbial communities; and it provides multiple functionalities for association interpretations and visualizations. We analyzed 2,704 tumor RNA sequencing samples and MEGA interpreted the tissue-resident microbial signatures of each of 12 cancer types. MEGA can effectively identify cancer-associated microbial signatures and refine their interactions with tumors. SIGNIFICANCE: Studying the tumor microbiome in high-throughput sequencing data is challenging because of the extremely sparse data matrices, heterogeneity, and high likelihood of contamination. We present a new deep learning tool, MEGA, to refine the organisms that interact with tumors.
AB - Evidence supports significant interactions among microbes, immune cells, and tumor cells in at least 10%-20% of human cancers, emphasizing the importance of further investigating these complex relationships. However, the implications and significance of tumor-related microbes remain largely unknown. Studies have demonstrated the critical roles of host microbes in cancer prevention and treatment responses. Understanding interactions between host microbes and cancer can drive cancer diagnosis and microbial therapeutics (bugs as drugs). Computational identification of cancer-specific microbes and their associations is still challenging due to the high dimensionality and high sparsity of intratumoral microbiome data, which requires large datasets containing sufficient event observations to identify relationships, and the interactions within microbial communities, the heterogeneity in microbial composition, and other confounding effects that can lead to spurious associations. To solve these issues, we present a bioinformatics tool, microbial graph attention (MEGA), to identify the microbes most strongly associated with 12 cancer types. We demonstrate its utility on a dataset from a consortium of nine cancer centers in the Oncology Research Information Exchange Network. This package has three unique features: species-sample relations are represented in a heterogeneous graph and learned by a graph attention network; it incorporates metabolic and phylogenetic information to reflect intricate relationships within microbial communities; and it provides multiple functionalities for association interpretations and visualizations. We analyzed 2,704 tumor RNA sequencing samples and MEGA interpreted the tissue-resident microbial signatures of each of 12 cancer types. MEGA can effectively identify cancer-associated microbial signatures and refine their interactions with tumors. SIGNIFICANCE: Studying the tumor microbiome in high-throughput sequencing data is challenging because of the extremely sparse data matrices, heterogeneity, and high likelihood of contamination. We present a new deep learning tool, MEGA, to refine the organisms that interact with tumors.
UR - http://www.scopus.com/inward/record.url?scp=85194860075&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194860075&partnerID=8YFLogxK
U2 - 10.1158/2767-9764.CRC-23-0213
DO - 10.1158/2767-9764.CRC-23-0213
M3 - Article
C2 - 38259095
AN - SCOPUS:85194860075
SN - 2767-9764
VL - 4
SP - 293
EP - 302
JO - Cancer Research Communications
JF - Cancer Research Communications
IS - 2
ER -