CROssBAR: comprehensive resource of biomedical relations with knowledge graph representations
View/ Open
Date
2021Author
Doğan, TuncaAtaş, Heval
Joshi, Vishal
Atakan, Ahmet
Rifaioğlu, Ahmet Süreyya
Nalbat, Esra
Nightingale, Andrew
Saidi, Rabie
Volynkin, Vladimir
Zellner, Hermann
Atalay, Rengül Çetin
Martin, Maria
Atalay, Volkan
Metadata
Show full item recordCitation
Doğan, T., Atas, H., Joshi, V., Atakan, A., Rifaioglu, A. S., Nalbat, E., Nightingale, A., Saidi, R., Volynkin, V., Zellner, H., Cetin-Atalay, R., Martin, M., & Atalay, V. (2021). CROssBAR: comprehensive resource of biomedical relations with knowledge graph representations. Nucleic acids research, gkab543. Advance online publication. https://doi.org/10.1093/nar/gkab543Abstract
Systemic analysis of available large-scale biological/biomedical data is critical for studying biological mechanisms, and developing novel and effective treatment approaches against diseases. However, different layers of the available data are produced using different technologies and scattered across individual computational resources without any explicit connections to each other, which hinders extensive and integrative multi-omics-based analysis. We aimed to address this issue by developing a new data integration/representation methodology and its application by constructing a biological data resource. CROssBAR is a comprehensive system that integrates large-scale biological/biomedical data from various resources and stores them in a NoSQL database. CROssBAR is enriched with the deep-learning-based prediction of relationships between numerous data entries, which is followed by the rigorous analysis of the enriched data to obtain biologically meaningful modules. These complex sets of entities and relationships are displayed to users via easy-to-interpret, interactive knowledge graphs within an open-access service. CROssBAR knowledge graphs incorporate relevant genes-proteins, molecular interactions, pathways, phenotypes, diseases, as well as known/predicted drugs and bioactive compounds, and they are constructed on-the-fly based on simple non-programmatic user queries. These intensely processed heterogeneous networks are expected to aid systems-level research, especially to infer biological mechanisms in relation to genes, proteins, their ligands, and diseases.