SLPred: a multi-view subcellular localization prediction tool for multi-location human proteins

Tarih
2022Yazar
Özsarı, GökhanRifaioğlu, Ahmet Süreyya
Atakan, Ahmet
Tunca, Doğan
Martin, Maria Jesus
Atalay, Rengül Çetin
Atalay, Volkan
Üst veri
Tüm öğe kaydını gösterKünye
Özsarı, G., Rifaioglu, A. S., Atakan, A., Doğan, T., Martin, M. J., Çetin Atalay, R., & Atalay, V. (2022). SLPred: a multi-view subcellular localization prediction tool for multi-location human proteins. Bioinformatics (Oxford, England), 38(17), 4226–4229. https://doi.org/10.1093/bioinformatics/btac458Özet
Accurate prediction of the subcellular locations (SLs) of proteins is a critical topic in protein science. In this study, we present SLPred, an ensemble-based multi-view and multi-label protein subcellular localization prediction tool. For a query protein sequence, SLPred provides predictions for nine main SLs using independent machine-learning models trained for each location. We used UniProtKB/Swiss-Prot human protein entries and their curated SL annotations as our source data. We connected all disjoint terms in the UniProt SL hierarchy based on the corresponding term relationships in the cellular component category of Gene Ontology and constructed a training dataset that is both reliable and large scale using the re-organized hierarchy. We tested SLPred on multiple benchmarking datasets including our-in house sets and compared its performance against six state-of-the-art methods. Results indicated that SLPred outperforms other tools in the majority of cases.