DSpace Repository

ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature

Show simple item record

dc.contributor.author Dalkiran, Alperen
dc.contributor.author Rifaioglu, Ahmet Sureyya
dc.contributor.author Martin, Maria Jesus
dc.contributor.author Cetin-Atalay, Rengul
dc.contributor.author Atalay, Volkan
dc.contributor.author Dogan, Tunca
dc.date.accessioned 2019-11-25T10:57:19Z
dc.date.available 2019-11-25T10:57:19Z
dc.date.issued 2018-09
dc.identifier.citation Dalkiran, A., Rifaioglu, A. S., Martin, M. J., Cetin-Atalay, R., Atalay, V., & Dogan, T. (2018). ECPred: A tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature. Bmc Bioinformatics, 19, 334. https://doi.org/10.1186/s12859-018-2368-y tr_TR
dc.identifier.issn 1471-2105
dc.identifier.uri http://openaccess.adanabtu.edu.tr:8080/xmlui/handle/123456789/614
dc.identifier.uri https://doi.org/10.1186/s12859-018-2368-y
dc.description WOS indeksli yayınlar koleksiyonu. / WOS indexed publications collection.
dc.description.abstract Background: The automated prediction of the enzymatic functions of uncharacterized proteins is a crucial topic in bioinformatics. Although several methods and tools have been proposed to classify enzymes, most of these studies are limited to specific functional classes and levels of the Enzyme Commission (EC) number hierarchy. Besides, most of the previous methods incorporated only a single input feature type, which limits the applicability to the wide functional space. Here, we proposed a novel enzymatic function prediction tool, ECPred, based on ensemble of machine learning classifiers. Results: In ECPred, each EC number constituted an individual class and therefore, had an independent learning model. Enzyme vs. non-enzyme classification is incorporated into ECPred along with a hierarchical prediction approach exploiting the tree structure of the EC nomenclature. ECPred provides predictions for 858 EC numbers in total including 6 main classes, 55 subclass classes, 163 sub-subclass classes and 634 substrate classes. The proposed method is tested and compared with the state-of-the-art enzyme function prediction tools by using independent temporal hold-out and no-Pfam datasets constructed during this study. Conclusions: ECPred is presented both as a stand-alone and a web based tool to provide probabilistic enzymatic function predictions (at all five levels of EC) for uncharacterized protein sequences. Also, the datasets of this study will be a valuable resource for future benchmarking studies. ECPred is available for download, together with all of the datasets used in this study, at: https://github.com/cansyl/ECPred. ECPred webserver can be accessed through http://cansyl.metu.edu.tr/ECPred.html. tr_TR
dc.language.iso en tr_TR
dc.publisher BMC BIOINFORMATICS / BMC tr_TR
dc.relation.ispartofseries 2018;Volume: 19 Article Number: 334
dc.subject Protein sequence tr_TR
dc.subject EC numbers
dc.subject Function prediction
dc.subject Machine learning
dc.subject Benchmark datasets
dc.subject SUBFAMILY CLASS
dc.subject ENZYMES
dc.subject Biochemical Research Methods
dc.subject Biotechnology & Applied Microbiology
dc.subject Mathematical & Computational Biology
dc.title ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature tr_TR
dc.type Article tr_TR


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account