Log in to save to my catalogue

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spa...

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spa...

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_1689306092

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs

About this item

Full title

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs

Publisher

United States: Public Library of Science

Journal title

PloS one, 2015-06, Vol.10 (6), p.e0129635-e0129635

Language

English

Formats

Publication information

Publisher

United States: Public Library of Science

More information

Scope and Contents

Contents

Prokaryotic proteins are regulated by pupylation, a type of post-translational modification that contributes to cellular function in bacterial organisms. In pupylation process, the prokaryotic ubiquitin-like protein (Pup) tagging is functionally analogous to ubiquitination in order to tag target proteins for proteasomal degradation. To date, several experimental methods have been developed to identify pupylated proteins and their pupylation sites, but these experimental methods are generally laborious and costly. Therefore, computational methods that can accurately predict potential pupylation sites based on protein sequence information are highly desirable. In this paper, a novel predictor termed as pbPUP has been developed for accurate prediction of pupylation sites. In particular, a sophisticated sequence encoding scheme [i.e. the profile-based composition of k-spaced amino acid pairs (pbCKSAAP)] is used to represent the sequence patterns and evolutionary information of the sequence fragments surrounding pupylation sites. Then, a Support Vector Machine (SVM) classifier is trained using the pbCKSAAP encoding scheme. The final pbPUP predictor achieves an AUC value of 0.849 in 10-fold cross-validation tests and outperforms other existing predictors on a comprehensive independent test dataset. The proposed method is anticipated to be a helpful computational resource for the prediction of pupylation sites. The web server and curated datasets in this study are freely available at http://protein.cau.edu.cn/pbPUP/....

Alternative Titles

Full title

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_plos_journals_1689306092

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_plos_journals_1689306092

Other Identifiers

ISSN

1932-6203

E-ISSN

1932-6203

DOI

10.1371/journal.pone.0129635

How to access this item