N test.Sn ( )c 90.48,92.46 73.41,97.22 Sp ( )c 94.05,98.02 44.84,100 Acc ( )c 92.86,94.84 71.03,92.66 MCCc 0.87,0.90 0.49,0.Predictor iSMP-GreyaP252! : ?52{Int?52=5?Int?52=5? ?9?PSEApredba252! ?52{Int?52=5?Int?52=5? !2 252! w9:25|10128 ?52{50?50!b cSee footnote a of Table 1. From ref. [2]. See the discussion in the text and Eq.19 for why the results obtained by the 5fold cross-validation test were not unique. doi:10.1371/journal.pone.0049040.twhere the symbol Int is the integer-truncating operator meaning to take the integer part for the number in the bracket right after it. The result of Eq.19 indicates that the number of possible Title Loaded From File combinations of taking one-fifth proteins from each of the two subsets, z and { , for conducting the 5-fold cross-validation will be greater than 9:25|10128 , which is an astronomical figure, too large to be practically feasible. Actually, in their study [2], Verma et al. only randomly picked 100 different combinations from the possible 9:25|10128 combinations (cf. Eq.19) to perform the 5fold cross-validation, yielding 100 different results located within a certain region. Title Loaded From File Therefore, in their report, rather than a single figure but a figures region was used to show their test result. For example, according to their report (Table 2), Acc 71:03*92:66 , meaning that the lowest one of the 100 overall success rates obtained by the PSEApred predictor [2] was 71.03 , while the highest one was 92.66 . To make the comparison of iSMP-Grey with PSEApred [2] under the same condition with the same test method, we also randomly picked 100 different combinations as done by Verma et al. [2] to perform the 5-fold cross-validation test with iSMP-Grey, and the corresponding results thus obtained are given in Table 2 as well. As we can see from the table, not only the average rates obtained by the iSMP-Grey predictor are remarkably higher than those by the PSEApred predictor [2], but the corresponding region widths by the former are also significantly narrower than those by 22948146 the latter, indicating the success rates by the iSMP-Grey are not only higher but also more stable than those by the PSEApred predictor [2].All the above results have indicated that the novel pseudo amino acid composition formulated via the grey system model GM(2,1) can more effectively incorporate the protein sequence evolution information so as to remarkably enhance the success rates of the iSMP-Grey predictor in identifying the secretory proteins of malaria parasite. It is anticipated that iSMP-Grey may become a useful high throughput tool for both basic research and drug development in the relevant areas.Supporting InformationSupporting Information S1 The benchmark datasetBenchincludes 504 proteins, classified into 252 secretory proteins of malaria parasite and 252 non-secretory proteins. (PDF)AcknowledgmentsThe authors wish to thank the two anonymous Reviewers, whose constructive comments were very helpful for strengthening the presentation of this paper.Author ContributionsConceived and designed the experiments: WZL XX. Performed the experiments: WZL JAF. Analyzed the data: WZL XX KCC. Contributed reagents/materials/analysis tools: XX. Wrote the paper: WZL KCC.
The colon provides the most favorable conditions for intestinal microbiota and harbors, with approximately 1012 microorganisms per gram of intestinal content, the most densely populated and complex community of the human gastrointestinal tract [1,2]. During evolution a complex and intensive mutu.N test.Sn ( )c 90.48,92.46 73.41,97.22 Sp ( )c 94.05,98.02 44.84,100 Acc ( )c 92.86,94.84 71.03,92.66 MCCc 0.87,0.90 0.49,0.Predictor iSMP-GreyaP252! : ?52{Int?52=5?Int?52=5? ?9?PSEApredba252! ?52{Int?52=5?Int?52=5? !2 252! w9:25|10128 ?52{50?50!b cSee footnote a of Table 1. From ref. [2]. See the discussion in the text and Eq.19 for why the results obtained by the 5fold cross-validation test were not unique. doi:10.1371/journal.pone.0049040.twhere the symbol Int is the integer-truncating operator meaning to take the integer part for the number in the bracket right after it. The result of Eq.19 indicates that the number of possible combinations of taking one-fifth proteins from each of the two subsets, z and { , for conducting the 5-fold cross-validation will be greater than 9:25|10128 , which is an astronomical figure, too large to be practically feasible. Actually, in their study [2], Verma et al. only randomly picked 100 different combinations from the possible 9:25|10128 combinations (cf. Eq.19) to perform the 5fold cross-validation, yielding 100 different results located within a certain region. Therefore, in their report, rather than a single figure but a figures region was used to show their test result. For example, according to their report (Table 2), Acc 71:03*92:66 , meaning that the lowest one of the 100 overall success rates obtained by the PSEApred predictor [2] was 71.03 , while the highest one was 92.66 . To make the comparison of iSMP-Grey with PSEApred [2] under the same condition with the same test method, we also randomly picked 100 different combinations as done by Verma et al. [2] to perform the 5-fold cross-validation test with iSMP-Grey, and the corresponding results thus obtained are given in Table 2 as well. As we can see from the table, not only the average rates obtained by the iSMP-Grey predictor are remarkably higher than those by the PSEApred predictor [2], but the corresponding region widths by the former are also significantly narrower than those by 22948146 the latter, indicating the success rates by the iSMP-Grey are not only higher but also more stable than those by the PSEApred predictor [2].All the above results have indicated that the novel pseudo amino acid composition formulated via the grey system model GM(2,1) can more effectively incorporate the protein sequence evolution information so as to remarkably enhance the success rates of the iSMP-Grey predictor in identifying the secretory proteins of malaria parasite. It is anticipated that iSMP-Grey may become a useful high throughput tool for both basic research and drug development in the relevant areas.Supporting InformationSupporting Information S1 The benchmark datasetBenchincludes 504 proteins, classified into 252 secretory proteins of malaria parasite and 252 non-secretory proteins. (PDF)AcknowledgmentsThe authors wish to thank the two anonymous Reviewers, whose constructive comments were very helpful for strengthening the presentation of this paper.Author ContributionsConceived and designed the experiments: WZL XX. Performed the experiments: WZL JAF. Analyzed the data: WZL XX KCC. Contributed reagents/materials/analysis tools: XX. Wrote the paper: WZL KCC.
The colon provides the most favorable conditions for intestinal microbiota and harbors, with approximately 1012 microorganisms per gram of intestinal content, the most densely populated and complex community of the human gastrointestinal tract [1,2]. During evolution a complex and intensive mutu.