Authors: ShaoWu Zhang YunLong Zhang HuiFang Yang ChunHui Zhao Quan Pan
Publish Date: 2007/12/11
Volume: 34, Issue: 4, Pages: 565-572
Abstract
The rapidly increasing number of sequence entering into the genome databank has called for the need for developing automated methods to analyze them Information on the subcellular localization of new found protein sequences is important for helping to reveal their functions in time and conducting the study of system biology at the cellular level Based on the concept of Chou’s pseudoamino acid composition a series of useful information and techniques such as residue conservation scores von Neumann entropies multiscale energy and weighted autocorrelation function were utilized to generate the pseudoamino acid components for representing the protein samples Based on such an infrastructure a hybridization predictor was developed for identifying uncharacterized proteins among the following 12 subcellular localizations chloroplast cytoplasm cytoskeleton endoplasmic reticulum extracell Golgi apparatus lysosome mitochondria nucleus peroxisome plasma membrane and vacuole Compared with the results reported by the previous investigators higher success rates were obtained suggesting that the current approach is quite promising and may become a useful highthroughput tool in the relevant areasThis paper was supported in part by the National Natural Science Foundation of China No 60775012 and 60634030 and the Technological Innovation Foundation of Northwestern Polytechnical University No KC02 and the Science Technology Research and Development Program of Shaanxi No 2006k04G14
Keywords: