One recent area of interest in computer science is data stream management and processing. By ‘data stream', we refer to continuous and rapidly generated packages of data. Specific features of data streams are imm...One recent area of interest in computer science is data stream management and processing. By ‘data stream', we refer to continuous and rapidly generated packages of data. Specific features of data streams are immense volume, high production rate, limited data processing time, and data concept drift; these features differentiate the data stream from standard types of data. An issue for the data stream is classification of input data. A novel ensemble classifier is proposed in this paper. The classifier uses base classifiers of two weighting functions under different data input conditions. In addition, a new method is used to determine drift, which emphasizes the precision of the algorithm. Another characteristic of the proposed method is removal of different numbers of the base classifiers based on their quality. Implementation of a weighting mechanism to the base classifiers at the decision-making stage is another advantage of the algorithm. This facilitates adaptability when drifts take place, which leads to classifiers with higher efficiency. Furthermore, the proposed method is tested on a set of standard data and the results confirm higher accuracy compared to available ensemble classifiers and single classifiers. In addition, in some cases the proposed classifier is faster and needs less storage space.展开更多
Membrane proteins are embedded in the lipid bilayer,which creates a suitable environment for their actions. It is important to decide which tpye it belongs to because it is closely relevant to its biological function ...Membrane proteins are embedded in the lipid bilayer,which creates a suitable environment for their actions. It is important to decide which tpye it belongs to because it is closely relevant to its biological function and its interaction process with other molecules in a biological system. Membrane proteins have different types. The function of a membrane protein is closely correlated with the type it belongs to. In this study,on the basis of the concept of pseudo amino acid (PseAA) composition originally introduced by Chou,the value of approximate entropy (ApEn) of the query membrane protein was used to integrate the complementary information. By fusing fifteen powerful individual fuzzy K-nearest neighbor ( FKNN) classifiers,an ensemble classifier was presented. Each basic classifier was trained in PseAA composition of membrane protein sequences with different parameters. The results of experiments demonstrate it is efficient for the structural prediction of membrane proteins.展开更多
文摘One recent area of interest in computer science is data stream management and processing. By ‘data stream', we refer to continuous and rapidly generated packages of data. Specific features of data streams are immense volume, high production rate, limited data processing time, and data concept drift; these features differentiate the data stream from standard types of data. An issue for the data stream is classification of input data. A novel ensemble classifier is proposed in this paper. The classifier uses base classifiers of two weighting functions under different data input conditions. In addition, a new method is used to determine drift, which emphasizes the precision of the algorithm. Another characteristic of the proposed method is removal of different numbers of the base classifiers based on their quality. Implementation of a weighting mechanism to the base classifiers at the decision-making stage is another advantage of the algorithm. This facilitates adaptability when drifts take place, which leads to classifiers with higher efficiency. Furthermore, the proposed method is tested on a set of standard data and the results confirm higher accuracy compared to available ensemble classifiers and single classifiers. In addition, in some cases the proposed classifier is faster and needs less storage space.
基金National Nature Science Foundations of China (No.60975059, No.60775052)Specialized Research Fund for the Doctoral Program of Higher Education from Ministry of Education of China ( No.20090075110002)Projects of the Shanghai Committee of Science and Technology (No.09JC1400900, No.08JC1400100, No.10DZ0506500)
文摘Membrane proteins are embedded in the lipid bilayer,which creates a suitable environment for their actions. It is important to decide which tpye it belongs to because it is closely relevant to its biological function and its interaction process with other molecules in a biological system. Membrane proteins have different types. The function of a membrane protein is closely correlated with the type it belongs to. In this study,on the basis of the concept of pseudo amino acid (PseAA) composition originally introduced by Chou,the value of approximate entropy (ApEn) of the query membrane protein was used to integrate the complementary information. By fusing fifteen powerful individual fuzzy K-nearest neighbor ( FKNN) classifiers,an ensemble classifier was presented. Each basic classifier was trained in PseAA composition of membrane protein sequences with different parameters. The results of experiments demonstrate it is efficient for the structural prediction of membrane proteins.