Attacks such as APT usually hide communication data in massive legitimate network traffic, and mining structurally complex and latent relationships among flow-based network traffic to detect attacks has become the foc...Attacks such as APT usually hide communication data in massive legitimate network traffic, and mining structurally complex and latent relationships among flow-based network traffic to detect attacks has become the focus of many initiatives. Effectively analyzing massive network security data with high dimensions for suspicious flow diagnosis is a huge challenge. In addition, the uneven distribution of network traffic does not fully reflect the differences of class sample features, resulting in the low accuracy of attack detection. To solve these problems, a novel approach called the fuzzy entropy weighted natural nearest neighbor(FEW-NNN) method is proposed to enhance the accuracy and efficiency of flowbased network traffic attack detection. First, the FEW-NNN method uses the Fisher score and deep graph feature learning algorithm to remove unimportant features and reduce the data dimension. Then, according to the proposed natural nearest neighbor searching algorithm(NNN_Searching), the density of data points, each class center and the smallest enclosing sphere radius are determined correspondingly. Finally, a fuzzy entropy weighted KNN classification method based on affinity is proposed, which mainly includes the following three steps: 1、 the feature weights of samples are calculated based on fuzzy entropy values, 2、 the fuzzy memberships of samples are determined based on affinity among samples, and 3、 K-neighbors are selected according to the class-conditional weighted Euclidean distance, the fuzzy membership value of the testing sample is calculated based on the membership of k-neighbors, and then all testing samples are classified according to the fuzzy membership value of the samples belonging to each class;that is, the attack type is determined. The method has been applied to the problem of attack detection and validated based on the famous KDD99 and CICIDS-2017 datasets. From the experimental results shown in this paper, it is observed that the FEW-NNN method improves the accuracy and efficiency of f展开更多
Collaborative filtering (CF) is one of the most popular techniques behind the success of recommendation system. It predicts the interest of users by collecting information from past users who have the same opinions....Collaborative filtering (CF) is one of the most popular techniques behind the success of recommendation system. It predicts the interest of users by collecting information from past users who have the same opinions. The most popular approaches used in CF research area are Matrix factorization methods such as SVD. However, many well- known recommendation systems do not use this method but still stick with Neighborhood models because of simplicity and explainability. There are some concerns that limit neighborhood models to achieve higher prediction accuracy. To address these concerns, we propose a new exponential fuzzy clustering (XFCM) algorithm by reformulating the clustering's objective function with an exponential equation in order to improve the method for membership assignment. The proposed method assigns data to the clusters by aggressively excluding irrelevant data, which is better than other fuzzy C-means (FCM) variants. The experiments show that XFCM-based CF improved 6.9% over item-based method and 3.0% over SVD in terms of mean absolute error for 100 K and 1 M MovieLens dataset.展开更多
基金the Natural Science Foundation of China (No. 61802404, 61602470)the Strategic Priority Research Program (C) of the Chinese Academy of Sciences (No. XDC02040100)+3 种基金the Fundamental Research Funds for the Central Universities of the China University of Labor Relations (No. 20ZYJS017, 20XYJS003)the Key Research Program of the Beijing Municipal Science & Technology Commission (No. D181100000618003)partially the Key Laboratory of Network Assessment Technology,the Chinese Academy of Sciencesthe Beijing Key Laboratory of Network Security and Protection Technology
文摘Attacks such as APT usually hide communication data in massive legitimate network traffic, and mining structurally complex and latent relationships among flow-based network traffic to detect attacks has become the focus of many initiatives. Effectively analyzing massive network security data with high dimensions for suspicious flow diagnosis is a huge challenge. In addition, the uneven distribution of network traffic does not fully reflect the differences of class sample features, resulting in the low accuracy of attack detection. To solve these problems, a novel approach called the fuzzy entropy weighted natural nearest neighbor(FEW-NNN) method is proposed to enhance the accuracy and efficiency of flowbased network traffic attack detection. First, the FEW-NNN method uses the Fisher score and deep graph feature learning algorithm to remove unimportant features and reduce the data dimension. Then, according to the proposed natural nearest neighbor searching algorithm(NNN_Searching), the density of data points, each class center and the smallest enclosing sphere radius are determined correspondingly. Finally, a fuzzy entropy weighted KNN classification method based on affinity is proposed, which mainly includes the following three steps: 1、 the feature weights of samples are calculated based on fuzzy entropy values, 2、 the fuzzy memberships of samples are determined based on affinity among samples, and 3、 K-neighbors are selected according to the class-conditional weighted Euclidean distance, the fuzzy membership value of the testing sample is calculated based on the membership of k-neighbors, and then all testing samples are classified according to the fuzzy membership value of the samples belonging to each class;that is, the attack type is determined. The method has been applied to the problem of attack detection and validated based on the famous KDD99 and CICIDS-2017 datasets. From the experimental results shown in this paper, it is observed that the FEW-NNN method improves the accuracy and efficiency of f
文摘Collaborative filtering (CF) is one of the most popular techniques behind the success of recommendation system. It predicts the interest of users by collecting information from past users who have the same opinions. The most popular approaches used in CF research area are Matrix factorization methods such as SVD. However, many well- known recommendation systems do not use this method but still stick with Neighborhood models because of simplicity and explainability. There are some concerns that limit neighborhood models to achieve higher prediction accuracy. To address these concerns, we propose a new exponential fuzzy clustering (XFCM) algorithm by reformulating the clustering's objective function with an exponential equation in order to improve the method for membership assignment. The proposed method assigns data to the clusters by aggressively excluding irrelevant data, which is better than other fuzzy C-means (FCM) variants. The experiments show that XFCM-based CF improved 6.9% over item-based method and 3.0% over SVD in terms of mean absolute error for 100 K and 1 M MovieLens dataset.