Study on the Development and Implementation of Different Big Data Clustering Methods

Abstract: Clustering is an unsupervised learning method that organizes raw data so that items with the same or similar characteristics fall in the same class and dissimilar items fall in different classes. The rapid increase in the amount of data being produced brings new challenges for analyzing and storing it. Growing interest in areas such as real-time data mining reveals an urgent need to process very large data sets under strict performance constraints. This paper surveys four clustering algorithms, K-Means, Fuzzy c-Means (FCM), Expectation Maximization (EM), and BIRCH, and presents their strengths and weaknesses. It also compares the results obtained by applying each algorithm to the same data and draws a conclusion from these results.
Authors: Jean Pierre Ntayagabiri; Jérémie Ndikumagenge; Longin Ndayisaba; Boribo Kikunda Philippe (Center of Research in Infrastructure, Environment and Technology (CRIET), University of Burundi, Bujumbura, Burundi; Catholic University of Bukavu, Bukavu, Democratic Republic of the Congo)
Source: Open Journal of Applied Sciences (应用科学(英文)), 2023, No. 7, pp. 1163-1177 (15 pages)
Keywords: Clustering; K-Means; Fuzzy c-Means; Expectation Maximization; BIRCH