摘要
大数据表现出数据规模庞大、数据类型多样化的特征,所以在实际开展大数据分析工作时,对数据处理速度以及数据处理实效性都有着较为苛刻的要求。数据挖掘技术是依托于特定的建模算法,从大规模数据中挖掘出其中的隐藏信息,充分发挥大数据的应用价值。Spark平台是一个面向海量数据集合的高效率集群分布式计算系统,依托于该平台开展大数据挖掘有助于获得更好的效果,本文对此开展了简单的探讨。
Big data shows the characteristics of large data scale and diversified data types.Therefore,when actually carrying out big data analysis,there are strict requirements for data processing speed and effectiveness.Data mining technology relies on specific modeling algorithms to mine the hidden information from largescale data and give full play to the application value of big data.Spark platform is an efficient cluster distributed computing system for massive data sets.Relying on this platform to carry out big data mining will help to obtain better results.This paper makes a simple discussion on this.
作者
曹海平
CAO Haiping(Hubei Land Resources Vocational College,Wuhan Hubei 430090)
出处
《软件》
2022年第7期84-86,共3页
Software
关键词
Spark平台
大数据
据挖掘技术
Spark platform
big data
according to mining technology