A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic mod...A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic modulus of rock, rock quality designation (RQD), area ratio of pillar, ratio of width to height of pillar, depth of ore body, volume of goaf, dip of ore body and area of goal, were selected as discriminant indexes in the stability analysis of goal. The actual data of 40 goals were used as training samples to establish a discriminant analysis model to identify the stability of goaf. The results show that this discriminant analysis model has high precision and misdiscriminant ratio is 0.025 in re-substitution process. The instability identification of a metal mine was distinguished by using this model and the identification result is identical with that of practical situation.展开更多
The sharp increase of the amount of Internet Chinese text data has significantly prolonged the processing time of classification on these data.In order to solve this problem,this paper proposes and implements a parall...The sharp increase of the amount of Internet Chinese text data has significantly prolonged the processing time of classification on these data.In order to solve this problem,this paper proposes and implements a parallel naive Bayes algorithm(PNBA)for Chinese text classification based on Spark,a parallel memory computing platform for big data.This algorithm has implemented parallel operation throughout the entire training and prediction process of naive Bayes classifier mainly by adopting the programming model of resilient distributed datasets(RDD).For comparison,a PNBA based on Hadoop is also implemented.The test results show that in the same computing environment and for the same text sets,the Spark PNBA is obviously superior to the Hadoop PNBA in terms of key indicators such as speedup ratio and scalability.Therefore,Spark-based parallel algorithms can better meet the requirement of large-scale Chinese text data mining.展开更多
基金Project (2010CB732004) supported by the National Basic Research Program of China
文摘A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic modulus of rock, rock quality designation (RQD), area ratio of pillar, ratio of width to height of pillar, depth of ore body, volume of goaf, dip of ore body and area of goal, were selected as discriminant indexes in the stability analysis of goal. The actual data of 40 goals were used as training samples to establish a discriminant analysis model to identify the stability of goaf. The results show that this discriminant analysis model has high precision and misdiscriminant ratio is 0.025 in re-substitution process. The instability identification of a metal mine was distinguished by using this model and the identification result is identical with that of practical situation.
基金Project(KC18071)supported by the Application Foundation Research Program of Xuzhou,ChinaProjects(2017YFC0804401,2017YFC0804409)supported by the National Key R&D Program of China
文摘The sharp increase of the amount of Internet Chinese text data has significantly prolonged the processing time of classification on these data.In order to solve this problem,this paper proposes and implements a parallel naive Bayes algorithm(PNBA)for Chinese text classification based on Spark,a parallel memory computing platform for big data.This algorithm has implemented parallel operation throughout the entire training and prediction process of naive Bayes classifier mainly by adopting the programming model of resilient distributed datasets(RDD).For comparison,a PNBA based on Hadoop is also implemented.The test results show that in the same computing environment and for the same text sets,the Spark PNBA is obviously superior to the Hadoop PNBA in terms of key indicators such as speedup ratio and scalability.Therefore,Spark-based parallel algorithms can better meet the requirement of large-scale Chinese text data mining.