期刊文献+

Regression Analysis of the Number of Association Rules 被引量:1

Regression Analysis of the Number of Association Rules
下载PDF
导出
摘要 The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coeficients to test the fitting efiects of the equations and uses significance test to verify whether the coeficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coeficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values. The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coeficients to test the fitting efiects of the equations and uses significance test to verify whether the coeficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coeficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values.
出处 《International Journal of Automation and computing》 EI 2011年第1期78-82,共5页 国际自动化与计算杂志(英文版)
基金 supported by the National Natural Science Foundation of China (No. J07240003, No. 60773084, No. 60603023) National Research Fund for the Doctoral Program of Higher Education of China (No. 20070151009)
关键词 Association rules regression analysis multiple correlation coeficients INTEREST SUPPORT confidence. Association rules regression analysis multiple correlation coeficients interest support confidence.
  • 相关文献

参考文献4

二级参考文献36

  • 1贺志,黄厚宽,田盛丰.一种优化相关规则的发现方法[J].计算机学报,2006,29(6):906-913. 被引量:12
  • 2[5]E R A Omiecinski.Lternative interest measures for mining associations in databases.IEEE Trans on Knowedge and Data Engineering,2003,15(1):57-69 被引量:1
  • 3[6]http://www.ics.uci.edu/~mlearn/MLSummary.html,2006 被引量:1
  • 4Liu B,Hsu W, Ma Y.Mining Association Rules with Multiple Minimum Supports.KDD-99,1999. 被引量:1
  • 5Nag B,Deshpande P M,Dewitt D J.Using a Knowledge Cache for Interactive Discovery of Association Rules.KDD-99,1999:244-253. 被引量:1
  • 6Han J,Fu Y.Discovery of Multiple-level from Large Databases.Zurich,Switzerland:In Proc. 21th VLDB Conf., 1995-09:420-431. 被引量:1
  • 7Carlin B.P.,Louis T.A..Bayes and Empirical Bayes Methods for Data Analysis.2nd Edition.London U.K.:Chapman &Hall,2000 被引量:1
  • 8Light R.J.,Margolin B.H..An analysis of variance for categorical data.Journal of the American Statistical Association,1971,66:534~544 被引量:1
  • 9Steven R.,Matthew S.etal.Integrated Public Use Microdata Series:Version 2.0.Historical Census Projects,University of Minnesota,Minneapolis,1997 被引量:1
  • 10Au W.-H.,Chan K.C.C.,Yao X..A novel evolutionary data mining algorithm with applications to churn prediction.IEEE Transactions on Evolutionary Computation,2003,7 (6):532~545 被引量:1

共引文献23

同被引文献5

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部