摘要
The literature generally agrees that longer-horizon(over a month) predictions make more sense than short-horizon ones. However, it's an especially challenging task due to the lack of data(in unit of long horizon)and economic data have a low S/N ratio. We hypothesize that the stock trend is largely dictated by driving factors which are filtered by psychological factors and work on behavioral factors: representative indicators from these three aspects would be adequate in trend prediction. We then extend the Stepwise Regression Analysis(SRA)algorithm to constrained SRA(c SRA) to carry out a further feature selection and lag optimization. During modeling stage, we introduce the Deep Neural Network(DNN) model in stock prediction under the suspicion that economic interactions are too complex for shallow networks to capture. Our experiments indeed show that deep structures generally perform better than shallow ones. Instead of comparing to a kitchen sink model, where over-fitting can easily happen with a shortage of data, we turn around and use a model ensemble approach which indirectly demonstrates our proposed method is adequate.
The literature generally agrees that longer-horizon(over a month) predictions make more sense than short-horizon ones. However, it's an especially challenging task due to the lack of data(in unit of long horizon)and economic data have a low S/N ratio. We hypothesize that the stock trend is largely dictated by driving factors which are filtered by psychological factors and work on behavioral factors: representative indicators from these three aspects would be adequate in trend prediction. We then extend the Stepwise Regression Analysis(SRA)algorithm to constrained SRA(c SRA) to carry out a further feature selection and lag optimization. During modeling stage, we introduce the Deep Neural Network(DNN) model in stock prediction under the suspicion that economic interactions are too complex for shallow networks to capture. Our experiments indeed show that deep structures generally perform better than shallow ones. Instead of comparing to a kitchen sink model, where over-fitting can easily happen with a shortage of data, we turn around and use a model ensemble approach which indirectly demonstrates our proposed method is adequate.
作者
FANG Yan
方艳(School of Statistics and Management, Shanghai University of Finance and Economics;School of Finance, Shanghai University of International Business and Economics)
基金
the National Natural Science Foundation of China(Nos.11501355 and 71571116)
the Project of Knowledge Innovation Program of Shanghai Municipal Education Commission(No.15ZZ090)
the 59th China Postdoctoral Sciences Foundation Funded Project(No.2016M591640)
the Humanities and Social Sciences Research Project of Ministry of Education(No.15YJA790039)
the National Social Science Foundation of China(No.15ZDA058)