An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods ...An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods such as MI-based rule evaluating, weighted rule quantification and element-based n-gram probability approximation are presented. Dynamic Viterbi algorithm is adopted to search the best path in lattice. To strengthen the model, transformation-based error-driven rules learning is adopted. Applying proposed model to Chinese Pinyin-to-character conversion, high performance has been achieved in accuracy, flexibility and robustness simultaneously. Tests show correct rate achieves 94.81% instead of 90.53% using bi-gram Markov model alone. Many long-distance dependency and recursion in language can be processed effectively.展开更多
针对尾矿坝在线监测重建设、轻利用的现状,基于尾矿坝位移在线监测时间序列,通过多步逆向云变换算法(Multi-step Backward Cloud Transformation Algorithm Based on Sampling with Replacement,MBCT-SR)改进云模型,根据“3E_(n)原则”...针对尾矿坝在线监测重建设、轻利用的现状,基于尾矿坝位移在线监测时间序列,通过多步逆向云变换算法(Multi-step Backward Cloud Transformation Algorithm Based on Sampling with Replacement,MBCT-SR)改进云模型,根据“3E_(n)原则”和内外包络曲线确定在线监测位移的正常运行值,从而建立尾矿坝位移分级预警阈值模型,并利用某尾矿坝全球导航卫星(Global Navigation Satellite System,GNSS)技术表面位移在线监测数据进行实例验证。结果表明:该尾矿坝水平方向位移的黄、橙、红预警阈值分别为8.41 mm/d、12.94 mm/d、19.41 mm/d,呈现出坝体中间预警阈值最大、并由中间向两侧减小的空间变化规律;尾矿坝垂直方向位移的黄、橙、红预警阈值分别为16.56 mm/d、25.48 mm/d、38.22 mm/d,且随着子坝的堆积,预警阈值逐渐增大。展开更多
文摘An N-gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods such as MI-based rule evaluating, weighted rule quantification and element-based n-gram probability approximation are presented. Dynamic Viterbi algorithm is adopted to search the best path in lattice. To strengthen the model, transformation-based error-driven rules learning is adopted. Applying proposed model to Chinese Pinyin-to-character conversion, high performance has been achieved in accuracy, flexibility and robustness simultaneously. Tests show correct rate achieves 94.81% instead of 90.53% using bi-gram Markov model alone. Many long-distance dependency and recursion in language can be processed effectively.