期刊文献+

基于分形布朗运动和Ada Boosting的多类音频例子识别 被引量:8

Recognition of Multiple Audio Clip Classes Based on FBM and Ada Boosting
下载PDF
导出
摘要 提出了一种基于分形布朗运动的音频特征提取和识别方法 这种方法使用分形布朗运动模型计算出音频例子的分形维数 ,并作为其分形特征 针对音频分形特征符合高斯分布的特点 ,使用AdaBoosting算法进行特征约减 然后分别使用Ada 加权高斯分类器和支持向量机对约减特征后的音频分类 ,并在两类分类的基础上构造多类分类的模型 实验表明 。 A novel method for audio feature extraction and recognition is presented In this method, FBM (fractional brownian motion) based fractal dimension is defined as audio fractal feature According to Gaussian distribution characteristic of audio fractal feature, Ada boosting algorithm is used for feature reduction Then two classifiers, weighted Ada Gaussian classifier and support vector machine, are implemented respectively for audio classification Based on these two classifiers, a multiple classifier model is finally constructed Experimental data shows that audio fractal feature achieves better performance than other audio features for music and speech classification
出处 《计算机研究与发展》 EI CSCD 北大核心 2003年第7期941-949,共9页 Journal of Computer Research and Development
基金 国家自然科学基金项目 ( 60 2 72 0 3 1) 浙江省自然科学基金重点项目 (ZD0 2 12 ) 浙江省科技计划重点科研项目 ( 2 0 0 3C2 10 10 )
关键词 分形布朗运动 音频分形维数 音频分形特征 特征约减 FBM (fractional Brownian motion) audio fractal dimension audio fractal feature
  • 相关文献

参考文献16

  • 1吴飞,庄越挺,张引,潘云鹤.基于隐马尔可夫链的音频语义检索[J].模式识别与人工智能,2001,14(1):104-108. 被引量:10
  • 2庄越挺,毛祎,吴飞,潘云鹤.基于隐马尔可夫链的广播新闻分割分类[J].计算机研究与发展,2002,39(9):1057-1063. 被引量:7
  • 3庄越挺,刘骏伟,吴飞,潘云鹤,张引.基于支持向量机的视频字幕自动定位与提取[J].计算机辅助设计与图形学学报,2002,14(8):750-753. 被引量:38
  • 4J T Foote. An overview of audio information retrieval .Multimedia Systems, 1999, 7(1): 2--11. 被引量:1
  • 5John Saunders. Real time discrimination of broadcast speech/music. IEEE Int'l Cord on Acoustic, Speech, and Signal Processing (ICASSP-96), Atla, 1996. 被引量:1
  • 6Eric Scheirer, M Slaney. Construction and evaluation of a robust multifeature music/speech discriminator. Int' 1 Cord on Acoustic,Speech, and Signal Processing ( ICASSP' 97 ), Munich,Germany, 1997. 被引量:1
  • 7J T Foote. A similarity measure for automatic audio classification.AAAI 1997 Spring Symposium on Intelligent Integration and Use of Text, Image, Video, and Audio Corpora, Stanford, 1997. 被引量:1
  • 8B B Mandlebrot. The Fractal Geometry. of Nature. New York: W H Freeman & Co, 1982. 被引量:1
  • 9R F Voss, J Clarke. 1/f noise in music and speech. Nature,1975, 258:317--318. 被引量:1
  • 10R F Moss, J Clark. 1/f noise in music: Music from 1/f noise.Journal of the Acoustical Sodety of America, 1978, 63 (1) : 258--263. 被引量:1

二级参考文献25

  • 1[1]Y Wang, Z Liu, J Huang. Multimedia content analysis using audio and visual information[J]. IEEE Signal Processing Magazine, 2000, 17(6):12~36 被引量:1
  • 2[2]R Lienhart, F Stuber. Automatic text recognition in digital videos[A]. In: Proceedings of ACM Multimedia, Boston, 1996.11~20 被引量:1
  • 3[3]Zhong Yu, Zhang Hongjiang, Jain Anil K. Automatic caption localization in compressed video[J]. Pattern Analysis and Machine Intelligence, 2000, 22(4):385~392 被引量:1
  • 4[4]V Vapnik. The Nature of Statistical Learning Theory[M]. New York: Springer, 1995 被引量:1
  • 5[5]M Schmidt. Identifying speaker with support vector networks[A]. In: Proceedings of Interface'96, Sydney, 1996 被引量:1
  • 6[6]T Joachims. Text categorization with support vector machines: Learning with many relevant features[A]. In: Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, 1998.137~142 被引量:1
  • 7[7]Yuan Qi. Learning algorithms for video and audio processing: Independent component analysis and support vector machine based approaches[R].College Park: University of Maryland at College Park, LAMP-TR-056(CAR-TR-951), 2000 被引量:1
  • 8[8]Edgar Osuna, Robert Freund, Federico Girosi. Training support vector machines: An application to face detection[A]. In: Proceedings of Computer Vision and Pattern Recognition, Puerto Rico, 1997.130~136 被引量:1
  • 9[9]C J C Burges. A tutorial on support vector machines for pattern recognition[J]. Data Mining, and Knowledge Discovery, 1998, 2(2):121~167 被引量:1
  • 10[10]T M Cover. Geometrical and statistical properties of systems and linear inequalities with applications in pattern recognition[J]. IEEE Transactions on Electronic Computers, 1965, 14(3):326~334 被引量:1

共引文献51

同被引文献37

引证文献8

二级引证文献76

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部