At present, although the human speech separation has achieved fruitful results, it is not ideal for the separation of singing and accompaniment. Based on low-rank and sparse optimization theory, in this paper, we prop...At present, although the human speech separation has achieved fruitful results, it is not ideal for the separation of singing and accompaniment. Based on low-rank and sparse optimization theory, in this paper, we propose a new singing voice separation algorithm called Low-rank, Sparse Representation with pre-learned dictionaries and side Information (LSRi). The algorithm incorporates both the vocal and instrumental spectrograms as sparse matrix and low-rank matrix, meanwhile combines pre-learning dictionary and the reconstructed voice spectrogram form the annotation. Evaluations on the iKala dataset show that the proposed methods are effective and efficient for singing voice separation.展开更多
针对视频处理中运动目标的精确检测这一问题,提出了一种自适应的低秩稀疏分解算法。该算法首先用背景模型与待求解的帧向量构建增广矩阵,然后使用鲁棒的主成分分析(robust principal component analysis,RPCA)对降维后的增广矩阵进行低...针对视频处理中运动目标的精确检测这一问题,提出了一种自适应的低秩稀疏分解算法。该算法首先用背景模型与待求解的帧向量构建增广矩阵,然后使用鲁棒的主成分分析(robust principal component analysis,RPCA)对降维后的增广矩阵进行低秩稀疏分解,分离出的低秩部分和稀疏噪声分别对应于视频帧的背景和运动前景,然后使用增量奇异值分解方法用当前得到的背景向量更新背景模型。实验结果表明,该算法能更好地处理光线变化、背景运动等复杂场景,并有效降低算法的延迟和内存的占用。展开更多
文摘At present, although the human speech separation has achieved fruitful results, it is not ideal for the separation of singing and accompaniment. Based on low-rank and sparse optimization theory, in this paper, we propose a new singing voice separation algorithm called Low-rank, Sparse Representation with pre-learned dictionaries and side Information (LSRi). The algorithm incorporates both the vocal and instrumental spectrograms as sparse matrix and low-rank matrix, meanwhile combines pre-learning dictionary and the reconstructed voice spectrogram form the annotation. Evaluations on the iKala dataset show that the proposed methods are effective and efficient for singing voice separation.
文摘针对视频处理中运动目标的精确检测这一问题,提出了一种自适应的低秩稀疏分解算法。该算法首先用背景模型与待求解的帧向量构建增广矩阵,然后使用鲁棒的主成分分析(robust principal component analysis,RPCA)对降维后的增广矩阵进行低秩稀疏分解,分离出的低秩部分和稀疏噪声分别对应于视频帧的背景和运动前景,然后使用增量奇异值分解方法用当前得到的背景向量更新背景模型。实验结果表明,该算法能更好地处理光线变化、背景运动等复杂场景,并有效降低算法的延迟和内存的占用。
文摘视频合成孔径雷达(video synthetic aperture radar,VideoSAR)的超长相干孔径观测使得区域动态信息的快速浏览极其困难。为以机器视觉方式自动捕捉地物散射消失-瞬态持续-消失-瞬态持续-消失的关键帧变化全过程,提出了一种子孔径能量梯度(subaperture energy gradient,SEG)和低秩与稀疏分解(low-rank plus sparse decomposition,LRSD)相结合的VideoSAR关键帧提取器。提取器为系列性通用架构,适用于任何SEG和LRSD系列方法相结合的形式。所提技术首要针对同时单通道、单波段、单航迹等有限信息条件的解决途径,有助于打破应急响应场景中难以采集多通道、多波段、多航迹或多传感器数据的应用局限性。基于实测数据处理和多种先进LRSD算法进行了对比验证,其代表性散射信息的充分提取可促进未来快速地理解并浓缩区域动态。