Blind speech source separation via nonlinear time-frequency masking

Source: Chinese Journal of Acoustics
Aiming at the underdetermined convolutive mixture model, a blind speech source separation method based on nonlinear time-frequency masking is proposed, which exploits the approximate W-disjoint orthogonality (W-DO) property of independent speech signals in the time-frequency domain. In this method, the mixture signals observed at multiple microphones are first normalized in the time-frequency domain so that they are independent of frequency. A dynamic clustering algorithm is then used to determine which source is active in each time-frequency slot, and a nonlinear function of the deflection angle from the cluster center is selected as the time-frequency mask. Finally, the mixed speech signals are blindly separated by the inverse short-time Fourier transform (STFT). The method not only solves the frequency permutation problem encountered in most classic frequency-domain blind separation techniques, but also suppresses the spatial direction diffusion of the separation matrix. Simulation results show that the proposed method outperforms the typical BLUES method, with an average increase of 1.58 dB in signal-to-noise ratio gain (SNRG).
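The masking step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact nonlinear function or the clustering details, so everything here is an assumption made for illustration: the helper names (`unit_normalize`, `deflection_angle`, `nonlinear_mask`), the logistic shape of the mask, and the parameters `theta_t` and `sharpness` are hypothetical and are not taken from the paper. The cluster centers are supplied by hand in place of the dynamic clustering step.

```python
import numpy as np

def unit_normalize(X):
    """Scale each observation vector (last axis = microphones) to unit norm."""
    norms = np.linalg.norm(X, axis=-1, keepdims=True)
    return X / np.maximum(norms, 1e-12)

def deflection_angle(X, centers):
    """Angle between each time-frequency vector and each cluster center.

    X:       (n_bins, n_mics) complex, unit-norm rows
    centers: (n_src, n_mics) complex, unit-norm rows
    returns: (n_bins, n_src) angles in [0, pi/2]
    """
    # |c^H x| is the cosine of the angle between two unit-norm complex vectors.
    cos_theta = np.abs(X @ centers.conj().T)
    return np.arccos(np.clip(cos_theta, 0.0, 1.0))

def nonlinear_mask(theta, theta_t=np.pi / 8, sharpness=20.0):
    """Hypothetical smooth mask (an assumption, not the paper's function):
    close to 1 for angles well below theta_t, close to 0 well above it."""
    return 1.0 / (1.0 + np.exp(sharpness * (theta - theta_t)))

# Toy example: two microphones, two hand-picked cluster centers,
# and three time-frequency observation vectors.
centers = unit_normalize(np.array([[1.0, 0.0], [0.0, 1.0]], dtype=complex))
X = unit_normalize(np.array([[2.0, 0.0],    # aligned with center 0
                             [0.0, 3.0j],   # aligned with center 1 (phase-shifted)
                             [1.0, 1.0]],   # halfway between both centers
                            dtype=complex))

theta = deflection_angle(X, centers)
masks = nonlinear_mask(theta)  # shape (n_bins, n_src)
```

In a full pipeline, each column of `masks` would multiply the STFT of a reference microphone before the inverse STFT. In this toy setup, a bin lying halfway between two centers is attenuated for both sources, which is one possible way a soft mask can treat bins where the W-DO assumption only holds approximately.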