毕业论文

打赏
当前位置: 毕业论文 > 外文文献翻译 >

迭代频谱减法和双重阈值CVR英文文献和中文翻译(3)

时间:2022-04-14 20:25来源:毕业论文
no-sensitive to phase distortion, phase signal of noise is implemented when phase is restored [5][6]。 In the frequency domain, equation (5) is expressed as 5。2。 Proposed VAD based on improved sp

no-sensitive to phase distortion, phase signal of noise is implemented when phase is restored [5][6]。 In the frequency domain, equation (5) is expressed as

5。2。 Proposed VAD based on improved    spectral

transforms   (DFT)   of  y(i) , s(i)  and d (i)  , respectively。

Because s(i) and d (i) are independent and Nk is gauss distribution, equation (6) is expressed as (7) in frequency domain。

On the basis of section above, we can firstly apply spectral subtraction for noisy sound of CVR to reducing noise and enhance speech, and then enhanced signal is filtered by a preceding filter, finally cockpit voice is extracted by means of double thresholds VAD。 Figure    2

shows the flow chart of proposed VAD。 The preceding



filter is a high-pass filter, such as10。9375z1 ,   which

can filter low-frequency interference, especially

For a frame of speech signal, we

interference of frequency 50Hz or 60Hz, and advance spectrum of  high  frequency which  is  useful  for cockpit

For a wonderful VAD, two requirements must be taken into considered comprehensively: to detect  more  speech

where n (k ) is  statistical  mean  of  unvoiced speech,

2

Sk   is amplitude of enhanced speech。

However, basic SS can generate much musical noises in residual noises。 Some modified SS are proposed to reduce effect。 Weighting factorand power coefficient 

sections and more unvoiced speech sections。 However, when VAD tries to detect more speech frames, it misjudges silence as speech or otherwise。 The latter is ever worse than the former for accident investigation。 Therefore, two evaluation standards are compared to weighing  quantificationally  the  performance  of   VAD:

are introduced into SS, so equation (8) is modified as

probability  of  correctly  detecting  speech  frame Pcs

 

probability of correctly detecting noise frame Pcn  ,   which are expressed as

Modified SS is degraded  to  basic  SS  when =2 and =1[5]。  Other  modified  SS  is  showed  in    relative

references [6][7]。 Better enhancement performance can be gained by adjusting two parameters suitably, but voice

where  N handand  N handare   relatively   the  overall

distortion becomes severer as the degree of noise reduction is larger。

5。Proposed VAD based on improved spectral subtraction

5。1。Improved spectral subtraction

In this paper, we propose iterative spectral subtraction to formerly reducing noise and enhancing speech。 This method uses basic SS or modified SS for appropriate times。 The former enhanced speech becomes latter input signal, so music noise is seen as input noise to reduce again。

number of hand-labeling speech frames and noise  frames

by hand-labeling, N1 and N0 are relatively number of being detected correctly by VAD。

6。2。 Experiment results

In this paper, a section of speech in car (SNR =8) from standard voice bank Aurora2 and a section of true cockpit sound are used, simulation experiments based on traditional double thresholds VAD only and the proposed VAD are carried out。 Figure 3 and table 2 compare the performance of various methods in different environment。 Due to former spectral subtraction, the SNR increases, the curves of STE and ZCR become smoother and proper probability increases。 迭代频谱减法和双重阈值CVR英文文献和中文翻译(3):http://www.youerw.com/fanyi/lunwen_92555.html

------分隔线----------------------------
推荐内容