BROWSE

Related Researcher

Author

Jang, Gil-Jin
Machine Intelligence Lab
Research Interests
  • Acoustic signal processing

ITEM VIEW & DOWNLOAD

Particle filtering based pitch sequence correction for monaural speech segregation

Cited 0 times inthomson ciCited 0 times inthomson ci
Title
Particle filtering based pitch sequence correction for monaural speech segregation
Author
Kim, Han-GyuJang, Gil-JinPark, Jeong-SikKim, Ji-HwanOh, Yung-Hwan
Keywords
Autocorrelation functions; Conventional approach; Correction method; Energy loss ratio; Evaluation measures; Importance weights; Monaural speech segregations; Noise source; Noisy environment; Particle Filtering; Particle filtering algorithms; Pitch period; Residue ratio; Sound segregation; Speech segregation
Issue Date
201303
Publisher
WILEY-BLACKWELL
Citation
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, v.23, no.1, pp.64 - 70
Abstract
This article proposes a novel speech and sound segregation framework incorporating a technique for correcting a series of pitch periods based on particle filtering. The conventional pitch track correction method finds the peak locations of the autocorrelation functions to estimate the pitch period, and only the longest reliable pitch streak is used to correct unreliable pitch tracks. Especially in noisy environments, it is hard to find long and reliable pitch streaks, resulting in the degradation of the speech segregation performance. The proposed algorithm based on particle filtering considers all the reliable pitch streaks rather than the longest one and smoothly connects the scattered pitch streaks. To apply the particle filtering algorithm to pitch track correction, the importance weight computation to account for the degree of matchness of the found pitch to the individual spectro-temporal components is also proposed. The performance of the proposed method is evaluated by the results of speech segregation experiments for the mixtures of speech and various noise sources in various mixing signal-to-noise ratios (SNRs). The evaluation measures were SNR, energy loss ratio, and noise residue ratio of the segregated speech, and all these measures showed that the proposed segregation method achieved superior performance compared to the conventional approach.
URI
Go to Link
DOI
http://dx.doi.org/10.1002/ima.22039
ISSN
0899-9457
Appears in Collections:
ECE_Journal Papers

find_unist can give you direct access to the published full text of this article. (UNISTARs only)

Show full item record

qr_code

  • mendeley

    citeulike

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

MENU