
Method and apparatus for estimating fundamental tone period and adjudging unvoiced/voiced classification

A pitch period estimation and voicing decision technology, applied in speech analysis, instruments, etc.

Inactive Publication Date: 2010-09-29
VIMICRO CORP

AI Technical Summary

Problems solved by technology

[0016] In practical applications, because of the formant structure in the input speech signal, the pitch period may be misestimated as a multiple or a fraction of its actual value. The above method does not remove the formant structure from the input speech signal, so it yields an inaccurate pitch period estimate.
In addition, because the above method is too simple, it cannot accurately perform pitch period estimation and voicing judgment in unvoiced/voiced transition segments, so the pitch period estimates and voicing judgments it produces have poor accuracy.



Examples


Embodiment 1

[0089] How to use the method of the present invention to perform pitch period estimation and voicing judgment is described below through an exemplary process.

[0090] Figure 2 is an exemplary flowchart of the method for pitch period estimation and voicing determination in Embodiment 1 of the present invention. Referring to Figure 2, the method includes the following steps:

[0091] Step 201: Perform preprocessing on the current speech signal frame to filter out higher harmonic components and remove formant structures, to obtain a preprocessed speech signal frame.

[0092] Before performing this step, the input speech signal may be divided into frames according to the prior art to obtain a current speech signal frame with an appropriate frame size. In this step, the spectrum flattening process can be performed on the current speech signal frame first, and then the high-frequency part of the speech signal frame processed by the spectrum flattening process is filtered out by us...

Embodiment 2

[0109] Figure 3 is a schematic flowchart of the method for pitch period estimation and voicing determination in Embodiment 2 of the present invention. Referring to Figure 3, the method includes the following steps:

[0110] Step 301: Perform preprocessing on the current speech signal frame to obtain a preprocessed speech signal frame.

[0111] Before performing this step, the input speech signal may be divided into frames according to the prior art to obtain a current speech signal frame of an appropriate size. In this step, the spectrum of the current speech signal frame is first flattened, and the high-frequency part of the flattened frame is then filtered out with a linear-phase low-pass filter, so as to filter out the higher harmonic components of the input speech signal and remove its formant structure.
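The two preprocessing stages described in [0111] can be sketched as follows. This is only an illustration under stated assumptions, not the patent's implementation: it assumes LPC inverse filtering as the spectrum flattening method, a Hamming-windowed sinc design for the linear-phase low-pass filter, and hypothetical parameter values (LPC order 10, 900 Hz cutoff, 31 taps). All function names are invented for the example.

```python
import math

def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of one frame."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]

def lpc(x, order):
    """Levinson-Durbin recursion. Returns a[0..order] with a[0] == 1,
    so that the prediction error is e[n] = sum_k a[k] * x[n-k]."""
    r = autocorr(x, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:          # silent or degenerate frame: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err          # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= (1.0 - k * k)
    return a

def fir_filter(b, x):
    """Causal FIR filter with zero initial state: y[n] = sum_k b[k] * x[n-k]."""
    return [sum(b[k] * x[n - k] for k in range(len(b)) if n - k >= 0)
            for n in range(len(x))]

def lowpass_taps(num_taps, cutoff_hz, fs):
    """Hamming-windowed sinc low-pass. Symmetric taps give linear phase."""
    fc = cutoff_hz / fs
    mid = (num_taps - 1) / 2.0
    taps = []
    for n in range(num_taps):
        t = n - mid
        h = 2.0 * fc if t == 0 else math.sin(2.0 * math.pi * fc * t) / (math.pi * t)
        w = 0.54 - 0.46 * math.cos(2.0 * math.pi * n / (num_taps - 1))
        taps.append(h * w)
    total = sum(taps)
    return [v / total for v in taps]    # normalize to unit gain at DC

def preprocess(frame, fs, lpc_order=10, cutoff_hz=900.0, num_taps=31):
    """Assumed pipeline: flatten the spectrum, then low-pass the result."""
    a = lpc(frame, lpc_order)
    flattened = fir_filter(a, frame)    # inverse filter A(z) removes the spectral envelope
    return fir_filter(lowpass_taps(num_taps, cutoff_hz, fs), flattened)
```

For example, `preprocess(frame, fs=8000)` returns a frame of the same length as its input. The cutoff would in practice be chosen just above the highest expected pitch frequency; the value here is a placeholder.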

[0112] There are two commonly used spectrum flattening methods: linear predictive coding (...

Embodiment 3

[0202] The apparatus for pitch period estimation and voicing determination in this embodiment corresponds to the first mode of pitch period estimation and voicing determination in the present invention.

[0203] Figure 4 is a schematic structural diagram of the device for pitch period estimation and voicing determination in Embodiment 3 of the present invention.

[0204] Referring to Figure 4, the device includes a preprocessing module 410 and a pitch period estimation / voicing judgment module 420.

[0205] In the device shown in Figure 4, the preprocessing module 410 is used to preprocess the current speech signal frame by filtering out higher harmonic components and removing formant structures, and to send the resulting preprocessed speech signal frame to the pitch period estimation / voicing judgment module 420;

[0206] The pitch period estimation / voicing judgment module 420 is used to perform pitch period estimation and voicing judgment on the preproc...
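The two-module structure of Figure 4 can be illustrated as a small pipeline in which module 410 feeds module 420. The sketch below is hypothetical: the source text is truncated before it describes how module 420 works, so a plain normalized-autocorrelation estimator with a fixed voicing threshold stands in for it, and mean removal stands in for the real preprocessing. Every name, lag range and threshold is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Frame = List[float]

@dataclass
class PitchVoicingDevice:
    """Mirrors Figure 4: a preprocessing stage followed by an estimation/decision stage."""
    preprocess: Callable[[Frame], Frame]                      # role of module 410
    estimate_and_decide: Callable[[Frame], Tuple[int, bool]]  # role of module 420

    def process(self, frame: Frame) -> Tuple[int, bool]:
        # Module 410 sends its output frame to module 420.
        return self.estimate_and_decide(self.preprocess(frame))

def remove_mean(frame: Frame) -> Frame:
    """Placeholder for module 410 (assumption: NOT the patent's preprocessing)."""
    m = sum(frame) / len(frame)
    return [v - m for v in frame]

def autocorr_pitch(frame: Frame, min_lag: int = 20, max_lag: int = 160,
                   threshold: float = 0.5) -> Tuple[int, bool]:
    """Placeholder for module 420 (assumption): pick the lag with the largest
    normalized autocorrelation and call the frame voiced if it reaches the threshold.
    Returns (pitch period in samples, voiced flag); the period is 0 when unvoiced."""
    energy = sum(v * v for v in frame)
    if energy == 0.0:
        return 0, False
    best_lag, best_val = 0, 0.0
    for lag in range(min_lag, min(max_lag, len(frame) - 1) + 1):
        val = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag)) / energy
        if val > best_val:
            best_lag, best_val = lag, val
    voiced = best_val >= threshold
    return (best_lag if voiced else 0), voiced

device = PitchVoicingDevice(preprocess=remove_mean, estimate_and_decide=autocorr_pitch)
```

Keeping the two stages as separate callables means either one can be swapped out, for instance replacing the mean-removal placeholder with a spectrum-flattening and low-pass stage, without touching the other.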


Abstract

Disclosed is a method for pitch period estimation and voiced/unvoiced decision, comprising the following steps: A. preprocessing the current speech signal frame by filtering out higher harmonic components and removing the formant structure, to obtain a preprocessed speech signal frame; B. performing pitch period estimation and voiced/unvoiced decision on the preprocessed speech signal frame, to obtain a pitch period estimate and a voiced/unvoiced decision result. The invention also discloses a device for pitch period estimation and voiced/unvoiced decision, comprising a preprocessing module and a pitch period estimation / voiced/unvoiced decision module. The invention improves the accuracy of both the pitch period estimate and the voiced/unvoiced decision result for speech signals.

Description

Technical field

[0001] The invention relates to speech signal processing technology, and in particular to a method and device for pitch period estimation and voicing judgment.

Background technique

[0002] In the technical field of speech signal processing, discrete time-domain generation models of speech signals are widely used in applications such as parametric speech coding, speech synthesis, and pitch scaling.

[0003] Figure 1 is a schematic diagram of an existing general-purpose discrete time-domain generation model for speech signals. Referring to Figure 1, the model includes three parts: an excitation source 110, a vocal tract model 120 and a radiation model 130. The excitation source 110 includes two branches, a voiced sound excitation unit 111 and an unvoiced sound excitation unit 112, and a voiced / unvoiced sound switch 113.

[0004] In the model shown in Figure 1, the voiced sound excitation unit 111 is used to generate the excitation of voiced sounds according to the i...
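The three-part generation model of Figure 1 can be illustrated with a short sketch. The text is truncated before it specifies each block, so the choices below are textbook assumptions rather than the patent's definitions: an impulse train for the voiced excitation unit 111, uniform white noise for the unvoiced excitation unit 112, an all-pole filter for the vocal tract model 120, and a first-order difference for the radiation model 130. All function names and parameter values are hypothetical.

```python
import random

def voiced_excitation(n, pitch_period):
    """Role of unit 111 (assumed form): impulse train, one pulse per pitch period."""
    return [1.0 if i % pitch_period == 0 else 0.0 for i in range(n)]

def unvoiced_excitation(n, rng):
    """Role of unit 112 (assumed form): white noise."""
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]

def vocal_tract(x, a):
    """Role of model 120 (assumed form): all-pole filter 1/A(z), with a[0] == 1.
    y[n] = x[n] - sum_{k>=1} a[k] * y[n-k]"""
    y = []
    for n, v in enumerate(x):
        y.append(v - sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0))
    return y

def radiation(x, alpha=0.95):
    """Role of model 130 (assumed form): first-order difference y[n] = x[n] - alpha * x[n-1]."""
    return [x[n] - alpha * (x[n - 1] if n > 0 else 0.0) for n in range(len(x))]

def synthesize(n, voiced, pitch_period, a, gain=1.0, seed=0):
    """The 'voiced' flag plays the role of switch 113 by selecting the excitation branch."""
    rng = random.Random(seed)
    excitation = voiced_excitation(n, pitch_period) if voiced else unvoiced_excitation(n, rng)
    return radiation(vocal_tract([gain * v for v in excitation], a))
```

Pitch period estimation and voicing judgment are the inverse problem for this model: given the output signal, recover `pitch_period` and the position of the switch.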


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L11/00, G10L11/06, G10L19/00, G10L25/93
Inventor: 邓昊, 冯宇红, 张晨
Owner: VIMICRO CORP