Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and apparatus for estimating fundamental tone period and adjudging unvoiced/voiced classification

A technology of pitch period and voicing, which is applied in speech analysis, instruments, etc., can solve problems such as misestimation of pitch period estimation, inaccurate pitch period estimation, poor accuracy of pitch period estimation and voicing judgment results, etc. To achieve the effect of improving accuracy

Inactive Publication Date: 2007-05-30
VIMICRO CORP
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0016]In practical applications, due to the formant structure in the input speech signal, the pitch period estimate may be misestimated as a multiple or fraction of the actual value; and the above method No formant structure removal is performed on the input speech signal, so the above approach will result in an inaccurate estimate of the pitch period
In addition, because the above method is too simple, pitch period estimation and voicing judgment cannot be accurately performed in the transition part of unvoiced sound, resulting in poor accuracy of pitch period estimation and voicing judgment results obtained by using the above method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for estimating fundamental tone period and adjudging unvoiced/voiced classification
  • Method and apparatus for estimating fundamental tone period and adjudging unvoiced/voiced classification
  • Method and apparatus for estimating fundamental tone period and adjudging unvoiced/voiced classification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0093] The following describes how to use the method of the present invention to perform pitch period estimation and voicing decision through an exemplary process.

[0094] FIG. 2 is an exemplary flowchart of a pitch period estimation and voiced decision method in Embodiment 1 of the present invention. Referring to Figure 2, the method includes the following steps:

[0095] Step 201: Perform preprocessing on the current voice signal frame to filter out high-order harmonic components and formant structures to obtain a preprocessed voice signal frame.

[0096] Before performing this step, the input speech signal may be divided into frames according to the prior art to obtain a current speech signal frame with a suitable frame size. In this step, spectrum flattening can be performed on the current voice signal frame first, and then a linear phase low-pass filter is used to filter out the high-frequency part of the spectrum-flattened voice signal frame, so that the input voice ca...

Embodiment 2

[0113] FIG. 3 is a schematic flowchart of a pitch period estimation and voiced judgment method in Embodiment 2 of the present invention. Referring to Figure 3, the method includes the following steps:

[0114] Step 301: Preprocess the current speech signal frame to obtain the preprocessed speech signal frame.

[0115] Before performing this step, the input speech signal may be divided into frames according to the prior art to obtain a current speech signal frame with a suitable frame size. In this step, first perform spectrum flattening on the current speech signal frame, and then use a linear phase low-pass filter to filter out the high-frequency part of the speech signal frame processed by the spectrum flattening, so as to filter out the high-order parts in the input speech signal. Harmonic components, remove formant structures in the input speech signal.

[0116] There are mainly two common spectrum flattening methods: performing linear predictive coding (LPC) inverse fil...

Embodiment 3

[0205] The pitch period estimation and voiced judgment device in this embodiment corresponds to the first mode of pitch period estimation and voiced judgment of the present invention.

[0206] FIG. 4 is a schematic structural diagram of an apparatus for pitch period estimation and voiced judgment in Embodiment 3 of the present invention. Referring to FIG. 4 , the apparatus includes: a preprocessing module 410 and a pitch period estimation / voiced decision unit.

[0207] In the device shown in FIG. 4 , the preprocessing module 410 is configured to perform preprocessing of filtering out high-order harmonic components and formant structures on the current speech signal frame, and sending the obtained preprocessing speech signal frame to the pitch period Estimation / voiced decision module 420;

[0208] The pitch period estimation / voiced decision module 420 is configured to perform pitch period estimation and voiced judgment on the preprocessed speech signal frame from the preproces...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method of evaluating the basic tone cycle and judging the surd and sonant is disclosed that contains following steps: A. the higher harmonic component is filtered and the resonance peak structure is removed on the present sound signal frame, and the pre-processing sound signal frame is obtained; B. the pre-processing sound signal frame is evaluated the basic tone cycle and judged the surd and sonant, the estimate value of the basic tone cycle and the judged result of surd and sonant are got. The invention also discloses a evaluating the basic tone cycle and multiple detecting devices, the device contain: pre-processing module and basic tone cycle evaluation / surd and sonant judgment module. In the invention, the estimate value of basic tone cycle of sound signal and the accuracy of judged result of surd and sonant are improved.

Description

technical field [0001] The invention relates to speech signal processing technology, in particular to a method and device for pitch period estimation and voicing judgment. Background technique [0002] In the technical field of speech signal processing, discrete time-domain generation models of speech signals are widely used in applications such as parametric speech coding, speech synthesis, and pitch scaling. [0003] FIG. 1 is a schematic diagram of an existing general speech signal discrete time domain generation model. Referring to Fig. 1, the model includes three parts: an excitation source 110, a vocal tract model 120 and a radiation model 130, wherein the excitation source 110 includes two branches of a voiced sound excitation unit 111 and an unvoiced sound excitation unit 112, and a voiced / unvoiced sound switch 113 . [0004] In the model shown in Fig. 1, voiced sound excitation unit 111 is used to generate the excitation of voiced sound according to the pitch freq...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/00G10L11/06G10L19/00G10L25/93
Inventor 邓昊冯宇红张晨
Owner VIMICRO CORP