Method and apparatus for detecting voice fundamental tone frequency

A pitch frequency detection technology, applied in the field of communications, that addresses three problems of autocorrelation-based methods: the difficulty of judging the peak of the autocorrelation function, insufficient precision, and the large amount of computation that autocorrelation requires.

Active Publication Date: 2014-05-14
SPREADTRUM COMM (SHANGHAI) CO LTD

AI Technical Summary

Problems solved by technology

Methods based on autocorrelation have the following defects. First, computing the autocorrelation is expensive. Second, the detected period is an integer number of samples, so the corresponding frequencies form a discrete set and the accuracy is insufficient; the problem is most prominent when the pitch frequency is high. Third, the peak of the autocorrelation function is difficult to judge: in most cases there are multiple peaks, corresponding to the fundamental frequency, harmonic frequencies, or interference noise, which often leads to a wrong judgment of the fundamental frequency.
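
The second defect can be illustrated with a little arithmetic. The following is a minimal sketch, assuming a sampling rate of 8000 Hz (an illustrative value, not taken from the patent); the function names are hypothetical. It shows that the gap between the frequencies implied by two neighbouring integer lags grows as the pitch rises.

```python
def lag_to_frequency(fs_hz: float, lag_samples: int) -> float:
    """Pitch frequency implied by an integer autocorrelation lag."""
    return fs_hz / lag_samples


def quantization_step(fs_hz: float, lag_samples: int) -> float:
    """Frequency gap between two neighbouring integer lags."""
    return lag_to_frequency(fs_hz, lag_samples) - lag_to_frequency(fs_hz, lag_samples + 1)


fs = 8000.0  # assumed sampling rate, not taken from the patent
low_pitch_step = quantization_step(fs, 80)   # lag 80 corresponds to 100 Hz
high_pitch_step = quantization_step(fs, 20)  # lag 20 corresponds to 400 Hz
print(f"step near 100 Hz: {low_pitch_step:.2f} Hz")
print(f"step near 400 Hz: {high_pitch_step:.2f} Hz")
```

Under these assumptions the step near 400 Hz is more than ten times the step near 100 Hz, which is why integer-lag resolution hurts most for high-pitched voices.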

Examples

Embodiment 1

[0140] Embodiment 1: The frequency $T_f[k]$ corresponding to the monotone component $X_A[k]$ is $k f_s / N$, where $N$ is the frame length of the speech signal and $f_s$ is its sampling frequency. This is an approximate representation of the frequency $T_f[k]$.
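
As a quick worked example of this mapping, the sketch below converts a spectral bin index to a frequency. The sampling rate and frame length are illustrative assumptions, and `bin_to_frequency` is a hypothetical helper name. It also makes the limitation visible: neighbouring bins are spaced by f_s / N, so the estimate can be off by up to half a bin.

```python
def bin_to_frequency(k: int, fs_hz: float, frame_len: int) -> float:
    """Approximate frequency of spectral bin k: T_f[k] = k * f_s / N."""
    return k * fs_hz / frame_len


fs = 8000.0  # assumed sampling frequency f_s
N = 256      # assumed frame length N

freq_at_bin_10 = bin_to_frequency(10, fs, N)
bin_spacing = bin_to_frequency(1, fs, N)
print(freq_at_bin_10, bin_spacing)
```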

Embodiment 2

[0141] Embodiment 2: The frequency $T_f[k]$ corresponding to the monotone component $X_A[k]$ is $k_f f_s / N$. Here, $k_f$ is the highest point obtained by fitting a second-order polynomial to $X_A[k-1]$, $X_A[k]$ and $X_A[k+1]$. The fit makes the second-order polynomial curve $a x^2 + b x + c = y$ pass through the three points $\{k-1, X_A[k-1]\}$, $\{k, X_A[k]\}$, $\{k+1, X_A[k+1]\}$; the maximum of the curve then appears at

[0142] $$k_f = x = -\frac{b}{2a} = k + \frac{X_A[k-1] - X_A[k+1]}{\ldots}$$
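
The formula in [0142] is truncated in this excerpt. The sketch below therefore does not quote the patent; it uses the standard vertex of a parabola through three equally spaced points, for which the denominator is 2(X_A[k-1] - 2 X_A[k] + X_A[k+1]). The function names and the fallback for a non-peak input are assumptions made for illustration.

```python
def parabolic_peak_index(k: int, left: float, centre: float, right: float) -> float:
    """Fractional index of the vertex of the parabola through
    (k-1, left), (k, centre), (k+1, right).

    Standard three-point result, not copied from the patent text:
        k_f = k + (left - right) / (2 * (left - 2*centre + right))
    """
    curvature = left - 2.0 * centre + right
    if curvature >= 0.0:
        # Not a strict local maximum: keep the integer index (assumed fallback).
        return float(k)
    return k + (left - right) / (2.0 * curvature)


def refined_frequency(k_f: float, fs_hz: float, frame_len: int) -> float:
    """T_f[k] = k_f * f_s / N with a fractional bin index k_f."""
    return k_f * fs_hz / frame_len


# Samples of y = 10 - (x - 5.3)^2 at x = 4, 5, 6; the true vertex is x = 5.3.
k_f = parabolic_peak_index(5, 8.31, 9.91, 9.51)
print(k_f, refined_frequency(k_f, 8000.0, 256))
```

Because the test points lie exactly on a parabola, the fit recovers the vertex exactly; on a real magnitude spectrum the result is only an approximation of the true peak position.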

Embodiment 3

[0145] Embodiment 3: The frequency $T_f[k]$ corresponding to the monotone component $X_A[k]$ is $k_f f_s / N$. Here, $k_f$ is the highest point obtained by isosceles triangle matching on $X_A[k-1]$, $X_A[k]$ and $X_A[k+1]$.

[0146] Isosceles triangle matching places the three points $\{k-1, X_A[k-1]\}$, $\{k, X_A[k]\}$, $\{k+1, X_A[k+1]\}$ on the two equal sides of an isosceles triangle whose base is parallel to the index axis. Optionally, the magnitude values $X_A[k-1]$, $X_A[k]$ and $X_A[k+1]$ used for the matching can be replaced by their log-domain values. Figure 4 is a schematic diagram of an embodiment of the isosceles triangle matching of the present invention, in which $X_A[k-1] < X_A[k+1]$.

[0147] If $X_A[k-1] < X_A[k+1]$, the vertex of the isosceles triangle appears at

[0148] $$k_f = k + \frac{1}{2} \ldots$$
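
The formula in [0148] is also truncated here, so the following is a sketch derived only from the geometric description in [0146], not a quotation of the patent. It assumes that when X_A[k-1] < X_A[k+1] the points k-1 and k lie on the rising side and k+1 on the falling side, with equal and opposite slopes; intersecting the two sides gives an offset of one half times (right - left) / (centre - left), and the mirrored case follows by symmetry. The function name and the fallbacks are hypothetical.

```python
def triangle_peak_index(k: int, left: float, centre: float, right: float) -> float:
    """Fractional index of the apex of an isosceles triangle whose two
    equal sides pass through (k-1, left), (k, centre), (k+1, right).

    Derived from the geometric description; the patent's own closed form
    is truncated in this excerpt.
    """
    if left < right:
        # Points k-1 and k are on the rising side, k+1 on the falling side.
        rise = centre - left
        if rise <= 0.0:
            return float(k)  # assumed fallback: centre is not a peak
        return k + 0.5 * (right - left) / rise
    if left > right:
        # Mirror image: k-1 on the rising side, k and k+1 on the falling side.
        fall = centre - right
        if fall <= 0.0:
            return float(k)
        return k - 0.5 * (left - right) / fall
    return float(k)  # symmetric neighbours: apex sits exactly on bin k


# Samples of y = 10 - 2*|x - 5.25| at x = 4, 5, 6; the true apex is x = 5.25.
apex_right = triangle_peak_index(5, 7.5, 9.5, 8.5)
# Mirrored triangle with apex at x = 4.75.
apex_left = triangle_peak_index(5, 8.5, 9.5, 7.5)
print(apex_right, apex_left)
```

To follow the optional log-domain variant from [0146], the same function could be called with the logarithms of the three magnitudes instead of the magnitudes themselves.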

Abstract

The invention discloses a method and apparatus for detecting the pitch (fundamental tone) frequency of voice. The method comprises: detecting the monotone components in a voice signal frame; determining candidate fundamental frequencies from the frequencies of the M monotone components with the largest spectral amplitudes; placing the candidate fundamental frequencies in a candidate fundamental frequency set; calculating the harmonious criterion of each candidate in the set; and taking the candidate with the largest harmonious criterion as the fundamental frequency of the voice signal frame. Because monotone components with large energy serve as the basis of pitch detection, the accuracy of pitch detection in noisy environments is increased.
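
The steps in the abstract can be sketched end to end as follows. This excerpt does not define how monotone components are detected, how candidates are derived from them, or what the harmonious criterion is, so every one of those choices below is an assumption made for illustration: components are taken as strict local maxima of the magnitude spectrum, candidates are the peak frequencies divided by small integers within an assumed pitch range, and the criterion is the mean spectral magnitude at the first few harmonics. All names and default values are hypothetical.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Magnitude of a plain DFT for bins 0 .. N/2 (kept dependency-free)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(frame):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        spectrum.append(abs(acc))
    return spectrum


def strongest_components(spectrum, m):
    """Assumption: a monotone component is a strict local spectral maximum.
    Returns the bin indices of the m largest ones."""
    peaks = [k for k in range(1, len(spectrum) - 1)
             if spectrum[k] > spectrum[k - 1] and spectrum[k] > spectrum[k + 1]]
    return sorted(peaks, key=lambda k: spectrum[k], reverse=True)[:m]


def candidate_set(peak_freqs, f_min, f_max, max_divisor=4):
    """Assumption: each strong component may be harmonic 1..max_divisor
    of the pitch, so f/d is a candidate when it lies in [f_min, f_max]."""
    candidates = set()
    for f in peak_freqs:
        for d in range(1, max_divisor + 1):
            if f_min <= f / d <= f_max:
                candidates.add(f / d)
    return sorted(candidates)


def harmonic_score(f0, spectrum, fs, n, n_harmonics=5):
    """Assumed stand-in for the harmonious criterion: mean magnitude at
    the bins nearest to the first n_harmonics multiples of f0."""
    total = 0.0
    for h in range(1, n_harmonics + 1):
        k = round(h * f0 * n / fs)
        if k < len(spectrum):
            total += spectrum[k]
    return total / n_harmonics


def estimate_pitch(frame, fs, m=3, f_min=60.0, f_max=500.0):
    """Pick the candidate with the largest score, or None if there is none."""
    n = len(frame)
    spectrum = magnitude_spectrum(frame)
    peak_freqs = [k * fs / n for k in strongest_components(spectrum, m)]
    candidates = candidate_set(peak_freqs, f_min, f_max)
    if not candidates:
        return None
    return max(candidates, key=lambda f0: harmonic_score(f0, spectrum, fs, n))


# Synthetic voiced frame: 250 Hz fundamental plus four harmonics.
fs, n, f0 = 8000.0, 256, 250.0
amplitudes = [1.0, 0.8, 0.6, 0.4, 0.3]
frame = [sum(a * math.sin(2 * math.pi * (h + 1) * f0 * t / fs)
             for h, a in enumerate(amplitudes)) for t in range(n)]
pitch = estimate_pitch(frame, fs)
print(pitch)
```

Averaging over a fixed number of harmonics is a deliberately simple choice: it penalises sub-harmonic candidates such as half the true pitch, whose odd harmonics fall on empty bins. The integer bin frequencies used here could be refined with either of the interpolation schemes from Embodiments 2 and 3.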

Description

Technical field

[0001] The present invention relates to the field of communications, and in particular to a method and device for detecting the pitch frequency of voice.

Background

[0002] Human speech consists of unvoiced and voiced sounds. An unvoiced sound is the frictional sound produced when air rushes at high speed through the narrowed glottis and becomes turbulent. This kind of sound accounts for a small proportion of speech; the vocal cords do not vibrate when it is produced, so it has no periodicity and resembles broadband noise. A voiced sound is produced when the airflow makes the closed vocal cords vibrate, and it is periodic. In terms of frequency composition, it consists of a fundamental tone and a series of harmonics, and the frequency of the fundamental tone is the reciprocal of the sound period. Pitch frequency is one of the most basic parameters in speech signal processing, and it plays a very important role in var...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/90
Inventors: 吴晟, 林福辉, 徐晶明, 蒋斌
Owner: SPREADTRUM COMM (SHANGHAI) CO LTD