Unlock instant, AI-driven research and patent intelligence for your innovation.

A method and device for extracting multiple fundamental frequencies

A fundamental frequency, time-frequency unit technology, applied in the construction of hidden Markov model likelihood probability and transition probability, can solve the influence of multi-pitch frequency extraction, peak height and peak position offset, amplitude modulation rate variation, etc. question

Inactive Publication Date: 2019-04-02
INST OF AUTOMATION CHINESE ACAD OF SCI +2
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the case of multi-pitch extraction, there may be high-order harmonics with similar energy but belonging to different pitch frequencies in a time-frequency unit, which will cause its amplitude modulation rate to not belong to any pitch frequency. In the case of harmonics, this will cause the peak height and peak position of the corresponding autocorrelation function to be erroneously shifted, thereby negatively affecting the extraction of multi-pitch frequencies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for extracting multiple fundamental frequencies
  • A method and device for extracting multiple fundamental frequencies
  • A method and device for extracting multiple fundamental frequencies

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] It should be understood that the following detailed description of the various examples and drawings is not intended to limit the invention to the particular illustrative embodiments; the described illustrative embodiments merely exemplify the various steps of the invention, the scope of which is defined by the appended claims to define.

[0031] The invention decomposes the autocorrelation function of the time-frequency unit in the speech two-dimensional auditory spectrogram to obtain the dominant instantaneous frequency, and calculates the frequency matching function on the basis of it. Compared with the autocorrelation function, the frequency matching function can overcome the unfavorable amplitude modulation effect in the channel of the high-frequency gammatone filter bank when multiple fundamental frequencies are extracted, so the fundamental frequency state likelihood function constructed on the basis of the frequency matching function is more stable and reliable ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-fundamental frequency extraction method and a multi-fundamental frequency extraction device based on empirical mode decomposition and a hidden Markov model. The method comprises steps: an auditory filter bank is used for filtering a speech signal, and framing is carried out on the signal after filtering; an auto-correlation function is calculated on each time frequency unit for an auditory spectrum; on the basis of an intrinsic mode function obtained through the empirical mode decomposition, the instantaneous frequency of each time frequency unit dominant sound source is calculated; on the basis of each instantaneous frequency, a frequency matching function is calculated; the frequency matching function is used for building the likelihood probability of each fundamental frequency state, and a corpus is used for counting the transition probability between each fundamental frequency state and a fundamental frequency value; and the likelihood probability of each fundamental frequency state is enhanced, the enhanced likelihood probability is combined with the corresponding transition probability, and the hidden Markov model is used for extracting a multi-fundamental frequency track of the speech signal.

Description

technical field [0001] The invention relates to the decomposition of the empirical mode of digital signal processing, the analysis of the speech signal filter group, the extraction of the pitch frequency of the speech signal, the construction of the likelihood probability and transition probability of the hidden Markov model. Background technique [0002] Pitch extraction and track tracking are of great significance to many speech and audio signal processing technologies, such as audio retrieval and classification, Chinese intonation recognition, and single-channel speech separation technology. Some well-performed pitch extraction algorithms exist for detecting a single pitch in clean or slightly noisy speech. However, the assumption of a single fundamental frequency makes this type of algorithm unable to be used in the case where there are multiple fundamental frequencies in speech at the same time, such as the situation where two speakers speak at the same time or the situ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/18G10L15/14
CPCG10L15/142G10L25/18
Inventor 刘文举江巍王天正李杰梁基重李艳鹏乔利玮刘元华
Owner INST OF AUTOMATION CHINESE ACAD OF SCI