Bird voice recognition method using anti-noise power normalization cepstrum coefficients (APNCC)

A cepstral coefficient, sound recognition technology, used in speech recognition, speech analysis, instruments, etc.

Inactive Publication Date: 2013-02-13
FUZHOU UNIV
View PDF7 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to propose a bird voice recognition technology based on new anti-noise feature extraction for the bird voice recognition problem under various background noises in the ecological environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bird voice recognition method using anti-noise power normalization cepstrum coefficients (APNCC)
  • Bird voice recognition method using anti-noise power normalization cepstrum coefficients (APNCC)
  • Bird voice recognition method using anti-noise power normalization cepstrum coefficients (APNCC)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The present invention will be further described below in conjunction with the accompanying drawings and implementation examples.

[0020] The general noise power spectrum estimation algorithm cannot effectively estimate the highly non-stationary background noise in the real environment. Therefore, the present invention is based on an improved noise estimation algorithm that has good adaptability to both stable and highly non-stationary environmental sounds. [11] Perform noise power spectrum estimation. like figure 1 as shown, figure 1 It is a schematic flow chart of the present invention. The method includes: step S01: calculate the noise power spectrum according to the noise estimation algorithm suitable for highly non-stationary environments; step S02: use multi-band spectrum subtraction to perform noise reduction processing on the sound power spectrum; step S03: combine the noise-reduced sound Power Spectrum Extraction Anti-Noise Power Normalized Cepstral Coeffici...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a bird voice recognition technology based on novel noise-proof feature extraction by aiming at the problem of bird voice recognition in various kinds of background noise in ecological environment. The bird voice recognition technology comprises the following steps of firstly, obtaining noise power spectrums by a noise estimation algorithm suitable for highly nonstationary environment; secondly, performing the noise reduction on the voice power spectrums by a multi-band spectral subtraction method; thirdly, extracting anti-noise power normalization cepstrum coefficients (APNCC) by combining the voice power spectrums for noise reduction; and finally, performing contrast experiments under the conditions of different environments and signal to noise ratios (SNR) on the voice of 34 species of birds by means of extracted APNCC, power normalization cepstrum coefficient (PNCC) and Mel frequency cepstrum coefficients (MFCC) by a support vector machine (SVM). The experiments show that the extracted APNCC have a better average recognition effect and higher noise robustness and are more suitable for bird voice recognition in the environment with less than 30 dB of SNR.

Description

technical field [0001] The invention relates to a bird voice recognition method using anti-noise power normalized cepstrum coefficient. Background technique [0002] The sound of birds in the ecological environment contains a wealth of information. For example, by judging whether there are specific bird calls in a certain area throughout the year, we can understand the ecological status and climate change of the area. By using the technology of automatic monitoring and recognition of bird voices in forests, fields and other places to detect endangered birds, it is beneficial for humans to discover their whereabouts in time and take corresponding protection measures. The recognition of bird sounds can not only analyze the behavior and other characteristics of birds themselves, but also analyze the external ecological environment and related impact areas related to birds. [0003] In recent years, with reference to relatively mature speech recognition technology, scholars hav...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/26G10L17/08G10L15/20
Inventor 颜鑫李应
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products