Wild animal weak supervision sound recognition method based on deep learning

A wild animal and voice recognition technology, applied in the field of wild animal voice recognition, can solve the problems of reducing the human, financial and material investment of existing voice annotation samples, and achieve the goal of saving development and deployment time, saving resources, and improving the accuracy of intelligent recognition. Effect

Pending Publication Date: 2022-04-29
CHINA FORESTRY STAR BEIJING TECH INFORMATION CO LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a wild animal weakly supervised sound recognition method based on deep learning to solve the problems existing in the existing wild animal sound recognition methods, reduce the human, financial and material input of the existing sound labeling samples, and pass the depth The method of learning, intelligent classification and recognition of the sounds of wild animals obtained

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Wild animal weak supervision sound recognition method based on deep learning
  • Wild animal weak supervision sound recognition method based on deep learning
  • Wild animal weak supervision sound recognition method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0040] Such as figure 1 As shown, a wild animal weakly supervised voice recognition method based on deep learning of the present invention mainly includes two steps, one is voice recognition model training, and the other is voice recognition model reasoning.

[0041] Wherein, the voice recognition model training mainly includes: audio and video channel separation of the video segment, image recognition, voice time domain conversion, training database creation, voice recognition model training and other steps. Such as figure 2 As shown, the specific process is as follows:

[0042] S1.1 Obtain a video collection containing wild animal images and sounds

[0043] In the field of wild animal protection, the common existing wild animal video surveillance camera that is set is triggered to shoot video immediately after being triggered by wild animals, and the vi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A wildlife weak supervision sound recognition method based on deep learning relates to the field of wildlife sound recognition, and comprises the following steps: training a sound recognition model: obtaining a video set containing wildlife images and sounds; extracting an image frame sequence in the video and a corresponding audio; identifying the image frame by using a deep learning network to obtain a category attribute tag; converting the audio time domain data into frequency domain data; creating a training database; performing voice recognition model training by utilizing the category attribute labels and the frequency domain data; voice recognition model reasoning: acquiring a video set only containing voices of wild animals; extracting audio data in the video; converting the audio time domain data into frequency domain data; and recognizing the frequency domain data by using the voice recognition model to obtain a final category attribute tag. According to the method, the sample labeling cost is reduced, the deep learning method is adopted, the feature template does not need to be manually screened, and the accuracy and the recognition efficiency are improved.

Description

technical field [0001] The invention relates to the technical field of wild animal voice recognition, in particular to a method for wild animal weakly supervised voice recognition based on deep learning. Background technique [0002] In the field of wildlife protection, in order to meet the needs of wildlife statistics, it is often necessary to obtain wild animal-related videos, images and sound clips by setting up wild animal video surveillance cameras, etc., and identify the captured wild animals through certain technical means type. [0003] At present, the research content of sound recognition technology is mainly divided into the following two aspects: one is the acquisition and separation of sound, such as the use of professional recording equipment to capture sound, signal processing and separation of audio signals, etc. The second is the classification and recognition of sounds, such as using traditional pattern recognition methods for audio recognition. Especially...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/26G10L19/02G10L25/30G06F17/14G06N3/04G06N3/08
CPCG10L17/26G10L25/30G10L19/0212G06F17/141G06N3/08G06N3/045
Inventor 王金龙蔡宇黄艳金
Owner CHINA FORESTRY STAR BEIJING TECH INFORMATION CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products