Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Infant cry detection deep learning method

A technology of deep learning and crying, which is applied in the field of deep learning of baby crying detection, which can solve the problems of low system robustness and easy to be affected by the environment, and achieve high recognition rate and rich dimensions

Active Publication Date: 2020-06-23
浙江芯劢微电子股份有限公司
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The current speech signal recognition system usually consists of three parts, which are speech signal preprocessing, feature extraction and classification. Feature extraction is the most important part, and its quality directly affects the recognition results. The speech gender proposed by previous researchers Most of the features are based on the prosody features and sound quality features of speech, which are all artificially designed features. The robustness of the system is not high, and it is easily affected by the environment.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Infant cry detection deep learning method
  • Infant cry detection deep learning method
  • Infant cry detection deep learning method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0061] see Figure 1-3 Shown, the present invention is a kind of deep learning method of infant cry detection, and this method comprises the following steps:

[0062] S001: voice signal collection;

[0063] S002: Framing the voice signal segment into frames, and extracting cochlear voice features for each frame;

[0064] S003: Perform baby cry detection to output the detection result, and the baby cry detection includes the following steps:

[0065] S0031: Estab...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an infant cry detection deep learning method, and relates to the technical field of voice signal processing. The method comprises the following steps: a, acquiring a voice signal; b, framing the voice signal segment and performing cochlear voice feature extraction on each frame; and c, inputting the adjacent N frames of voice features into a pre-trained baby cry detection deep learning model for reasoning and judging whether cry exists or not and d, voting the N frames of classification results by applying a majority-first voting principle, and judging whether baby cryexists in the N frames or not. The cochlear speech features adopted by the method are speech feature parameters more conforming to auditory perception analysis of people, and the convolutional networkand the long-short-term memory recurrent neural network are adopted as acoustic inference models for infant cry detection, so that the method can adapt to a speech environment with a low signal-to-noise ratio, and has higher accuracy compared with a traditional method.

Description

technical field [0001] The invention belongs to the technical field of voice signal processing, and in particular relates to a deep learning method for baby crying detection. Background technique [0002] In modern society, parents or elders tend to neglect newborn babies due to busy work or housework, and babies can only express their emotions and needs by crying. Therefore, there is a great market demand for home care based on baby cry detection. [0003] The current speech signal recognition system usually consists of three parts, which are speech signal preprocessing, feature extraction and classification. Feature extraction is the most important part, and its quality directly affects the recognition results. The speech gender proposed by previous researchers Most of the features are based on the prosody features and sound quality features of speech, which are all artificially designed features. The robustness of the system is not high, and it is easily affected by the e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/63G10L25/45G10L25/30G10L15/02G10L15/04G10L15/08
CPCG10L25/63G10L25/30G10L25/45G10L15/02G10L15/04G10L15/08
Inventor 罗世操
Owner 浙江芯劢微电子股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products