Method for identifying sound fault based on mel energy spectrum and convolution neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional neural network and fault identification technology, applied in speech analysis, instruments, etc., can solve the problems of large differences in staff discrimination, high detection costs, and slow information transmission speed, achieving strong separability, improving working conditions, The effect of saving manpower

Active Publication Date: 2019-04-09

广州丰石科技有限公司

View PDF6 Cites 30 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The purpose of the present invention is to solve the defects of large discriminative differences among staff members, slow information transmission speed and high detection cost in the prior art, and propose a sound fault identification method based on mel energy spectrum and convolutional neural network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0043] A sound fault identification method based on mel energy spectrum and convolutional neural network, comprising the following steps:

[0044] S1: Pre-emphasize the voice signal to increase the high-frequency resolution of the sound;

[0045] Pre-emphasizes the speech signal to increase the high-frequency resolution of the sound. The general transfer function of pre-emphasis is H(z)=1-az^-1. The present invention uses a first-order FIR high-pass filter to realize pre-emphasis, wherein a is a pre-emphasis coefficient, and the voice sampling value at n moments is x(n), and the result after pre-emphasis processing is y(n)=x(n) -ax(n-1), take a=0.95 here.

[0046] S2: Framing the voice signal. In terms of timing, a part of the audio data is intercepted at a certain interval to form a frame, and the interval time is the step size of the frame. Since the sound signal has short-term stationary characteristics, framing the audio helps to further subdivide the characteristics of...

Embodiment 2

[0075] A sound fault identification method based on mel energy spectrum and convolutional neural network, comprising the following steps:

[0076] S1: For the input audio data, pre-emphasize y(n)=x(n)-0.95*x(n-1) according to the following formula;

[0077] S2: Take the average value of the two-channel audio and change it to a single channel, and divide the data into frames with a single frame sampling point of 612 and a step size of 306

[0078] S3: Add a window to each frame, the window is a Hamming window, and the coefficient a=0.46

[0079] S4: Perform fast Fourier transform on each frame of data to generate energy spectrum

[0080] S5: passing the energy spectrum through a Mel-scale triangular band-pass filter. The number of filters is 64, the maximum frequency is 22050 (half of the sampling point frequency 44100)

[0081] S6: The data generated by S5, with the frequency domain as the Y axis and the time domain as the X axis, is converted into a Mei energy spectrum

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for identifying a sound fault based on a mel energy spectrum and a convolution neural network. The method comprises the following steps: first, performing pre-emphasison initially input audio data; then, performing framing and windowing processing on the data; after that, performing fast Fourier transform on framed and windowed data; extracting energy features ona frequency domain, processing an energy spectrum by using a set of mel-scale triangular filter banks; and after that, converting the data into a Mel energy spectrum by using energy in different frequency domains corresponding to each frame as a Y axis, and different frames in a time domain as an X axis. After that, the energy spectrum needs to be further framed to adapt to an input of a CNN (convolution neural network). Each frame is a sample, and one-hot coding of a label corresponding to each sample is used as an output of the CNN to train a CNN model until a network training error reachesthe minimum. During prediction, a probability value of each type of label is output, and a label with the largest probability value is taken as a final discrimination result.

Description

technical field [0001] The invention relates to the field of AI sound fault detection and recognition, and more specifically relates to a sound fault recognition method based on mel energy spectrum and convolutional neural network. Background technique [0002] Large-scale air-conditioning cooling equipment has been quite popular in practical applications, and manual detection is mostly used for the detection of its failure. The method of patrol inspection by professional staff can rely on the staff's senses and feelings to check the appearance, vibration, operating sound, etc. of the equipment, so as to determine whether the equipment is damaged, loose and other abnormalities. This method is simple and convenient, but the information obtained is very limited, and the knowledge, skills, experience, and observation ability of the staff vary greatly, and the inspection results are not satisfactory. Contents of the invention [0003] The purpose of the present invention is t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L25/51G10L25/30G10L25/21G10L25/18G10L25/15

CPCG10L25/15G10L25/18G10L25/21G10L25/30G10L25/51

Inventor 陈曦蓝志坚陈卓李学辉喻春霞容伯杰

Owner 广州丰石科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method for identifying sound fault based on mel energy spectrum and convolution neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology