Real-time sound classification method and system based on FPGA

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A classification method and sound technology, applied in speech analysis, biological neural network models, instruments, etc., can solve the problems of large amount of parameters, high power consumption of CPU platform, high computational complexity, etc., and achieve fast processing speed, low cost, and small size small effect

Active Publication Date: 2021-02-23

UNIV OF ELECTRONICS SCI & TECH OF CHINA

View PDF10 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Due to the high computational complexity and large amount of parameters of MFCC and CNN, they are generally implemented on the CPU (or GPU) platform. When implemented on the CPU platform, the network scale is large, and it is not easy to meet the real-time requirements. The CPU platform has high power consumption and high cost, which is not conducive to portable deployment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0073] An FPGA-based real-time sound classification method, such as figure 1 shown, including the following steps:

[0074] S1. Acquiring sound digital data;

[0075] Connect the microphone to the FPGA to obtain sound digital data. The sound data sampling rate is 22050Hz, and the 1.61s sound data is processed as a block;

[0076] S2. Extracting sound features in the acquired sound digital data to obtain a sound feature map;

[0077] The sound digital data is first input into the sound feature extraction module in the FGPA, and the sound feature is extracted to obtain the MFSC sound feature map;

[0078] In this embodiment, the sound data is first input into the framing module, and the sound data is divided into data frames with a length of 512. The number of moving data points between frames is 220. The input is a pipeline operation, and the output is a data frame with a length of 512. Among them, in order to facilitate FFT processing, N points are usually taken as one fram...

Embodiment 2

[0110] A real-time sound classification system based on FPGA, including the following modules:

[0111] Acquisition module, is used for obtaining sound digital data;

[0112] The sound MFSC feature extraction module is used to extract the sound features in the acquired sound digital data, and obtain the sound feature map;

[0113] The CNN classification network module is used to obtain the sound feature map for classification calculation and obtain the sound judgment probability.

[0114] The output module is used to obtain the classification result of the sound according to the maximum probability of the sound judgment.

[0115] Further, the sound MFSC feature extraction module includes:

[0116] A framing module is used for framing the sound digital data to form a data frame;

[0117] Among them, the asynchronous FIFO is used to perform frame division operation on the input sound data.

[0118] A windowing module is used to add a hann window to the data frame to obtain t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a real-time sound classification method based on an FPGA, wherein the method comprises the steps: carrying out feature extraction of sound data through the FPGA, obtaining an MFSC feature map of the sound data, carrying out calculation of the obtained MFSC feature map through a CNN classification network, and achieving the classification function of collected sounds; and monitoring and classifying the external sound conveniently and quickly anytime and anywhere. The system has the advantages of low power consumption, low cost, portability, real-time performance, high practicability and the like.

Description

technical field [0001] The invention relates to the field of sound recognition and processing, in particular to an FPGA-based real-time sound classification method and system. Background technique [0002] Acoustic signals contain rich information, and are one of the important information sources for human beings to perceive the environment, as well as an important feature to reflect human behavior. At the same time, sound signals can be received outside the field of vision and are not affected by light. The required storage space and subsequent processing calculation difficulty are lower than those of video signals, which makes sound classification widely used in many fields, including navigation. , intelligent robots, security monitoring, sound event tracking and positioning, nature protection, public safety and other fields. [0003] For sound classification, the research direction mainly focuses on the feature extraction and pattern classification of sound signals, and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L25/51G06N3/04G10L25/03G10L25/18G10L25/30G10L25/45

CPCG10L25/51G10L25/03G10L25/45G10L25/18G10L25/30G06N3/048G06N3/045

Inventor肖卓凌柴进孟子杰王志轩阎波袁子强

OwnerUNIV OF ELECTRONICS SCI & TECH OF CHINA

Real-time sound classification method and system based on FPGA

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology