Speech language classifying method based on CNN and GRU fused deep neural network

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A deep neural network and classification method technology, which is applied in the field of speech language classification based on the fusion of CNN and GRU deep neural network, can solve the problems of difficult speech processing, diversity of speech signal collection and instability of manual extraction features, and achieve convenient business. The effect of docking, high recognition rate and strong robustness

Active Publication Date: 2019-03-26

深圳市网联安瑞网络科技有限公司

View PDF7 Cites 28 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, due to the limitations of the acoustic model and language model, and the diversity of human language in daily life due to the environment, personal diversity and colloquial arbitrariness, it brings great challenges to speech processing, resulting in great difficulties in speech processing.

[0004] Therefore, there is a need for a speech processing method that can cope with the diversity of speech signal collection and the instability of manual feature extraction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] The technical solutions of the present invention will be clearly and completely described below in conjunction with the embodiments. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0046] refer to Figure 1-2 , the present invention provides a technical solution: a speech language classification method based on the fusion of CNN and GRU deep neural network includes the following process: (1) the input terminal directly inputs the time-domain related speech spectrogram image; (2) model construction Use the CNN network to automatically extract high-level features in the time domain and frequency domain through a series of operations such as convolution, pooling, and normalization. (3) Connect the GRU neural ne...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech language classifying method based on a CNN and GRU fused deep neural network. The method comprises the following steps that S1, source audio data of a server is obtained, audio preprocessing is conducted, and the source audio data is cut; S2, audio data file information is read, and an audio data inventory CSV file is generated; S3, an audio data file is subjectedto short-time Fourier transformation, and two-dimensional speech spectrums associated with time and frequency domains of expansion of a series of frequency spectrum functions obtained after speech signal time domains are analyzed are obtained; S4, a model is built; S5, two-dimensional speech spectrum image data is input into the CNN and GRU fused speech language classifying deep neural network model, and language classification data is classified and output; S6, the language classification data and source audio data file information are stored. By means of the method, the problem about speechlanguage classification is solved, the method has the advantages of being automatic, high in identification rate, high in robustness, low in cost, high in portability and the like, and the business connection with a third-party system can be facilitated.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a speech language classification method based on CNN and GRU fusion deep neural network. Background technique [0002] With the advent of the multimedia era, computers are becoming more and more popular, and human-computer interaction is becoming more and more frequent. People are committed to making machines understand human speech instructions and operating and controlling machines through voice. Compared with traditional input devices such as keyboards and mice, speech technology, as an important means of human-computer interaction, has gradually become a key technology of human-computer interface in information transmission. [0003] At present, most of the traditional speech processing methods adopt the pattern matching strategy, and the language model method based on template matching and statistical probability is the main method of speech processing technology. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/08G10L15/16G10L15/06

CPCG10L15/063G10L15/08G10L15/16

Inventor贾宇沈宜邹严张明亮

Owner深圳市网联安瑞网络科技有限公司

Speech language classifying method based on CNN and GRU fused deep neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements:Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology