Speech language classifying method based on CNN and GRU fused deep neural network

A deep neural network and classification method technology, which is applied in the field of speech language classification based on the fusion of CNN and GRU deep neural network, can solve the problems of difficult speech processing, diversity of speech signal collection and instability of manual extraction features, and achieve convenient business. The effect of docking, high recognition rate and strong robustness

Active Publication Date: 2019-03-26
深圳市网联安瑞网络科技有限公司
View PDF7 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the limitations of the acoustic model and language model, and the diversity of human language in daily life due to the environment, personal diversity and colloquial arbitrariness, it brings great challenges to speech pro

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech language classifying method based on CNN and GRU fused deep neural network
  • Speech language classifying method based on CNN and GRU fused deep neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The technical solutions of the present invention will be clearly and completely described below in conjunction with the embodiments. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0046] refer to Figure 1-2 , the present invention provides a technical solution: a speech language classification method based on the fusion of CNN and GRU deep neural network includes the following process: (1) the input terminal directly inputs the time-domain related speech spectrogram image; (2) model construction Use the CNN network to automatically extract high-level features in the time domain and frequency domain through a series of operations such as convolution, pooling, and normalization. (3) Connect the GRU neural ne...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech language classifying method based on a CNN and GRU fused deep neural network. The method comprises the following steps that S1, source audio data of a server is obtained, audio preprocessing is conducted, and the source audio data is cut; S2, audio data file information is read, and an audio data inventory CSV file is generated; S3, an audio data file is subjectedto short-time Fourier transformation, and two-dimensional speech spectrums associated with time and frequency domains of expansion of a series of frequency spectrum functions obtained after speech signal time domains are analyzed are obtained; S4, a model is built; S5, two-dimensional speech spectrum image data is input into the CNN and GRU fused speech language classifying deep neural network model, and language classification data is classified and output; S6, the language classification data and source audio data file information are stored. By means of the method, the problem about speechlanguage classification is solved, the method has the advantages of being automatic, high in identification rate, high in robustness, low in cost, high in portability and the like, and the business connection with a third-party system can be facilitated.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a speech language classification method based on CNN and GRU fusion deep neural network. Background technique [0002] With the advent of the multimedia era, computers are becoming more and more popular, and human-computer interaction is becoming more and more frequent. People are committed to making machines understand human speech instructions and operating and controlling machines through voice. Compared with traditional input devices such as keyboards and mice, speech technology, as an important means of human-computer interaction, has gradually become a key technology of human-computer interface in information transmission. [0003] At present, most of the traditional speech processing methods adopt the pattern matching strategy, and the language model method based on template matching and statistical probability is the main method of speech processing technology. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/08G10L15/16G10L15/06
CPCG10L15/063G10L15/08G10L15/16
Inventor 贾宇沈宜邹严张明亮
Owner 深圳市网联安瑞网络科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products