Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic speech recognition method and automatic speech recognition system based on artificial intelligence

An automatic speech recognition and artificial intelligence technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of speech recognition technology that cannot be industrialized on a large scale, the number of models increases, and long iteration time, etc., to improve the quality of speech processing, The effect of speeding up convergence and simplifying the training process

Active Publication Date: 2020-02-21
成都无糖信息技术有限公司
View PDF12 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are thousands of common words in any language, and learning thousands of models requires not only a huge corpus, but also a long iteration time
In addition, Chinese is also divided into tonal and non-tonal characters, homophones, etc., resulting in a multiplied number of models
This has brought a lot of inconvenience to users, making speech recognition technology unable to be industrialized on a large scale

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic speech recognition method and automatic speech recognition system based on artificial intelligence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0044] Embodiment: the present embodiment provides a kind of automatic speech recognition system based on artificial intelligence, and it has mainly included four major modules, one, speech preprocessing module, two, speech feature extraction module, three, speech training recognition module and four, Text correction module.

[0045] One of them is the voice preprocessing module: before feature extraction, the original voice sequence is preprocessed, the purpose is to eliminate the aliasing, high-order harmonic distortion, Factors such as high frequency have an impact on the quality of the voice signal. Try to ensure that the signal obtained by subsequent speech processing is more uniform and smooth, provide high-quality parameters for signal parameter extraction, and improve the quality of speech processing.

[0046] The speech preprocessing module specifically includes the following parts:

[0047] 01 Speech detection module, which detects the endpoint of the speech and fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic speech recognition method and an automatic speech recognition system based on artificial intelligence. The system mainly comprises four modules of a speech preprocessing module, a speech feature extraction module, a speech training recognition module and a text correction module. According to the system, the speech training recognition module is adopted to learn the speech features and speech corresponding character codes, firstly, convolution learning of frequency spectrum characteristics is carried out through a feature learning layer, then the semantic information among the frequency spectrum features is learned through a semantic learning layer, finally, the comprehensively learned information is decoded through an output layer, and a correspondingtext is output. Therefore, a Chinese character mapping table is directly used for encoding and decoding labels, phoneme encoding and decoding of the text are not needed, then the text is decoded intothe text, and the training process is simplified.

Description

technical field [0001] The invention relates to the technical field of speech recognition in artificial intelligence, in particular to an automatic speech recognition technology based on artificial intelligence. Background technique [0002] Artificial Intelligence (AI for short) is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that responds in a manner similar to human intelligence. Research in this field includes robotics, speech recognition, computer vision, natural language processing and expert systems, etc. [0003] The development of existing automatic speech recognition technology mainly tends to two stages of training and decoding; training, that is, training the acoustic model through a large amount...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L15/16G10L15/18G10L15/22G10L15/26G10L25/24
CPCG10L15/02G10L15/063G10L15/16G10L15/1822G10L15/22G10L15/26G10L25/24
Inventor 漆伟马永霄童永鳌张瑞冬殷子凌张浩
Owner 成都无糖信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products