End-to-end speech recognition method, electronic device and computer readable storage medium

A speech recognition and electronic device technology, applied in the fields of speech recognition, speech analysis, and biological neural network models. It addresses the problems of a complicated process, the inability to recognize mixed speech source input, and a large processing load, so as to reduce the amount of computation and simplify the speech recognition process.

Pending Publication Date: 2019-01-15
PING AN TECH (SHENZHEN) CO LTD
Cites: 6; Cited by: 39

AI Technical Summary

Problems solved by technology

[0003] The present invention provides an end-to-end speech recognition method, an electronic device, and a computer-readable storage medium to solve the problem that the existing speech recognition method and system have a larg

Example Embodiment

[0054] The embodiments of the present invention will be described below with reference to the drawings. Those of ordinary skill in the art may realize that the described embodiments can be modified in various different ways or combinations thereof without departing from the spirit and scope of the present invention. Therefore, the drawings and description are illustrative in nature, and are not used to limit the protection scope of the claims. In addition, in this specification, the drawings are not drawn to scale, and the same reference numerals denote the same parts.

[0055] Figure 1 is a schematic flow diagram of the end-to-end speech recognition method of the present invention. As shown in Figure 1, the present invention provides an end-to-end speech recognition method that is applied to an electronic device, which can be implemented in software and/or hardware. The end-to-end speech recognition method includes:

[0056] Step S1: Obtain a first...

Abstract

The invention relates to the technical field of speech recognition and discloses an end-to-end speech recognition method. The method includes: acquiring a first mixed speech signal of a plurality of speakers, together with tag sequences, as training samples; building a neural network model based on an Encoder-Decoder architecture; training the neural network model; acquiring a second mixed speech signal, to be recognized, that includes a plurality of speakers; and inputting the second mixed speech signal into the trained neural network model to output the text information corresponding to each speaker. For a mixed speech source input formed by several speakers talking simultaneously, the invention outputs the spoken content of each speaker separately. It does not need an explicit speech segmentation stage to generate a plurality of independent outputs from the single-channel mixed speech, which simplifies the speech recognition process and reduces the computational load. The invention also discloses an electronic device and a computer-readable storage medium.
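
The first step of the abstract, assembling training samples from a mixed speech signal and tag sequences, can be illustrated with a minimal sketch. It is not the patent's implementation: the names `TrainingSample`, `mix_signals` and `make_sample` are hypothetical, and sample-wise summation with zero-padding is an assumed way of forming a single-channel mixture from per-speaker waveforms.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class TrainingSample:
    """One training sample: a single-channel mixture plus one tag sequence per speaker."""
    mixed_signal: List[float]
    tag_sequences: List[List[str]]


def mix_signals(signals: Sequence[Sequence[float]]) -> List[float]:
    """Sum per-speaker waveforms sample by sample (assumed mixing rule).

    Shorter waveforms are treated as zero-padded to the longest one.
    """
    if not signals:
        raise ValueError("at least one speaker signal is required")
    mixed = [0.0] * max(len(s) for s in signals)
    for signal in signals:
        for i, value in enumerate(signal):
            mixed[i] += value
    return mixed


def make_sample(
    signals: Sequence[Sequence[float]],
    transcripts: Sequence[Sequence[str]],
) -> TrainingSample:
    """Pair the mixture with one tag sequence per speaker."""
    if len(signals) != len(transcripts):
        raise ValueError("need exactly one transcript per speaker signal")
    return TrainingSample(mix_signals(signals), [list(t) for t in transcripts])


# Toy data: two speakers with very short "waveforms" and token-level tags.
sample = make_sample(
    signals=[[0.5, -0.25, 0.0], [0.25, 0.25]],
    transcripts=[["hello", "world"], ["good", "morning"]],
)
print(sample.mixed_signal)
print(sample.tag_sequences)
```

In this sketch the model would receive only `mixed_signal` as input and be trained to emit one output sequence per entry in `tag_sequences`, which mirrors the abstract's point that no separate per-speaker signal is produced as an intermediate step.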

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition, in particular to an end-to-end speech recognition method, an electronic device and a computer-readable storage medium.

Background

[0002] Speech recognition, also known as Automatic Speech Recognition (ASR), converts input speech signals into corresponding text or command output through recognition and understanding, and is an important branch of modern artificial intelligence. With the rapid improvement of computer processing capabilities, speech recognition technology has developed considerably. It can effectively advance fields related to voice interaction, greatly facilitates people's lives, and is increasingly changing the way people work and live. As voice interaction methods develop, the requirements placed on speech recognition technology keep rising...

Application Information

IPC(8): G10L15/26, G10L15/06, G10L15/16, G06N3/04
CPC: G06N3/049, G10L15/063, G10L15/16, G10L15/26, Y02T10/40
Inventor: 贾雪丽, 程宁, 王健宗, 肖京
Owner PING AN TECH (SHENZHEN) CO LTD