End-to-end speech recognition method, electronic device and computer readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and electronic device technology, applied in speech recognition, speech analysis, biological neural network model, etc., can solve the problems of complicated process, inability to mix speech source input recognition, large processing capacity, etc., so as to reduce the amount of calculation and simplify the speech The effect of the identification process

Pending Publication Date: 2019-01-15

PING AN TECH (SHENZHEN) CO LTD

View PDF6 Cites 39 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The present invention provides an end-to-end speech recognition method, an electronic device, and a computer-readable storage medium to solve the problem that the existing speech recognition method and system have a large amount of processing for multi-speaker mixed speech input, the process is complicated, and it cannot target Mixing speech source input for direct recognition and outputting multiple independent pronunciation content questions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0054] Embodiments of the present invention will be described below with reference to the accompanying drawings. Those skilled in the art would recognize that the described embodiments can be modified in various ways or combinations thereof without departing from the spirit and scope of the invention. Accordingly, the drawings and description are illustrative in nature and not intended to limit the scope of the claims. Also, in this specification, the drawings are not drawn to scale, and like reference numerals denote like parts.

[0055] figure 1 It is a schematic flow chart of the end-to-end speech recognition method of the present invention, as figure 1 As shown, the present invention provides an end-to-end speech recognition method, the method is applied to an electronic device, and the electronic device can be implemented by software and / or hardware, and the end-to-end speech recognition method includes:

[0056] Step S1, obtaining a first mixed speech signal containin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of speech recognition and discloses an end-to-end speech recognition method. The method includes acquiring a first mixed speech signal of a plurality of speakers and tag sequences as training samples; building an Encoder-Decoder Architecture based neural network model; training the neural network model; acquiring a second mixed speech signal including aplurality of speakers to be recognized; inputting the second mixed speech signal into the trained neural network model to output text information corresponding to each speaker respectively. The invention outputs the pronunciation content corresponding to each speaker respectively for the mixed speech source input formed by the simultaneous vocalization of a plurality of speakers, and does not need to include an obvious speech segmentation stage to generate a plurality of independent outputs from the mono-channel mixed speech, thereby simplifying the speech recognition process and reducing thecomputational load. The invention also discloses an electronic device and a computer-readable storage medium.

Description

technical field [0001] The present invention relates to the technical field of voice recognition, in particular to an end-to-end voice recognition method, an electronic device and a computer-readable storage medium. Background technique [0002] Speech recognition, also known as Automatic Speech Recognition (ASR), can convert input speech signals into corresponding text or command output through recognition and understanding, and is an important branch of the development of modern artificial intelligence. With the rapid improvement of computer processing capabilities, speech recognition technology has also been greatly developed. Speech recognition technology can effectively promote the development of voice-activated interaction related fields and greatly facilitate people's lives. It is also increasingly changing human production and life. Way. With the development of voice interaction methods, the requirements for voice recognition technology are getting higher and higher...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/26G10L15/06G10L15/16G06N3/04

CPCG06N3/049G10L15/063G10L15/16G10L15/26Y02T10/40

Inventor 贾雪丽程宁王健宗肖京

Owner PING AN TECH (SHENZHEN) CO LTD

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

End-to-end speech recognition method, electronic device and computer readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology