Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition method, device and system

A speech recognition and to-be-recognized technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as immature speech recognition solutions

Pending Publication Date: 2019-05-28
ALIBABA GRP HLDG LTD
View PDF7 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, the speech recognition scheme for dialects is still immature, and a solution to the multi-dialect problem needs to be provided

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method, device and system
  • Speech recognition method, device and system
  • Speech recognition method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0109] In order to make the purpose, technical solution and advantages of the present application clearer, the technical solution of the present application will be clearly and completely described below in conjunction with specific embodiments of the present application and corresponding drawings. Apparently, the described embodiments are only some of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0110] In the prior art, the speech recognition scheme for dialects is still immature. For this technical problem, the embodiment of the present application provides a solution. The main idea of ​​the scheme is: construct ASR models for different dialects, , pre-identify the dialect to which the voice wake-up word belongs, and then select the ASR mode...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a speech recognition method, device and system. The method comprises the following steps: receiving a wake-on-voice word; identifying a first dialect to whichthe wake-on-voice word belongs; sending a service request to a server to request the server to select an ASR (Automatic Speech Recognition) model corresponding to the first dialect from ASR models corresponding to different dialects, so as to perform speech recognition on a speech signal to be identified by the server by using the ASR model corresponding to the first dialect. The method provided by the embodiment can automatically perform the speech recognition on multiple dialects and improve the speech recognition efficiency for multiple dialects.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition method, device and system. Background technique [0002] Automatic Speech Recognition (ASR) is a technology that can convert human voice audio signals into text content. With the development of software and hardware technology, the computing power and storage capacity of various smart devices have been greatly improved, making speech recognition technology widely used in smart devices. [0003] In speech recognition technology, speech phonemes need to be accurately recognized, and the speech phonemes based on accurate recognition can be converted into text. However, no matter what kind of language it is, there will be many different pronunciations of the language due to various factors, that is, multiple dialects. Taking Chinese as an example, there are Mandarin dialects, Jin dialects, Xiang dialects, Gan dialects, Wu dialects, Min dial...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/30G10L15/22G10L15/26G10L15/00
CPCG10L15/00G10L15/22G10L15/26G10L15/30
Inventor 牛也徐巍越冯伟国黄光远
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products