
Speech recognition method, device and system

A speech recognition technology, applied in the fields of speech recognition, speech analysis, and instruments, that addresses problems such as the inability to recognize mixed-language speech.

Pending Publication Date: 2020-11-10
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0006] The embodiments of the present application provide a speech recognition method, device and system that at least solve the technical problem in the related art that only speech in a specific language can be recognized, while mixed speech cannot.



Examples


Embodiment 1

[0033] According to an embodiment of the present application, an embodiment of a speech recognition method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system, for example as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order.

[0034] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Figure 3 shows a block diagram of the hardware structure of a computer terminal (or mobile device) for implementing the speech recognition method. As shown in Figure 3, the computer terminal 10 (or mobile device 10) may include one or more processors 102 (shown as 102a, 102b, ..., 102n in the figure) (the processor 102 may include...

Embodiment 2

[0069] According to an embodiment of the present application, a speech recognition method is also provided. As shown in Figure 10, the method includes the following steps:

[0070] Step S1002, inputting a voice to be recognized, wherein the voice to be recognized is voice data including at least one language.

[0071] Optionally, the user can input the voice to be recognized into an intelligent interactive device, that is, a device capable of interacting through voice, for example, a subway ticket machine, a voice dialogue robot, or a voice assistant. The intelligent interactive device has a voice collection device, which may be, but is not limited to, a microphone.

[0072] It should be noted that the voice to be recognized may include one language (for example, Chinese) or multiple languages. Optionally, when the speech to be recognized only includes one languag...

Embodiment 3

[0081] According to an embodiment of the present application, a speech recognition system for implementing the above speech recognition method is also provided. As shown in Figure 11, the system includes: an input unit 1101, a recognition unit 1103 and an output unit 1105.

[0082] The input unit 1101 is used to obtain the speech to be recognized, wherein the speech to be recognized is speech data including at least one language. The recognition unit 1103 is used to recognize the speech to be recognized based on the recognition model and obtain the recognition result, wherein the recognition model includes at least a mixed acoustic model, a mixed language model and a mixed dictionary; the mixed acoustic model includes acoustic models of multiple languages, the mixed language model includes language models of multiple languages, and the mixed dictionary includes dictionaries of multiple languages. The output unit 1105 is used to output feedback information corresp...
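
The three units and the recognition model described in Embodiment 3 can be pictured with a minimal Python sketch. Everything here is an illustrative assumption rather than the applicant's implementation: the class names (`MixedDictionary`, `RecognitionModel`, `SpeechRecognitionSystem`), the per-language dictionaries keyed by language code, the phone symbols, and the stub callables standing in for the real units are all hypothetical. The sketch only shows how a model that holds components for several languages might sit between an input unit and an output unit.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical types: the source names the components but not their form.
Pronunciation = List[str]
Lexicon = Dict[str, Pronunciation]


@dataclass
class MixedDictionary:
    """Mixed dictionary: holds one pronunciation dictionary per language."""
    lexicons: Dict[str, Lexicon]

    def lookup(self, word: str) -> Optional[Tuple[str, Pronunciation]]:
        """Return (language, pronunciation) from the first lexicon containing word."""
        for language, lexicon in self.lexicons.items():
            if word in lexicon:
                return language, lexicon[word]
        return None


@dataclass
class RecognitionModel:
    """Recognition model with the three mixed components named in [0082]."""
    acoustic_models: Dict[str, object]  # mixed acoustic model, one entry per language
    language_models: Dict[str, object]  # mixed language model, one entry per language
    dictionary: MixedDictionary         # mixed dictionary

    def languages(self) -> List[str]:
        """Languages covered by the mixed dictionary, in sorted order."""
        return sorted(self.dictionary.lexicons)


@dataclass
class SpeechRecognitionSystem:
    """Wires together stand-ins for units 1101, 1103 and 1105."""
    acquire: Callable[[], bytes]                         # input unit 1101
    recognize: Callable[[bytes, RecognitionModel], str]  # recognition unit 1103
    respond: Callable[[str], str]                        # output unit 1105
    model: RecognitionModel

    def run(self) -> str:
        speech = self.acquire()
        result = self.recognize(speech, self.model)
        return self.respond(result)


# Usage with placeholder data; the lambdas are stubs, not real recognizers.
model = RecognitionModel(
    acoustic_models={"zh": object(), "en": object()},
    language_models={"zh": object(), "en": object()},
    dictionary=MixedDictionary({
        "zh": {"你好": ["n", "i", "h", "ao"]},
        "en": {"hello": ["HH", "AH", "L", "OW"]},
    }),
)
system = SpeechRecognitionSystem(
    acquire=lambda: b"\x00\x01",
    recognize=lambda speech, m: "你好 hello",
    respond=lambda text: f"recognized: {text}",
    model=model,
)
print(system.run())
```

Keeping the per-language components inside a single model object means the recognition unit receives one model regardless of which languages occur in the input, which matches the stated goal of handling mixed speech with one system.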


Abstract

The invention discloses a speech recognition method, device and system. The method comprises the steps that speech to be recognized is acquired, wherein the speech to be recognized is speech data containing at least one language; the speech to be recognized is recognized based on a recognition model, and a recognition result is obtained, wherein the recognition model at least comprises a mixed acoustic model, a mixed language model and a mixed dictionary, the mixed acoustic model comprises acoustic models of multiple languages, the mixed language model comprises language models of multiple languages, and the mixed dictionary comprises dictionaries of multiple languages. According to the invention, the technical problem that only the speech of a specific language can be recognized and the mixed speech cannot be recognized in the prior art is solved.

Description

Technical field

[0001] The present application relates to the field of speech recognition, and in particular to a speech recognition method, device and system.

Background

[0002] With the rapid development of the Internet and the widespread adoption of smart mobile terminals, speech recognition technology has been widely used in people's work, life and study, for example in voice chat robots, voice assistants and related interactive tools. These devices usually obtain the user's instruction by recognizing the user's voice, and then perform the action corresponding to the recognized instruction.

[0003] However, different countries use different languages, and different regions of the same country also use various dialects. The existing technology needs to use collected data to train a separate recognition system for each language, usually comprising a dedicated acoustic model, language model, decoder and pronunciation dictionary, such a...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/06, G10L15/22
CPC: G10L15/06, G10L15/22
Inventors: 张仕良, 刘媛, 雷鸣, 李威
Owner: ALIBABA GRP HLDG LTD