Speech recognition method and device, and related system and equipment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and corresponding relationship technology, applied in the field of data processing, can solve problems such as the inability to correctly recognize multilingual mixed speech, achieve high speech recognition efficiency, and reduce the number of entries

Pending Publication Date: 2021-05-25

ALIBABA GRP HLDG LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] This application provides a voice interaction system to solve the problem that the existing technology cannot correctly recognize multilingual mixed voices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

no. 1 example

[0220] Please refer to figure 1 , which is a flowchart of an embodiment of the speech recognition method of the present application. The execution body of the method is a speech recognition device, which is usually deployed at the server end, but is not limited to the server end, and can also be any device capable of implementing the speech recognition method. The voice recognition method provided in this embodiment includes:

[0221] Step S101: Construct a first correspondence set between words in the first language and pronunciations in the first language, a second correspondence set between words in the second language and pronunciations in the second language, words in the first language and at least one second language a third set of correspondences between words; and, constructing a language model of the first language.

[0222] The first language can be any language, such as Chinese, English or French. The second language is a language other than the first language. ...

no. 2 example

[0275] Please refer to image 3 , which is a schematic diagram of an embodiment of a speech recognition device provided in the present application. The parts in this embodiment that are the same as those in the first embodiment will not be described again. Please refer to the corresponding parts in Embodiment 1. A speech recognition device provided by the application includes:

[0276] Thesaurus construction unit 301 is used to construct the first correspondence relation set between the first language word and the first language pronunciation, the second correspondence relation set between the second language word and the second language pronunciation, the first language word and the first language pronunciation a third correspondence set between at least one second language word;

[0277] A language model construction unit 302, configured to construct a language model of the first language;

[0278] Pronunciation unit determination unit 303, used to determine the candidate ...

no. 3 example

[0283] Please refer to Figure 4 , which is a schematic diagram of an electronic device embodiment of the present application. Since the device embodiment is basically similar to the method embodiment, the description is relatively simple, and for related parts, please refer to part of the description of the method embodiment. The device embodiments described below are illustrative only.

[0284] An electronic device in this embodiment, the electronic device includes: a processor 401 and a memory 402; the memory is used to store a program for realizing the voice recognition method, and after the device is powered on and runs the program of the voice recognition method through the processor , perform the following steps: construct the first correspondence relation set between the first language word and the first language pronunciation, the second correspondence relation set between the second language word and the second language pronunciation, the first language word and at ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech recognition method and device, a related system and equipment, and a lexicon construction method, device and equipment. The speech recognition method comprises the following steps: determining a candidate pronunciation unit sequence of multilingual mixed speech data through a multilingual acoustic model; according to the first corresponding relation set, the second corresponding relation set and the third corresponding relation set, determining a first language text corresponding to a second language pronunciation unit in the candidate pronunciation unit sequence, and forming a candidate first language text sequence of the voice data; determining a first language score of the candidate first language text sequence through a language model of the first language; and according to the first language score and the third corresponding relation set, determining a multi-language mixed text sequence corresponding to the voice data. By adopting the processing mode, multilingual mixed reading speech recognition is carried out in a first language space decoding mode; therefore, the accuracy of multilingual mixed speech recognition can be effectively improved.

Description

technical field [0001] This application relates to the technical field of data processing, specifically to voice interaction systems, methods and devices, voice transcription systems, methods and devices, voice recognition methods and devices, lexicon construction methods and devices, ordering equipment, smart speakers, terminal equipment, and electronic equipment. Background technique [0002] With the advent of the era of artificial intelligence, a significant change is that more and more smart Internet of Things (IoT) devices appear in daily life, such as smart speakers, smart TVs, subway voice ticket machines, ordering machines and so on. The emergence of smart IoT devices greatly facilitates people's daily life, but also raises a question: how to interact with these devices more conveniently. Voice interaction is the most convenient way of interaction between people, so you can also choose voice interaction for how to interact with IoT devices. [0003] For an intelli...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/00G10L15/183G10L15/26G10L25/51

CPCG10L15/005G10L15/183G10L15/26G10L25/51

Inventor 张仕良刘媛雷鸣

Owner ALIBABA GRP HLDG LTD

Speech recognition method and device, and related system and equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

no. 1 example

no. 2 example

no. 3 example

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology