Voice recognition method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech technology, applied in the field of data processing, can solve problems such as slow recognition response speed and mismatched recognition scenes, and achieve the effect of improving recognition efficiency

Pending Publication Date: 2020-11-03

BEIJING UNISOUND INFORMATION TECH +1

View PDF0 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the embodiments of the present invention is to provide a method and device for speech recognition to solve the problems of ASR recognition in the prior art that requires multiple language models to be preset, the recognition response speed is reduced, and the recognition scene does not match.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0042] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0043] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0044] figure 2 It is a schematic flow chart of the speech recognition method of the embodiment of the present invention, and the execution subject of the method is the intelligent outbound call platform. Such as figure 2 As shown, the speech recognition method includes ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice recognition method. The method comprises the following steps: adding scene classification information of an acoustic model; obtaining acoustic model output of to-be-tested voice under the scene classification information, and determining a scene meeting a condition; and dynamically loading the voice model corresponding to the scene meeting the condition to obtain a voice recognition result. By applying the voice recognition method provided by the embodiment of the invention, scene information is added to the acoustic model, the scene model meeting the condition is dynamically loaded, the limitation of the original preset scene model is removed, the recognition efficiency is improved, and a dynamic loading mode is adopted after the model numerical value of therecognition scene is set. Therefore, the numerical value is not changed due to the change of the service demand, so that the response speed is stabilized at the decoding speed of the model with the set numerical value.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a voice recognition method and device. Background technique [0002] Automatic Speech Recognition (ASR) consists of three parts: acoustic model, language model and decoder, as follows figure 1 shown. Among them, the acoustic model and the language model have their own training methods. The acoustic model uses speech data to train the sound-mapping pronunciation model; the language model uses text data to train the pronunciation-mapping text model. Generally, the language model will pre-train multiple models according to the usage scenarios. Use the scene to load the scene model that may be used; the two can be trained separately and in parallel; when using the ASR recognition project, it is currently necessary to manually set the boundary of the scene, that is, the acoustics need to be configured with a near-speaking scene or a far-speaking scene, and the language model ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/183G10L15/08

CPCG10L15/183G10L15/08

Inventor 李旭滨沈华东

Owner BEIJING UNISOUND INFORMATION TECH

Voice recognition method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology