Supercharge Your Innovation With Domain-Expert AI Agents!

Language modeling for voice recognition by use of knowledge graph

A technology of knowledge graph and language model, applied in speech recognition, speech analysis, natural language data processing, etc., can solve difficult problems

Inactive Publication Date: 2016-12-07
MICROSOFT TECH LICENSING LLC
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although modern commercial speech recognition systems try to add sufficient and large training corpora as much as possible, due to constraints such as training performance, model size, and costs (financial and resource, etc.) for collecting and maintaining a training corpus containing named entities Considering that it is difficult to update the training corpus to collect all possible entity information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Language modeling for voice recognition by use of knowledge graph
  • Language modeling for voice recognition by use of knowledge graph
  • Language modeling for voice recognition by use of knowledge graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] Examples of the present disclosure describe utilizing knowledge graphs to enhance statistical language modeling for online speech recognition systems / services to more correctly recognize named entities. A speech recognition system / service applies at least two models to evaluate input, namely 1) an acoustic model, and 2) a language model. An input is any data provided to and processed by a processing device. While the examples may describe input as relating to speech recognition processing, those skilled in the art will recognize that the examples described herein are applicable to any type of input, including but not limited to: speech / sound input, text input, gesture input , and examples such as handwriting input. An acoustic model is used to evaluate the pronunciation of utterances. Voice modeling processing is used to model received utterances. The output of the acoustic model is a lattice for each utterance. Lattice is an intermediate probability model in the fo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to language modeling for voice recognition by use of a knowledge graph. One unrestricted embodiment of the invention discloses language model processing on a received speech by use of a combined language model. The combined language model is applied to evaluating a transcription probability associated with the received speech. The combined language model evaluates naming entity data associated with the received speech by use of a position-based language model and an entity relation probability model. In at least one embodiment of the invention, a final probability that one or more transcriptions comprise the naming entity data is generated by evaluating the naming entity data of the received speech by use of at least one entity knowledge graph and query click log data associated with a naming entity of the knowledge graph. A result is output based on the final probability calculated through the combined language model. The output result comprises one or more transcriptions for ranking probabilities of candidate transcriptions on the basis of the combined voice model. Other embodiments are also disclosed.

Description

technical field [0001] The present invention relates to language models and, more particularly, to language modeling for speech recognition using knowledge graphs. Background technique [0002] Statistical language model is a classical model designed for speech recognition to estimate the prior probability of word strings. In speech recognition systems, the applied language models typically work with very large but limited training corpora. Although modern commercial speech recognition systems try to add sufficient and large training corpora as much as possible, due to constraints such as training performance, model size, and costs (financial and resource, etc.) for collecting and maintaining a training corpus containing named entities Considering that it is difficult to update the training corpus to collect all possible entity information. However, named entities are often present in speech recognition systems, especially spoken dialogue systems. Such entities are very l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06
CPCG06F40/295G10L15/1815G10L15/183
Inventor 朱卫武随明彭圣才
Owner MICROSOFT TECH LICENSING LLC
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More