Multimodal disambiguation of speech recognition
A speech recognition, non-sound technology, applied in the field of speech recognition, can solve the problems of reducing the accuracy of handwriting recognition, slow handwriting typing, and large differences in handwriting styles.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0063] The present invention provides a device and method for intelligent editing of speech recognition output, which can provide the most likely choice or hypothesis (hypotheses) according to the user's input. The speech recognition engine scores alternative hypotheses that add numerical values to the information presented to the user. For example, if speech recognition provides the user with a wrong first choice hypothesis, the user would want to obtain other N-best hypotheses to modify the hypothesis returned by the recognizer. In a multimodal environment, a list of N best hypotheses from the speech recognition output can be obtained. Specifically, the list of the N best hypotheses is added to the current text menu for easy editing.
[0064] One embodiment of the invention uses both acoustic information and textual context in providing the N best hypotheses. This can be syntactically dependent or independent. That is, a language model may provide grammatical informatio...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com