Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for training filter model and speech recognition method

A speech recognition and model technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as inability to recognize speech and inconsistency of sound signals

Active Publication Date: 2019-05-14
HUAWEI TECH CO LTD
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the prior art, the corresponding relationship between the syllable sequence and the character sequence in the language model is obtained based on the preset database training. In actual use, it may be affected by the environment and the user's pronunciation habits, etc. The sound signal of the voice in the database is inconsistent with the actual collected sound signal, resulting in the final failure to recognize the voice

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for training filter model and speech recognition method
  • Method for training filter model and speech recognition method
  • Method for training filter model and speech recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The technical solution in this application will be described below with reference to the accompanying drawings.

[0058] It should be understood that the division of methods, situations, categories and embodiments in the embodiments of the present application is only for the convenience of description and should not constitute a special limitation. The features in various methods, categories, situations and embodiments are not contradictory cases can be combined.

[0059] It should also be understood that "first", "second" and "third" in the embodiments of the application are only for distinction and shall not constitute any limitation to the present application.

[0060] It should also be understood that in various embodiments of the present application, the sequence numbers of the processes do not mean the order of execution, and the execution order of the processes should be determined by their functions and internal logic, and should not be used in the embodiments o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The application provides a method and device for training a filter model. The method comprises the following steps: determining N original syllables, wherein the N original syllables are syllables included in the actual pronunciation of a first corpus; determining N recognition syllables, wherein the N recognition syllables are syllables of a recognition result obtained by first speech recognitionprocessing of a sound signal of the first corpus, and the first speech recognition processing includes filter processing based on the filter model and recognition processing based on a speech recognition engine; determining N syllable distances according to the N original syllables and the N recognition syllables, wherein the N syllable distances correspond to the N syllables one to one, and theN original syllables and the N recognition syllables constitute N syllable pairs, each syllable pair includes one original syllable and one recognition syllable which correspond to each other; each syllable distance is used for indicating the similarity between the original syllable and the recognition syllable included in the corresponding syllable pair. Thus, the recognition accuracy of the speech recognition engine can be improved.

Description

technical field [0001] The present application relates to the technical field of speech recognition, and more specifically, to a method for training a filtering model, a method for speech recognition, a training device, a speech recognition device, and a speech recognition system. Background technique [0002] Automatic speech recognition (Automatic Speech Recognition, ASR) is a key technology of the voice interaction system, and the ASR engine (also called a speech recognition engine) is responsible for converting speech signals into text. figure 1 It is a schematic diagram of an example of speech recognition performed by an ASR engine. Such as figure 1 Said, the sound is collected through the sound pickup device, and the obtained speech signal is converted into a syllable sequence (for example, the initial and final sequence in Chinese Pinyin) by the acoustic model after the feature module extracts (frequency domain) features. Then, a character sequence (for example, a C...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06
CPCG10L15/16G10L15/065G10L15/20G10L15/01G10L15/02G10L15/063G10L15/187G10L2015/027
Inventor 聂为然于海
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products