A method and device for cross-language speech recognition

A cross-language speech recognition technology, applied in speech recognition, speech analysis, instruments, and related fields. It addresses the low accuracy, long training time, and high cost of existing approaches in order to achieve broad language support, a high recognition rate, and high accuracy.

Active Publication Date: 2021-09-24
AISPEECH CO LTD
Cites: 12. Cited by: 0.

AI Technical Summary

Problems solved by technology

[0003] In the process of realizing the present invention, the inventors found at least the following problems in the prior art:
(1) Training a language model for a language from scratch requires a large amount of manually labeled data, which is expensive and time-consuming to obtain. Building a separate language model for each language also hinders smooth recognition and increases the cost of recognizing mixed-language speech.
(2) Training takes a long time and requires a large investment of manpower and material resources. The range of other languages that can be recognized depends on the fields in which that investment is made, so overall support is relatively narrow. Similar pronunciations across languages also easily cause misrecognition and low accuracy.

Detailed Description of the Embodiments

[0033] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0034] Figure 1 is a schematic diagram of the main flow of the cross-language speech recognition method according to an embodiment of the present invention. As shown in Figure 1, the method includes:

[0035] Step S101: Obtain cross-language sample data, use the sample data as input data of a preset neural network model for training, and obtain a language class disc...
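The training idea in Step S101 can be illustrated with a deliberately small sketch. The excerpt does not say which network architecture or acoustic features are used, so everything here is an assumption: a single-layer softmax classifier stands in for the "preset neural network model", each sample is a hypothetical two-dimensional feature vector, and the names `train_discriminator` and `predict_language` are invented for illustration only.

```python
import math

def softmax(scores):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def class_scores(weights, biases, x):
    """Linear score per language class for one feature vector."""
    return [sum(w_j * x_j for w_j, x_j in zip(w, x)) + b
            for w, b in zip(weights, biases)]

def train_discriminator(samples, labels, num_classes, epochs=200, lr=0.5):
    """Train a single-layer softmax classifier with plain SGD.

    Assumption: this stands in for the unspecified neural network model.
    """
    dim = len(samples[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    biases = [0.0] * num_classes
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            probs = softmax(class_scores(weights, biases, x))
            for k in range(num_classes):
                # Gradient of cross-entropy loss w.r.t. the score of class k.
                grad = probs[k] - (1.0 if k == y else 0.0)
                for j in range(dim):
                    weights[k][j] -= lr * grad * x[j]
                biases[k] -= lr * grad
    return weights, biases

def predict_language(model, x):
    """Return the index of the highest-scoring language class."""
    weights, biases = model
    scores = class_scores(weights, biases, x)
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical cross-language sample data: class 0 and class 1 are two languages.
samples = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
labels = [0, 0, 1, 1]
model = train_discriminator(samples, labels, num_classes=2)
```

A real discriminator would operate on acoustic features extracted from audio frames and would likely use a deeper model; the sketch only shows the shape of the train-then-classify loop.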

Abstract

The invention discloses a method and device for cross-language speech recognition, and relates to the technical field of speech processing. A specific implementation of the method includes: obtaining cross-language sample data and using it as the input data of a preset neural network model for training, to obtain a language class discriminator; inputting the audio to be recognized into the language class discriminator and segmenting that audio according to the language classes the discriminator determines; and recognizing each audio segment with the recognition engine corresponding to its determined language class. This embodiment does not require modifying existing speech recognition engines, and offers low cost, a high recognition rate, and high accuracy.
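The segment-then-dispatch flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it assumes the discriminator has already produced one language label per audio frame, and the function names, the `engines` mapping, and the stub engines are all hypothetical.

```python
from itertools import groupby

def segment_by_language(frame_labels):
    """Merge consecutive frames that share a label into segments.

    Returns (language, start, end) tuples, with end exclusive.
    """
    segments = []
    index = 0
    for language, run in groupby(frame_labels):
        length = len(list(run))
        segments.append((language, index, index + length))
        index += length
    return segments

def recognize(frames, frame_labels, engines):
    """Send each single-language segment to the engine for that language.

    `engines` maps a language label to a callable; the engines themselves
    are left unmodified, which mirrors the abstract's stated advantage.
    """
    results = []
    for language, start, end in segment_by_language(frame_labels):
        engine = engines[language]
        results.append((language, engine(frames[start:end])))
    return results

# Hypothetical stand-ins for existing per-language recognition engines.
engines = {
    "zh": lambda segment: "zh-text({} frames)".format(len(segment)),
    "en": lambda segment: "en-text({} frames)".format(len(segment)),
}

frames = ["f0", "f1", "f2", "f3", "f4", "f5"]   # placeholder audio frames
frame_labels = ["zh", "zh", "en", "en", "en", "zh"]
results = recognize(frames, frame_labels, engines)
```

Keeping the routing separate from the engines means a new language only needs a new entry in the mapping and a discriminator that can emit its label.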

Description

Technical Field

[0001] The invention relates to the field of speech processing, in particular to a cross-language speech recognition method and device.

Background

[0002] Electronic equipment is becoming increasingly intelligent and integrated, and traditional information retrieval and menu operation methods are increasingly unable to meet requirements. There is an urgent need for a more convenient way to retrieve information and issue commands that can replace traditional button operation, and speech recognition technology arose to meet it. However, most traditional automatic speech recognition systems support only the most commonly used language of the country, with little or no support for other languages. The conventional approaches to this situation are: (1) Different languages are considered independently, and a language model is trained from scratch for each language. (2) In the acoustic model, the most commonly us...

Application Information

Patent Type & Authority: Patents (China)
IPC (8): G10L15/00, G10L15/05, G10L15/06, G10L15/16, G10L15/26
CPC: G10L15/005, G10L15/05, G10L15/063, G10L15/16, G10L15/26
Inventor: 朱森
Owner: AISPEECH CO LTD