Multilingual speech recognition model training method and device thereof, equipment and storage medium

A speech recognition model training method, applicable to speech recognition, speech analysis, instruments, and related fields, that addresses the low efficiency of model training.

Pending Publication Date: 2020-10-27
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] The purpose of the embodiments of the present application is to propose a multilingual speech recognition model training method, device, computer equipment and storage

Embodiment Construction

[0028] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field of the application. The terms used in the description of the application are only for the purpose of describing specific embodiments and are not intended to limit the present application. The terms "comprising" and "having" and any variations thereof in the specification and claims of the present application and in the description of the above drawings are intended to cover non-exclusive inclusion. The terms "first", "second" and the like in the description and claims of the present application or the above drawings are used to distinguish different objects, rather than to describe a specific order.

[0029] Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The occurrenc...

Abstract

The invention discloses a multilingual speech recognition model training method and relates to the field of artificial intelligence. The method comprises the steps of: training a speech recognition model on a first language to obtain an initial speech recognition model; building an adaptive network function and embedding it into a hidden layer of the initial speech recognition model to obtain an initial multilingual speech recognition model; training the initial multilingual speech recognition model on speech data of a second language to obtain a training result; and iteratively updating the initial multilingual speech recognition model until the training result falls within a preset standard training result range, then outputting the multilingual speech recognition model. The invention also relates to blockchain technology: the speech data of the first language and the speech data of the second language can be stored in a blockchain. Because the adaptive network function is embedded into the hidden layer of the initial speech recognition model, the training efficiency of the multilingual speech recognition model can be improved.
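The four steps in the abstract can be illustrated with a toy sketch. Everything specific here is an assumption made for illustration, not the patent's implementation: the "adaptive network function" is modelled as a residual adapter (h + A·h) placed on the hidden layer, the base parameters are frozen during second-language training, a tiny regression task stands in for speech data, and a mean-squared-error threshold stands in for the "preset standard training result range". All names (`features`, `adapter`, `fit`, `make_data`) are hypothetical.

```python
import math

def features(x):
    # Hypothetical fixed hidden layer (stand-in for the model's lower layers).
    return [math.tanh(v) for v in x]

def adapter(A, h):
    # Assumed adapter form: residual map h + A h. With A = 0 it is the identity.
    return [hj + sum(a * hk for a, hk in zip(row, h)) for hj, row in zip(h, A)]

def predict(w, A, x):
    # Output layer applied to the adapted hidden activations.
    return sum(wj * hj for wj, hj in zip(w, adapter(A, features(x))))

def loss(w, A, data):
    # Mean squared error over a dataset of (input, target) pairs.
    return sum((predict(w, A, x) - t) ** 2 for x, t in data) / len(data)

def fit(w, A, data, update, lr=0.1, target=1e-4, max_epochs=500):
    """Run per-sample SGD until the loss falls within [0, target].

    update="base" trains the output weights w; update="adapter" trains
    only the adapter matrix A and leaves w untouched (assumed freezing).
    """
    for _ in range(max_epochs):
        if loss(w, A, data) <= target:
            break
        for x, t in data:
            h = features(x)
            err = predict(w, A, x) - t
            if update == "base":
                h_out = adapter(A, h)
                for j in range(len(w)):
                    w[j] -= lr * 2 * err * h_out[j]
            else:
                for j in range(len(A)):
                    for k in range(len(h)):
                        A[j][k] -= lr * 2 * err * w[j] * h[k]
    return loss(w, A, data)

# Toy inputs; targets for each "language" come from a different linear readout.
X = [(1, 0), (0, 1), (1, 1), (-1, 1)]

def make_data(v):
    return [(x, sum(vi * hi for vi, hi in zip(v, features(x)))) for x in X]

lang1 = make_data([1.0, -0.5])   # stand-in for first-language data
lang2 = make_data([-0.8, 1.2])   # stand-in for second-language data

# Step 1: train the initial model on the first language (adapter inactive).
w = [0.0, 0.0]
A = [[0.0, 0.0], [0.0, 0.0]]
loss1 = fit(w, A, lang1, "base")

# Step 2: the zero-initialised adapter is now "embedded"; the model is unchanged.
base_w = list(w)
loss2_before = loss(w, A, lang2)

# Steps 3 and 4: train on the second language, iterating until the loss is in range.
loss2_after = fit(w, A, lang2, "adapter")
```

Initialising the adapter to zero means the initial multilingual model reproduces the first-language model exactly, so second-language training starts from the already learned parameters rather than from scratch. Whether the patent freezes the base parameters in this way is not stated in the text available here.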

Description

Technical field

[0001] The present application relates to the technical field of artificial intelligence, and in particular to a multilingual speech recognition model training method, device, equipment and storage medium.

Background

[0002] At present, speech recognition technology is quite mature. Research by some speech recognition institutions has brought the recognition accuracy of speech recognition models to 94.5%, which is said to match human auditory perception. However, such high-performing speech recognition models are limited to a few widely used languages, such as English and French. More than 5,000 languages are in use around the world, but only 10 of them are widely used: Chinese, English, Russian, Spanish, Hindi, Arabic, Portuguese, Bengali, German and Japanese. For other languages, due to the small number of users, it is difficult to collect ...

Application Information

IPC(8): G10L15/00, G10L15/02, G10L15/06, G10L15/16
CPC: G10L15/005, G10L15/063, G10L15/02, G10L15/16, G10L2015/025
Inventor 郑振鹏, 王健宗, 罗剑, 程宁
Owner PING AN TECH (SHENZHEN) CO LTD