Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method, system and platform for visual generation of speech recognition network

A speech recognition and network technology, applied in speech recognition, speech analysis, character and pattern recognition, etc., can solve problems such as poor visibility, inconvenient version management, uncontrollable customization process, etc., to improve accuracy and efficiency, and speed up training Speed, the effect of shortening the product cycle

Active Publication Date: 2021-09-17
AISPEECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] It can be seen from this that the visual speech recognition network used in speech recognition in the prior art has uncontrollable customization processes during the generation process, and the version is not easy to manage, and cannot satisfy multi-task language model training
At the same time, the visibility in model making is poor, and it is not convenient for multiple users to edit at the same time, thereby reducing the efficiency and accuracy of speech recognition model generation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, system and platform for visual generation of speech recognition network
  • Method, system and platform for visual generation of speech recognition network
  • Method, system and platform for visual generation of speech recognition network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0044] On the one hand, the present invention improves the visual generation method of the speech recognition network, and the method can run on the Web side, such as figure 1 As shown, the visual generation method of the speech recognition network in the present invention includes:

[0045] Step S101, acquiring keywords and general fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for generating visualization of a speech recognition network, and the method includes: receiving keywords through a human-computer interaction interface. The current domain field is selected from a plurality of preset general domain fields, and each general domain field corresponds to a plurality of preset crawlers and corresponds to a plurality of preset web crawling pages. Get a general corpus. Get a specific corpus. Train the general corpus to obtain the general language model and the specific language model. After the WFST speech recognition network of the general language model and the WFST speech recognition network of the specific language model are connected in parallel, combined with the acoustic model and pronunciation dictionary, the WFST speech recognition network is synthesized through combination, determinization, and minimization operations. By configuring the system on the same platform, the training speed of the language model is accelerated, the product cycle is shortened, the labor consumption is shortened, and the labor cost is saved. At the same time, through the combination of general language model network and specific language model, the accuracy and efficiency of language recognition can be improved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and in particular relates to a visual generation method, system and platform of a speech recognition network. Background technique [0002] At present, there are few relevant visual language model making systems on the market, and most language model making systems are customized at the command line level. The production of language models plays a pivotal role in speech recognition. Each speech company has its own team responsible for the model, but most of them are produced under the command line. In the prior art, the model customization process under the command line is uncontrollable, the version management is not easy, the risk is uncontrollable, and the process is not simplified enough. The reason for the above-mentioned defects is that the model is trained by manually inputting commands with various scripts under the command line. Manual training under the command line lacks c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/9535G06F40/30G06K9/62G10L15/06G10L15/28
CPCG06F16/9535G06F16/3344G10L15/063G10L15/28G10L2015/0638G06F18/214
Inventor 王雪志
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products