Speech recognition method and system, medium, computer equipment, terminal and application

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech technology, applied in the fields of media, terminals and applications, systems, speech recognition methods, and computer equipment, can solve the problems of random pauses, easy conflicts, and difficulties in accurate speech recognition, and achieve the effect of optimizing and improving the effect.

Active Publication Date: 2021-04-27

HARBIN INST OF TECH AT WEIHAI

View PDF16 Cites 5 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the construction of the rule method is often cumbersome and prone to conflicts. Therefore, the data-driven sequence-to-sequence method can combine semantic information for standardized transcription, which is an important direction for current research to solve the problem of text standardization.

[0006] Through the above analysis, the existing problems and defects of the existing technology are as follows: In text standardization, the hybrid model based on deep learning and rules has been applied very well.

However, the construction of the rule method is often cumbersome and prone to conflicts. Therefore, the data-driven sequence-to-sequence method can combine semantic information for standardized transcription, which is an important direction for current research to solve text standardization problems.

[0007] The difficulty in solving the above problems and defects is: the lack of audio data related to the medical field for the customized development of the recognition system, it is very difficult to achieve accurate speech recognition in the professional field; the results of speech recognition in the medical field lack punctuation information, pauses are random and Multi-subject crossing is frequent, adding punctuation marks that represent sentence boundary information makes it difficult to construct a punctuation prediction model; recognition in the medical field involves many complex forms such as dates, numbers, and unit symbols, and converts audio in the form of reading aloud into text in the form of reading to construct text Denormalizing the model is more difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0103] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0104] Aiming at the problems existing in the prior art, the present invention provides a speech recognition method, system, medium, computer equipment, terminal and application. The present invention will be described in detail below in conjunction with the accompanying drawings.

[0105] Such as figure 1 As shown, the speech recognition method provided by the present invention comprises the following steps:

[0106] S101: Recognize speech in the medical field by performing language model modeling based on text data in the medical field;

[0107] S102: Perform post-processing on the recognized text to obtain the reading ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of voice information processing, and discloses a voice recognition method and system, a medium, computer equipment, a terminal and an application thereof, and the method comprises the steps: carrying out the language model modeling based on the text data of a medical field, and recognizing the voice of the medical field; performing post-processing on the recognized text to obtain a reading form, wherein the reading form comprises two post-processing tasks of punctuation prediction and text inverse standardization; adding punctuation marks representing sentence boundary information to the punctuation prediction recognition text, and transliterating the symbol pronunciation text in the reading form in the recognition text into the symbol form in the reading form through text inverse standardization. According to the invention, analysis and design are carried out according to requirements of a voice recognition system in the medical field, and a web-based website system is developed. The two functional forms of universal input and template input meet the actual input form requirements of voice recognition. The voice recognition and post-processing technology is used as system support, and a software system which is excellent in performance and user-friendly is completed.

Description

technical field [0001] The invention belongs to the technical field of speech information processing, and in particular relates to a speech recognition method, system, medium, computer equipment, terminal and application. Background technique [0002] At present: Speech recognition technology has been developed for more than 60 years from the isolated digital recognition system for specific speakers, and the technical solution has gradually developed from the initial pattern matching method to today's statistical-based model. From the mid-1990s to the beginning of this century, the development of a hybrid speech recognition framework based on Gaussian Mixture Model (GMM) and Hidden Markov Model (HMM) has enabled large-scale continuous speech recognition systems. development possible. Since the beginning of this century, with the vigorous development of deep learning, speech recognition technology based on deep learning has become the mainstream. Before 2015, speech recogni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/26G10L15/14G10L15/16G10L15/06G10L15/18G10L15/30H04L29/08H04L29/06

CPCG10L15/26G10L15/148G10L15/144G10L15/16G10L15/063G10L15/1815G10L15/30H04L67/02H04L67/133

Inventor 胡鑫涂志莹李春山李政佐赵云龙初佃辉

Owner HARBIN INST OF TECH AT WEIHAI

Speech recognition method and system, medium, computer equipment, terminal and application

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology