Method of converting phonetic file into text file

A speech-file-to-text-file technology, applied in speech analysis, speech recognition, instruments, etc. It addresses problems such as impractical recognizer training, low recognition rates, and speech that is difficult for speech recognizers to recognize, and it improves efficiency of use.

Inactive Publication Date: 2002-09-25

苏州孔雀电器集团有限责任公司

1 Cites, 12 Cited by

AI Technical Summary
Problems solved by technology

This method is usually used in the field of speech input. When used as an input method, the user can train the speech recognizer in advance. [...] etc.), it is unrealistic to ask the speaker to train the speech recognizer. Moreover, because Chinese has many dialects, a speaker often has a heavy local accent even when speaking Mandarin, which makes accurate recognition difficult for an untrained standard speech recognizer. At the same time, even if the speech recognizer is built for a particular dialect, Chinese dialects vary greatly by region. Taking southern Jiangsu as an example, the accents of the neighboring cities of Suzhou and Wuxi differ, the accents of Suzhou City and its subordinate county-level cities also differ, and even within Wuzhong District of Suzhou there are many dialects. An untrained dialect speech recognizer therefore cannot accurately recognize the dialects of neighboring regions.

[0005] Therefore, the existing speech recognizer training method cannot be used to recognize and convert speech files. Even if the bundled standard recognizer is used as a fallback, the recognition rate is very low and cannot meet practical requirements.

Examples

Embodiment 1

[0028] Embodiment 1: As shown in Figures 1 and 2, a method for converting a speech file into a text file comprises the following steps:

[0029] (1) Obtain the speech file to be converted and play it with a speech player. The played segment contains 50 to 250 words, and the playback position is specified by the user. The speech player uses speed-adjustable playback software, and the user adjusts the playback speed to match the speed at which the user enters the corresponding text;

[0030] (2) The user listens to the speech and inputs the corresponding text, yielding the speech file and the corresponding text file used for training. The user may input the text with a keyboard or with a handwriting tablet;

[0031] (3) Using the training files obtained in step (2), apply speech adaptation technology to a basic speech recognizer that has a speech recognizer library in order to re-estimate the speech parameters;

[0032] (4)...
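Steps (1) and (2) above can be illustrated with a minimal sketch. It assumes that speech rate and typing speed are both measured in words per minute and that the typed transcript is whitespace-delimited; the function names `playback_rate` and `segment_is_valid` are hypothetical and do not come from the patent. For Chinese text, counting characters instead of whitespace-separated words would be the more natural choice.

```python
def playback_rate(speech_wpm: float, typing_wpm: float) -> float:
    """Return a playback-speed multiplier so that the speech is heard
    at the same pace at which the user types (an illustrative assumption).

    A result of 0.25 means "play at one quarter of the original speed".
    """
    if speech_wpm <= 0 or typing_wpm <= 0:
        raise ValueError("rates must be positive")
    return typing_wpm / speech_wpm


def segment_is_valid(transcript: str, min_words: int = 50, max_words: int = 250) -> bool:
    """Check that a typed transcript falls within the 50 to 250 word
    range given in step (1). Words are split on whitespace here."""
    word_count = len(transcript.split())
    return min_words <= word_count <= max_words


if __name__ == "__main__":
    # A speaker at 200 words/min and a typist at 50 words/min.
    print(playback_rate(200.0, 50.0))
    print(segment_is_valid(" ".join(["word"] * 60)))
```

In practice the user would adjust the speed interactively, as the text describes; the ratio above is only one plausible starting value.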

Embodiment 2

[0034] Embodiment 2: A method for converting a speech file into a text file comprises the following steps:

[0035] (1) Obtain the speech file to be converted and play it with a speech player. The played segment must contain at least 50 characters;

[0036] (2) The user listens to the speech and inputs the corresponding text, yielding the speech file and the corresponding text file for training;

[0037] (3) Using the training files obtained in step (2), apply speech adaptation technology to a basic speech recognizer that has a speech recognizer library in order to re-estimate the speech parameters. The basic speech recognizer has 6 speech recognizer libraries: the standard Mandarin database, Mandarin database, Wu language database, Sichuan language database, Cantonese database, and Hokkien language database. First, select a relatively close speech recognizer database based on the training file, and then use speech adaptive technology for the speech rec...
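The library-selection idea in step (3) can be sketched as follows. The patent excerpt does not say how "relatively close" is measured, so this sketch assumes one plausible criterion: run each library's recognizer on the training audio and pick the one whose output has the lowest character error rate against the user's transcript. The recognizers are modeled as hypothetical callables, and `select_library` is an illustrative name, not part of the patent.

```python
from typing import Callable, Dict


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def char_error_rate(hypothesis: str, reference: str) -> float:
    """Edit distance normalized by the reference length."""
    if not reference:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(hypothesis, reference) / len(reference)


def select_library(recognizers: Dict[str, Callable[[bytes], str]],
                   audio: bytes, reference: str) -> str:
    """Return the name of the library whose recognizer output is closest
    to the user-typed reference transcript (assumed selection criterion)."""
    return min(recognizers,
               key=lambda name: char_error_rate(recognizers[name](audio), reference))


if __name__ == "__main__":
    # Stub recognizers standing in for two of the six libraries.
    stubs = {
        "wu": lambda audio: "hello world",
        "cantonese": lambda audio: "jello word",
    }
    print(select_library(stubs, b"", "hello world"))
```

The selected library would then serve as the starting point for adaptation, so a closer starting point means less training data is needed to reach a usable recognizer.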

Abstract

The method for converting a phonetic (speech) file into a text file includes the following steps: obtain the speech file to be converted and play back, in a player, a segment containing at least 50 characters; the user recognizes the speech and inputs the corresponding text, yielding a speech file and a corresponding text file for training; using the obtained training files and a basic speech recognizer that has a speech recognition library, re-estimate the speech parameters by means of speech adaptation technology; and obtain a speech recognizer specific to the recorded speaker, which recognizes the speech file and converts it into a text file.

Description

Technical field

[0001] The invention relates to a speech recognition method, in particular to an adaptive speech recognition method that can directly process and recognize speech files and convert them into text files.

Background technique

[0002] The wide application of computers has promoted progress in speech recognition research. Especially in the past two decades, with the introduction and gradual engineering of Hidden Markov Model (HMM) theory, some of the speech recognition systems in China that researchers have built using the Hidden Markov Model have entered commercial application. A speech recognition system for commercial application usually includes a speaker-independent basic speech recognizer. Because pronunciation differs considerably between users, the basic speech recognizer must be trained: speech adaptive technology is used to re-estimate the speech parameters of a specific user to obtain a...
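The re-estimation of speech parameters for a specific user can be made concrete with a small sketch. The excerpt does not name the adaptation technique, so this example substitutes one well-known approach, maximum a posteriori (MAP) adaptation of a Gaussian mean. It assumes, for simplicity, that every training frame has already been assigned to a single Gaussian of the speaker-independent model; a real HMM system would weight frames by state occupation probabilities. The name `map_adapt_mean` and the prior weight `tau` are illustrative assumptions.

```python
from typing import List, Sequence


def map_adapt_mean(prior_mean: Sequence[float],
                   frames: Sequence[Sequence[float]],
                   tau: float = 10.0) -> List[float]:
    """MAP re-estimate of a Gaussian mean vector.

    new_mean = (tau * prior_mean + sum(frames)) / (tau + n)

    With few frames the result stays near the speaker-independent prior;
    with many frames it approaches the speaker's own sample mean.
    """
    if tau <= 0:
        raise ValueError("tau must be positive")
    dim = len(prior_mean)
    sums = [0.0] * dim
    for frame in frames:
        if len(frame) != dim:
            raise ValueError("frame dimension does not match the prior mean")
        for d, value in enumerate(frame):
            sums[d] += value
    n = len(frames)
    return [(tau * m + s) / (tau + n) for m, s in zip(prior_mean, sums)]


if __name__ == "__main__":
    # Ten one-dimensional frames at 1.0 pull a prior mean of 0.0 halfway.
    print(map_adapt_mean([0.0], [[1.0]] * 10, tau=10.0))
```

This behavior matches the motivation in the text: a short user-typed transcript provides only a limited amount of adaptation data, and the prior keeps the adapted model stable while it shifts toward the speaker.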

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L15/26

Inventor: 倪苏平, 丁祁正

Owner: 苏州孔雀电器集团有限责任公司
