Voice input method and system

A voice input technology, applied in voice input/output, voice analysis, and voice recognition, that addresses the cumbersome and user-unfriendly confirmation of recognition results.

Active Publication Date: 2013-10-23

SHANGHAI GUOKE ELECTRONICS

Cites: 12; Cited by: 39

AI Technical Summary

Problems solved by technology

The process of confirming voice recognition results is usually cumbersome and not friendly enough for users.

Examples

Embodiment 1

[0048] As shown in Figures 3-5, the present invention provides a voice input method, comprising:

[0049] Step S11: while recording, the input voice is continuously segmented into voice segments, and the text of each voice segment is generated. Specifically, the present invention can automatically segment the voice recognition results and return them segment by segment for secondary confirmation by the user. The cloud server can continuously divide the input voice into voice segments and generate the text of each voice segment; the division is performed by a voice endpoint detection algorithm. Voice endpoint detection accurately determines the start point and end point of speech in a signal that contains speech, and so distinguishes voice from non-voice signals. It is an important aspect of voice processing technology. For example, when the user continuously inputs voice, the cloud server c...
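The segmentation in step S11 can be illustrated with a minimal sketch of voice endpoint detection. The patent text shown here does not specify which endpoint detection algorithm is used, so the short-time energy threshold below is only an assumed, simple stand-in. The function name `detect_segments` and the parameters `frame_len`, `energy_threshold`, and `min_silence_frames` are hypothetical and chosen for illustration.

```python
from typing import List, Tuple


def detect_segments(
    samples: List[float],
    frame_len: int = 160,            # assumed frame size (10 ms at 16 kHz)
    energy_threshold: float = 0.01,  # assumed threshold on mean squared amplitude
    min_silence_frames: int = 3,     # assumed pause length that closes a segment
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of detected voice segments."""
    segments: List[Tuple[int, int]] = []
    start = None      # sample index where the current segment began
    silence_run = 0   # consecutive low-energy frames seen inside a segment
    n_frames = len(samples) // frame_len

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy >= energy_threshold:
            if start is None:
                start = i * frame_len  # start point of speech
            silence_run = 0
        elif start is not None:
            silence_run += 1
            if silence_run >= min_silence_frames:
                # End point is the first frame of the silent run.
                end = (i - silence_run + 1) * frame_len
                segments.append((start, end))
                start = None
                silence_run = 0

    if start is not None:
        # Input ended while a segment was still open.
        segments.append((start, (n_frames - silence_run) * frame_len))
    return segments


# Synthetic signal: silence, speech, pause, speech, silence.
signal = [0.0] * 480 + [0.5] * 800 + [0.0] * 640 + [0.5] * 320 + [0.0] * 160
segments = detect_segments(signal)
```

In a streaming setting, each segment would be sent for recognition as soon as its end point is found, which is what lets text be returned while recording continues.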

Embodiment 2

[0061] As shown in Figure 6 and Figure 7, the present invention provides another voice input method. This embodiment differs from Embodiment 1 in that a step is added during recording that monitors the noise of the recording environment to obtain the signal-to-noise ratio. The number of candidate results can be adjusted according to the signal-to-noise ratio, and the user is prompted when the noise is too strong for voice input. This embodiment may specifically include:

[0062] Step S21: during recording, the noise of the recording environment is monitored to obtain the signal-to-noise ratio. Specifically, this step can automatically detect the signal-to-noise ratio of the input voice and feed it back on the interactive interface, so that the user can be reminded when the noise is too strong for voice input. The number of candidate results can also be adjusted according to different signal-to-noise rat...
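Step S21 can be sketched as follows, under stated assumptions. The source does not say how the signal-to-noise ratio is computed or which thresholds apply. The sketch uses the common power-ratio definition in decibels and treats a noise-only stretch of the recording as the noise estimate. The 5 dB and 15 dB thresholds, the candidate counts, and the choice to offer more candidates at lower signal-to-noise ratios are all hypothetical.

```python
import math
from typing import List, Optional


def snr_db(speech: List[float], noise: List[float]) -> float:
    """Estimate SNR in dB from a speech stretch and a noise-only stretch."""
    p_speech = sum(x * x for x in speech) / len(speech)
    p_noise = sum(x * x for x in noise) / len(noise)
    if p_noise == 0:
        return math.inf
    if p_speech == 0:
        return -math.inf
    return 10 * math.log10(p_speech / p_noise)


def candidate_count(snr: float) -> Optional[int]:
    """Map SNR to a number of candidate results (thresholds are assumptions).

    Returns None when the environment is judged too noisy for voice input,
    in which case the interface would prompt the user instead.
    """
    if snr < 5.0:
        return None   # strong noise: remind the user
    if snr < 15.0:
        return 5      # moderate noise: offer more candidates
    return 3          # quiet environment: fewer candidates suffice


estimate = snr_db([1.0] * 10, [0.1] * 10)
```

The assumed design choice is that recognition is less reliable at low signal-to-noise ratios, so a longer candidate list gives the user a better chance of finding the correct text.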

Embodiment 3

[0077] As shown in Figure 8, the present invention also provides another voice input system, including a segmentation module 41, a correction module 42 and a noise monitoring unit 43.

[0078] The segmentation module 41 is used to continuously segment the input voice into voice segments while recording and to generate the text of each voice segment. Specifically, the segmentation module 41 is located on the cloud server and uses a voice endpoint detection algorithm to continuously divide the input voice into voice segments. This module can automatically segment the voice recognition results and return them segment by segment for the user's secondary confirmation.

[0079] The correction module 42 is used to display the text of each voice segment in sequence and to modify the text of each voice segment in turn according to the user's selection. Specifically, this module allows the user to modify and confirm the returned text while...
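The behaviour of the correction module can be sketched as a loop over the returned segments. This is an illustrative sketch, not the patented implementation: the `SegmentResult` type, the `correct_segments` function, and the `choose` callback (which stands in for the user's selection on the interactive interface) are hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class SegmentResult:
    """Recognition output for one voice segment (hypothetical structure)."""
    candidates: List[str]  # candidate texts, best guess first


def correct_segments(
    results: List[SegmentResult],
    choose: Callable[[int, List[str]], int],
) -> str:
    """Show each segment in order and keep the candidate the user selects."""
    confirmed: List[str] = []
    for index, result in enumerate(results):
        # `choose` returns the position of the candidate the user picked.
        picked = choose(index, result.candidates)
        confirmed.append(result.candidates[picked])
    return " ".join(confirmed)


results = [
    SegmentResult(["hello word", "hello world"]),
    SegmentResult(["how are you"]),
]
# Simulated user: corrects the first segment, accepts the second as-is.
final_text = correct_segments(results, lambda i, c: 1 if i == 0 else 0)
```

In a real interface the callback would block on a tap or key press, and new segments could keep arriving from the segmentation module while earlier ones are being confirmed.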

Abstract

The invention relates to a voice input method and system. The method includes: while recording, segmenting the input voice into voice segments and generating the text of each voice segment; and displaying the text of each voice segment in order and correcting the text of each voice segment in order according to the user's selection. The method and system allow the voice recognition result to be segmented automatically and returned segment by segment for the user's secondary confirmation, so that the user can correct and confirm the returned text while recording.

Description

Technical field

[0001] The invention belongs to the field of voice recognition, and in particular relates to a voice input method and system.

Background

[0002] With the advancement of speech recognition technology and the rise of cloud computing, it has become a trend to use speech input on mobile terminals, perform speech-to-text transcription on cloud servers, and return the text to the mobile terminals. Because of the size limitations of mobile terminals, entering text directly through a physical or virtual keyboard has never been convenient enough. It is foreseeable that voice input will replace key input in more and more places.

[0003] However, the fact that the accuracy of voice recognition can hardly reach 100% hinders the complete replacement of key input by voice input. In fact, due to the complexity of real pronunciation under various conditions in life, the accuracy of speech recognition can never reach 100%, especially in a noisy envi...

Application Information

IPC(8): G10L15/26, G06F3/16

Inventor: 李曜, 许东星

Owner: SHANGHAI GUOKE ELECTRONICS
