Speech input method and system
A voice input and voice fragment technology, applied in voice input/output, voice analysis, voice recognition and other directions, can solve problems such as unfriendly and cumbersome users
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] Such as Figure 3~5 As shown, the present invention provides a voice input method, including:
[0049] Step S11: While recording, the input voice is continuously divided into voice fragments and the text of each voice fragment is generated. Specifically, the present invention can automatically split the voice recognition result and perform the segment return for the user to confirm again. The server continuously divides the input voice into voice fragments and generates the text of each voice fragment, and continuously divides the input voice into voice fragments through the voice endpoint detection algorithm. The voice endpoint detection accurately determines the voice from a signal containing voice The starting point and the ending point distinguish between voice and non-voice signals. Voice endpoint detection is an important aspect of voice processing technology. For example, when the user continuously inputs voice, the cloud server can use the endpoint detection algori...
Embodiment 2
[0061] Such as Image 6 with Figure 7 As shown, the present invention provides another voice input method. The difference between this embodiment and the embodiment is that it adds a step of performing noise monitoring on the recording environment to obtain the signal-to-noise ratio during recording, and the candidates can be adjusted according to different signal-to-noise ratios. The number of results, and prompts the user in the case of strong noise that is not suitable for voice input. This example can specifically include:
[0062] Step S21: Perform noise monitoring on the recording environment during recording to obtain the signal-to-noise ratio. Specifically, this step can automatically detect the signal-to-noise ratio of the input voice and feed it back on the interactive interface, which can be used in the case of strong noise that is not suitable for voice input The user is reminded that the number of candidate results can also be adjusted according to different signal-t...
Embodiment 3
[0077] Such as Figure 8 As shown, the present invention also provides another voice input system, including a segmentation module 41, a correction module 42, and a noise monitoring unit 43.
[0078] The segmentation module 41 is used to continuously segment the input voice into voice segments and generate the text of each voice segment while recording. Specifically, the segmentation module 41 is located on a cloud server, and the segmentation module 41 uses the voice The endpoint detection algorithm continuously divides the input voice into voice segments. This module can automatically split the voice recognition results and return them in segments for the user to confirm again.
[0079] The correction module 42 is used to display the text of each voice segment in turn, and correct the text of each voice segment in turn according to the user's selection. Specifically, this module can enable the user to modify and confirm the returned text while recording. However, in the interacti...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


