Voice input method and system
A voice input and voice technology, applied in voice input/output, voice analysis, voice recognition, etc., can solve problems such as unfriendly and cumbersome users
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] Such as Figure 3-5 As shown, the present invention provides a voice input method, comprising:
[0049] Step S11, while recording, the input voice is continuously segmented into voice segments and the text of each voice segment is generated. Specifically, the present invention can automatically segment the voice recognition results and return them in segments for secondary confirmation by the user, which can be provided by the cloud The server continuously divides the input voice into voice segments and generates the text of each voice segment, and continuously divides the input voice into voice segments through the voice endpoint detection algorithm. The voice endpoint detection is to accurately determine the voice from a signal containing voice The start point and the end point distinguish voice and non-voice signals. Voice endpoint detection is an important aspect of voice processing technology. For example, when the user continuously inputs voice, the cloud server c...
Embodiment 2
[0061] like Figure 6 and Figure 7 As shown, the present invention provides another voice input method. The difference between this embodiment and the embodiment is that the step of monitoring the noise of the recording environment to obtain the signal-to-noise ratio is added during recording, and the candidate can be adjusted according to different signal-to-noise ratios. The number of results, and prompt the user in the case of strong noise that is not suitable for voice input. This example can specifically include:
[0062] Step S21, monitor the noise of the recording environment to obtain the signal-to-noise ratio during recording. Specifically, this step can automatically detect the signal-to-noise ratio of the input voice and feed it back on the interactive interface, which can be used in the case of strong noise that is not suitable for voice input The user is reminded that the number of candidate results can also be adjusted according to different signal-to-noise rat...
Embodiment 3
[0077] like Figure 8 As shown, the present invention also provides another voice input system, including a segmentation module 41 , a correction module 42 and a noise monitoring unit 43 .
[0078] Segmentation module 41 is used for constantly cutting the input voice into speech segments and generating the text of each speech segment while recording, specifically, described segmentation module 41 is positioned on the cloud server, and described segmentation module 41 passes voice The endpoint detection algorithm continuously divides the input voice into voice segments. This module can automatically segment the voice recognition results and return them in segments for the user's second confirmation.
[0079] The correction module 42 is used to sequentially display the text of each speech segment, and modify the text of each speech segment in turn according to the user's selection. Specifically, this module can realize that the user can modify and confirm the returned text while...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 