Voice endpoint detection method, device, terminal and storage medium
An endpoint detection and speech technology, applied in the computer field, can solve the problems of false truncation, fast speech, slow speech, etc., and achieve the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] figure 1 It is a flow chart of the voice endpoint detection method provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of detecting the user's voice endpoint in the voice-based human-computer interaction process, and the method can be executed by the voice endpoint detection device. The device can be implemented in the form of software and / or hardware, and can be integrated on a terminal with a voice recognition function, such as an intelligent mobile terminal and a vehicle-mounted device.
[0024] Such as figure 1 As shown, the voice endpoint detection method provided in this embodiment may include:
[0025] S110. Determine whether the difference between the user's current speech rate and the historical average speech rate is within a preset difference range.
[0026] During the voice interaction process between the user and the terminal, the terminal can call a voice collection device, such as a microphone, to obtain the...
Embodiment 2
[0038] figure 2 It is a flow chart of the speech endpoint detection method provided by Embodiment 2 of the present invention, and this embodiment is further optimized on the basis of the foregoing embodiments. Such as figure 2 As shown, the method may include:
[0039] S210. Determine whether the difference between the user's current speech rate and the historical average speech rate is within a preset difference range.
[0040] S220. If the difference between the current speech rate and the historical average speech rate is not in the preset difference range, during the next speech endpoint detection process adjacent to the current speech endpoint detection, based on the moment when the user's speech energy starts to decrease, perform the target duration The speech energy corresponding to the end time of the duration extension is used as the speech energy threshold, wherein the target duration is a preset time length determined according to the current speech rate.
[00...
Embodiment 3
[0047] image 3 It is a schematic structural diagram of a speech endpoint detection device provided in Embodiment 3 of the present invention. This embodiment is applicable to the situation of detecting a user's speech endpoint during a speech-based human-computer interaction process. The device can be implemented in the form of software and / or hardware, and can be integrated on a terminal with a voice recognition function, such as an intelligent mobile terminal and a vehicle-mounted device.
[0048] Such as image 3 As shown, the speech endpoint detection device provided in this embodiment may include a speech rate determination module 310 and a speech energy threshold adjustment module 320, wherein:
[0049] Speech rate determination module 310, configured to determine whether the difference between the user's current speech rate and the historical average speech rate is within a preset difference range;
[0050] Speech energy threshold adjustment module 320, for if the dif...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com