Voice breakpoint detection method, device and equipment based on artificial intelligence

An artificial-intelligence-based voice breakpoint detection technology, applied in speech analysis, speech recognition, audio data retrieval and related fields. It addresses the low recognition effectiveness, frequent misjudgment, and error-prone behavior of existing approaches.

Pending Publication Date: 2021-03-30
HUAWEI TECH CO LTD
AI Technical Summary

Problems solved by technology

[0005] 1) Recognition is not very effective in scenarios such as repeated speech and/or drawn-out speech by the speaker; it is error-prone, and the user experience is unnatural;
[0006]...

Examples

Detailed Description of Embodiments

[0077] The terms used in the embodiments of the present application are only used to explain specific embodiments of the present application, and are not intended to limit the present application.

[0078] This application provides a speech breakpoint detection method based on artificial intelligence. On top of the traditional acoustic model, a semantic integrity model is applied to the query sentence input by the user, and whether the user has finished speaking is judged dynamically based on semantic integrity. This identifies the user's real intention more accurately and adapts better to scenarios such as repeated speech and drawn-out speech.
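
The decision described in [0078] can be sketched as follows. This is a minimal illustration, not the application's implementation: the function names, the `silence_detected` flag standing in for the acoustic model's output, and the word-list heuristic standing in for a trained semantic integrity model are all assumptions made for the example.

```python
from typing import Callable


def toy_is_semantically_complete(text: str) -> bool:
    """Hypothetical stand-in for a trained semantic integrity model.

    Treats a query as incomplete when it is empty or ends with a
    dangling function word or filler.
    """
    dangling = {"to", "the", "a", "and", "for", "um", "uh"}
    words = text.lower().split()
    return bool(words) and words[-1] not in dangling


def is_end_of_speech(
    silence_detected: bool,
    partial_text: str,
    is_semantically_complete: Callable[[str], bool],
) -> bool:
    """Declare the end of speech only when the acoustic side reports a
    pause AND the text recognized so far is semantically complete."""
    if not silence_detected:
        return False  # the user is still producing sound
    return is_semantically_complete(partial_text)


# A pause after "navigate to" does not end the turn; a pause after a
# complete query does.
print(is_end_of_speech(True, "navigate to", toy_is_semantically_complete))
print(is_end_of_speech(True, "navigate to the airport", toy_is_semantically_complete))
```

Under these assumptions, a pause alone never ends the turn; the semantic check has the final say, which is what lets the system keep listening through hesitations.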

[0079] This application is applicable to dynamically judging the end point of the user's voice stream in interactive voice scenarios. The interaction scenario may be as shown in Figure 1. Figure 1 is a schematic diagram of the interaction scene of the artificial intelligence-based speech breakpoint d...

Abstract

The embodiment of the invention provides a voice breakpoint detection method, device and equipment based on artificial intelligence. In the method, a pre-trained semantic integrity model performs semantic integrity detection on candidate results whose probability is higher than a preset threshold. After these candidate results are determined to be semantically complete, natural language understanding is performed on them to obtain the corresponding intentions. Finally, a response to the query statement is obtained according to the candidate result whose probability is higher than the preset threshold and its corresponding intention. In this way, whether the user has finished speaking can be judged dynamically according to semantic integrity, and the user's real intention can be recognized more accurately. Whether the user has finished speaking can also be judged accurately in scenarios such as repeated speech and drawn-out speech, which improves the user experience.
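
The flow in the abstract can be sketched roughly as below. Everything named here is hypothetical: the `Candidate` type, the three callables, and the toy stand-ins for the semantic integrity model and natural language understanding are illustration only, and returning `None` to mean "keep listening" is an assumption about how an incomplete result would be handled.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Candidate:
    text: str           # one recognition hypothesis for the query
    probability: float  # score assigned by the recognizer


def respond(
    candidates: List[Candidate],
    threshold: float,
    is_complete: Callable[[str], bool],
    understand: Callable[[str], str],
    answer: Callable[[str, str], str],
) -> Optional[str]:
    """Return a response for the best complete candidate above the
    threshold, or None if no such candidate exists yet."""
    ranked = sorted(candidates, key=lambda c: c.probability, reverse=True)
    for cand in ranked:
        if cand.probability <= threshold:
            break  # remaining candidates are at or below the threshold
        if not is_complete(cand.text):
            continue  # semantically incomplete, skip this hypothesis
        intent = understand(cand.text)    # natural language understanding
        return answer(cand.text, intent)  # response from text + intent
    return None


# Toy stand-ins, assumptions for this example only.
def toy_is_complete(text: str) -> bool:
    words = text.lower().split()
    return bool(words) and words[-1] not in {"to", "the", "a", "and"}


def toy_understand(text: str) -> str:
    return "navigate" if text.lower().startswith("navigate") else "unknown"


def toy_answer(text: str, intent: str) -> str:
    return f"{intent}: {text}"


hypotheses = [
    Candidate("navigate to", 0.9),
    Candidate("navigate to the airport", 0.8),
    Candidate("never gate", 0.2),
]
print(respond(hypotheses, 0.5, toy_is_complete, toy_understand, toy_answer))
```

In this sketch the highest-scoring hypothesis is skipped because it is incomplete, and the response is built from the next candidate that passes both the threshold and the completeness check.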

Description

Technical field

[0001] The present application relates to the technical field of speech recognition in artificial intelligence, in particular to a speech breakpoint detection method, device and equipment based on artificial intelligence.

Background

[0002] Automatic speech recognition (hereinafter referred to as ASR) is a technology for converting human speech into text. An ASR service is often triggered by a wake-up word or a button, and the end point of the speech (hereinafter referred to as EP) depends on automatic detection by the ASR.

[0003] EP detection in the existing related art is mainly based on voice activity detection (hereinafter referred to as VAD), and there are mainly two kinds of speech breakpoint detection schemes in the existing related art: a detection method based on silence, and a detection method based on prosody and tone. de...
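
For context, a silence-based endpoint detector of the kind mentioned in [0003] might look like the sketch below. It is a generic energy-threshold illustration, not taken from the application: the function names, the -40 dB threshold, and the three-frame trailing-silence limit are assumed values, and frames are assumed to be non-empty sequences of samples in the range [-1, 1].

```python
import math
from typing import Optional, Sequence


def frame_energy_db(frame: Sequence[float]) -> float:
    """Mean-square energy of one non-empty frame, in decibels."""
    mean_square = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(mean_square + 1e-12)  # offset avoids log(0)


def find_endpoint(
    frames: Sequence[Sequence[float]],
    energy_threshold_db: float = -40.0,
    max_trailing_silence: int = 3,
) -> Optional[int]:
    """Return the index of the frame at which the endpoint is declared,
    or None if speech never ends with enough consecutive silence."""
    heard_speech = False
    silent_run = 0
    for i, frame in enumerate(frames):
        if frame_energy_db(frame) >= energy_threshold_db:
            heard_speech = True
            silent_run = 0  # speech resets the silence counter
        elif heard_speech:
            silent_run += 1
            if silent_run >= max_trailing_silence:
                return i
    return None


speech = [0.5] * 160
silence = [0.0] * 160
# Speech, then a three-frame pause: the detector cuts the turn at
# frame 5, even though the speaker resumes at frame 6.
print(find_endpoint([silence, speech, speech, silence, silence, silence, speech]))
```

A fixed silence limit like this cannot tell a mid-sentence hesitation from a finished query, which is the kind of misjudgment a semantic check is meant to reduce.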

Application Information

IPC(8): G10L15/04, G10L15/06, G10L15/08, G10L15/18, G10L15/26
CPC: G10L15/04, G10L15/063, G10L15/1815, G10L15/08, G10L15/26, G10L15/06, G10L15/18, G10L15/183, G06F16/63, G10L15/22
Inventor: 张桂成, 吴友国, 孟函可, 张跃, 柴海水, 陈家胜, 杨军
Owner: HUAWEI TECH CO LTD