Voice interaction method and voice interaction device

A voice interaction method based on speech recognition technology, applied in speech analysis, speech recognition, semantic analysis, etc. It addresses problems such as complex acoustic environments, frequent false triggering, and the resulting loss of correctness in semantic understanding.

Active Publication Date: 2017-11-03
IFLYTEK CO LTD
13 Cites, 64 Cited by

AI Technical Summary

Problems solved by technology

However, unlike the close-range voice interaction of mobile terminals such as mobile phones, in application environments such as smart homes or cars the user is relatively far from the microphone. Ambient noise, tire noise and air-conditioning noise in the car, and interfering speech from the front passenger and rear passengers make the acoustic environment very complicated.
As a result, even when the user has no intention to interact, noise can falsely trigger recognition and semantic understanding, and the client often responds to the resulting semantic understanding result.
This not only gives users a poor experience, but the falsely triggered semantics also affect the correctness of subsequent semantic understanding, especially in voice interaction that takes historical information into account.

Embodiment Construction

[0099] To enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments are described in further detail below with reference to the accompanying drawings and specific implementations.

[0100] At present, most voice interaction in vehicles and smart homes relies only on the input text for semantic understanding to obtain the final semantic understanding result, and so uses little information. In complex scenarios, this cannot achieve a good semantic rejection effect. To improve semantic rejection, the existing technology has been refined, for example: 1. A fixed threshold is set on each service's semantic understanding score; only results above the threshold are output, and all others are rejected. 2. Service priorities are set: when multiple services have the same score, through manually set service priorities, the higher priority is...
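The two existing refinements listed above can be illustrated with a minimal Python sketch. It is not taken from the patent: the threshold value, the service names, and the priority table are hypothetical, and because the source text is cut off before it says how the priority is applied, the tie-breaking rule (the higher-priority service wins a tie) is an assumption.

```python
from typing import Dict, Optional

# Hypothetical values chosen only for illustration.
THRESHOLD = 0.5                            # fixed rejection threshold on the score
PRIORITY = {"navigation": 2, "music": 1}   # larger number = higher priority


def select_service(scores: Dict[str, float]) -> Optional[str]:
    """Pick a service from per-service semantic understanding scores.

    Refinement 1: reject every result whose score is not above THRESHOLD.
    Refinement 2 (assumed behavior): among equal scores, prefer the
    service with the higher manually set priority.
    """
    accepted = {name: s for name, s in scores.items() if s > THRESHOLD}
    if not accepted:
        return None  # semantic rejection: no service responds
    # Sort key: score first, priority only breaks ties between equal scores.
    return max(accepted, key=lambda name: (accepted[name], PRIORITY.get(name, 0)))
```

In this sketch, a noisy input that scores low for every service is rejected outright. Neither rule uses anything beyond the text score, which is the limitation paragraph [0100] attributes to text-only approaches.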

Abstract

The invention discloses a voice interaction method and a voice interaction device. The method comprises the following steps: after a speech recognition text is received, the text is distributed to each service, and each service performs semantic understanding on it separately; then, based on the semantic understanding results obtained and the application state of the client, the results are ranked by confidence, the result with the highest confidence is selected, and a response is given to that result. Because the ranking uses multidimensional information, it considers not only the degree of match between a semantic understanding result and each service but also the application state of the client, for example whether the client is in a navigation state or a music-listening state. Since the client's applications and their states are likely to be the objects the voice interaction acts on, semantic understanding based on this multidimensional information can effectively improve the accuracy of determining which service an utterance belongs to, improve the accuracy of semantic understanding in man-machine interaction, and enhance the user experience.
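The flow described in the abstract can be sketched in Python as follows. This is a toy illustration under stated assumptions, not the patented implementation: the keyword-based services, the `SemanticResult` fields, and the additive `STATE_BONUS` used to combine the match score with the client's application state are all hypothetical, since the abstract does not say how the confidence is computed.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class SemanticResult:
    service: str        # service that produced this interpretation
    intent: str         # hypothetical intent label
    match_score: float  # degree of match between text and service, 0..1


def keyword_service(service: str, intent: str,
                    keywords: List[str]) -> Callable[[str], SemanticResult]:
    """Build a toy semantic-understanding function for one service."""
    def understand(text: str) -> SemanticResult:
        words = text.lower().split()
        hits = sum(1 for k in keywords if k in words)
        return SemanticResult(service, intent, hits / len(keywords))
    return understand


# Hypothetical services; a real system would use full semantic parsers.
SERVICES: Dict[str, Callable[[str], SemanticResult]] = {
    "music": keyword_service("music", "play_music", ["play", "song", "next"]),
    "navigation": keyword_service("navigation", "navigate",
                                  ["navigate", "route", "next"]),
}

STATE_BONUS = 0.2  # assumed weight given to the client's active application


def confidence(result: SemanticResult, app_state: Optional[str]) -> float:
    """Combine the match score with the client application state (assumed rule)."""
    bonus = STATE_BONUS if result.service == app_state else 0.0
    return result.match_score + bonus


def respond(text: str, app_state: Optional[str]) -> SemanticResult:
    """Distribute the text to every service, rank by confidence, return the best."""
    results = [understand(text) for understand in SERVICES.values()]
    ranked = sorted(results, key=lambda r: confidence(r, app_state), reverse=True)
    return ranked[0]
```

With these toy services, the utterance "next" matches music and navigation equally well on text alone, so the client's application state decides: `respond("next", "music")` selects the music service, while `respond("next", "navigation")` selects navigation.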

Description

Technical field

[0001] The present invention relates to the field of voice signal processing, in particular to a voice interaction method and device.

Background art

[0002] With the increasing maturity of artificial intelligence-related technologies, people's lives have begun to become intelligent, and various smart devices, such as smart cars, have gradually entered people's daily lives. Voice is one of the mainstream interaction methods in smart device applications, and its convenience and speed are obvious to all.

[0003] During voice interaction, the voice input by the user is transcribed into text, the text then undergoes semantic understanding, and the client responds to the corresponding event according to the result of semantic understanding. However, unlike the close-range voice interaction of mobile terminals such as mobile phones, in the application environment of smart homes or cars, the user is relatively far away from the microphone, coupled with no...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/22, G10L15/18, G06F17/27
CPC: G10L15/1822, G10L15/22, G10L2015/223, G06F40/30
Inventor: 李深安, 孔祥星, 王兴宝, 庄纪军, 王雪初, 马军涛, 韩后岳
Owner: IFLYTEK CO LTD