Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semantic parsing method and device for voice

A technology of semantic parsing and speech, applied in the field of communication, can solve the problem of low accuracy of semantic parsing in the field of music, and achieve the effect of improving the success rate of interaction and improving the accuracy

Active Publication Date: 2021-04-30
QINGDAO HAIER TECH +1
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The embodiment of the present invention provides a method and device for semantic analysis of speech, to at least solve the problem in the related art that homophones can only be sent through error correction, making music The problem of low accuracy of domain semantic analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic parsing method and device for voice
  • Semantic parsing method and device for voice
  • Semantic parsing method and device for voice

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking running on a mobile terminal as an example, figure 1 It is a block diagram of the hardware structure of the mobile terminal of the semantic analysis method of the voice of the embodiment of the present invention, as figure 1 As shown, the mobile terminal may include one or more ( figure 1 Only one is shown in the figure) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data. Optionally, the above-mentioned mobile terminal also A transmission device 106 for communication functions as well as input and output devices 108 may be included. Those of ordinary skill in the art can understand that, figure 1 The shown structure is only for illustration, and does not limit the st...

Embodiment 2

[0112] In this embodiment, there is also provided a speech semantic analysis device, which is used to implement the above embodiments and preferred implementation modes, and what has already been described will not be repeated. As used hereinafter, the term "module" may realize a combination of software and / or hardware for a predetermined function. Although the devices described in the following embodiments are preferably implemented in software, implementations in hardware, or a combination of software and hardware are also possible and contemplated.

[0113] Figure 4 is a block diagram of a semantic analysis device for speech according to an embodiment of the present invention, such as Figure 4 shown, including:

[0114] The first acquisition module 42 is configured to acquire a plurality of text recognition results of the voice data, and phoneme recognition results corresponding to the plurality of text recognition results, wherein one text recognition result co...

Embodiment 3

[0141] An embodiment of the present invention also provides a storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.

[0142] Optionally, in this embodiment, the above-mentioned storage medium may be configured to store a computer program for performing the following steps:

[0143] S1. Acquire multiple text recognition results of speech data, and phoneme recognition results corresponding to the multiple text recognition results, wherein one text recognition result corresponds to one phoneme recognition result;

[0144] S2. Obtain the target recognition result with the highest confidence from the plurality of text recognition results;

[0145] S3. Determine the field classification result to which the voice data belongs according to the target recognition result;

[0146] S4. In the preset text domain to which the voice data belongs, determine the musi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a semantic parsing method and device for voice, and the method comprises the steps: obtaining a plurality of text recognition results of voice data, and phoneme recognition results corresponding to the text recognition results; obtaining a target recognition result with the highest confidence degree from the plurality of text recognition results; determining a domain classification result to which the voice data belongs according to the target recognition result; and in the preset text field to which the voice data belongs, determining the music name of the voice data according to the plurality of text recognition results and the phoneme recognition results corresponding to the plurality of text recognition results. The voice interaction system solves the problems that the music name recognition accuracy is low and the interaction success rate is relatively low, improves the music name recognition accuracy, and also improves the interaction success rate of a user during interaction in the music field.

Description

technical field [0001] The present invention relates to the communication field, in particular, to a semantic analysis method and device for speech. Background technique [0002] In modern daily life, users usually like to call terminal devices, such as speakers and mobile phones, to play songs through the intelligent voice dialogue system. The above performance is unsatisfactory. Aiming at the technical problem of poor interactive success rate of song names in the music field in intelligent dialogue systems, the invention proposes to use the Lattice search path in the speech recognition decoder to output the phoneme-level N-Best recognition results with the highest confidence score, and then call semantic analysis, The edit distance algorithm of the phonemes is used to screen the final analysis results to improve the success rate of interaction in the music field. [0003] In the existing speech dialogue system, the natural speech audio data from the user is obtained from...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/18G10L15/26G10L15/02G06F40/279G06F40/30
CPCG10L15/1822G10L15/02G06F40/279G06F40/30G10L2015/025
Inventor 苏腾荣朱文博
Owner QINGDAO HAIER TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products