A method and system for completing a semantically incomplete corpus

A corpus and semantic technology, applied in the field of speech recognition, can solve problems such as vague intentions, incomplete language components, and difficult intelligent recognition of user intentions by speech recognition products, achieving the effect of increasing workload

Active Publication Date: 2019-02-15
GUANGDONG XIAOTIANCAI TECH CO LTD
View PDF8 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Due to accidents that may occur during the user input voice, for example, part of the voice input is accidentally interrupted or part of the voice is not captured by the microphone, as well as the interference of the external environment, such as the environment is too noisy to intelligently recognize part of the voice. Incomplete components make it difficult to accurately identify the user's true intentions
[0005] In addition, for low-grade students, because they are in the initial stage of learning, in the process of language expression, language components are often incomplete and intentions are vague, which makes it difficult for voice recognition products to intelligently recognize the real user intentions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for completing a semantically incomplete corpus
  • A method and system for completing a semantically incomplete corpus
  • A method and system for completing a semantically incomplete corpus

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0079] The first embodiment of the present invention, such as figure 1 As shown, a method to complete semantically incomplete corpus includes:

[0080] S100 Acquire a corpus sample library with complete semantics, and create an audio library, a semantic slot, and a regular expression library based on the corpus sample library.

[0081] Specifically, a large number of semantically complete corpus samples are collected to establish a corpus sample library, all corpus samples are analyzed to summarize the characteristics of the semantically complete corpus, and audio libraries, semantic slots, and regular expression libraries are established.

[0082] S200 acquires the voice of the user.

[0083] Specifically, the user's voice is acquired. The user's voice may be a voice input by the user in real time. For example, the user only inputs part of the voice, or the system collects and acquires only part of the voice due to factors such as the environment. It may also be the downloa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a system for completing semantically incomplete corpus. The method comprises the following steps: obtaining a semantically complete corpus sample library; accordingto the corpus sample library, establishing an audio library, a semantic slot and a regular expression library; obtaining a semantically intact corpus sample library; obtaining a semantically intact corpus sample library. Acquiring user voice; Matching the user voice and the audio library; When the matching result is consistent, determining a part of speech corresponding to the matching participleaccording to the semantic slot, wherein the matching participle is a participle matching with the audio library in the user voice; Comparing the part of speech of the matching participle with the regular expression library, and completing the incomplete components in the user speech according to the regular expression in the regular expression library to obtain a supplementary full-text version;A semantic analysis is performed according to the complement text. The invention intelligently identifies the true user intention by completing the incomplete components in the corpus.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method and system for complementing semantically incomplete corpus. Background technique [0002] With the rapid development of the Internet, people's lives are becoming more and more intelligent, so people are becoming more and more accustomed to using smart terminals to fulfill various needs. Moreover, with the increasing maturity of artificial intelligence-related technologies, the intelligence of various terminals is also getting higher and higher. As one of the mainstream communication applications of human-computer interaction in smart terminals, voice interaction is increasingly favored by users. [0003] The smart terminal recognizes the voice input by the user, and then takes corresponding measures. Therefore, the accuracy of the voice input by the user through the terminal seriously affects the feedback made by the smart terminal. [0004] Due to accident...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/903G06F17/27G10L15/18
CPCG10L15/1822G06F40/289G06F40/30
Inventor 魏誉荧
Owner GUANGDONG XIAOTIANCAI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products