Corpus generation method, corpus generation device and intelligent equipment

A technology of corpus and words, which is applied in the field of smart devices and computer-readable storage media, corpus generation devices, and corpus generation methods, and can solve problems such as the large amount of corpus and the impact on the efficiency of intent recognition

Pending Publication Date: 2020-05-19
UBTECH ROBOTICS CORP LTD
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the input sentence contains many entity words, it will often lead to a large number of replaced corpus, which will affect the efficiency of subsequent intent recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Corpus generation method, corpus generation device and intelligent equipment
  • Corpus generation method, corpus generation device and intelligent equipment
  • Corpus generation method, corpus generation device and intelligent equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] A corpus generation method provided by the embodiment of the present application is described below, please refer to figure 1 , the corpus generation method in the embodiment of the present application includes:

[0029] Step 101, receiving an input sentence;

[0030] In the embodiment of the present application, the smart device may first obtain the input sentence to be recognized, and the above input sentence refers to a sentence input by the user during interaction with the smart device. Optionally, the above-mentioned input sentence may be a sentence input by the user through text; or, the above-mentioned sentence may also be a sentence input by the user through voice, and the input form of the above-mentioned input sentence is not limited here.

[0031] Step 102, carry out entity word recognition to above-mentioned input sentence;

[0032] In the embodiment of the present application, after receiving the above-mentioned input sentence, the smart device will first...

Embodiment 2

[0083] On the basis of the first embodiment above, the second embodiment of the present application provides another corpus generation method, such as figure 2 As shown, the corpus generation method in the embodiment of the present application includes:

[0084] Step 201, receiving an input sentence;

[0085] Step 202, performing entity word recognition on the above-mentioned input sentence;

[0086] Step 203, if there is more than one entity word in the above-mentioned input sentence, then obtain a preset word-slot combination list, the above-mentioned word-slot combination list contains more than one word-slot combination, and each word-slot combination includes at least one word slot;

[0087] Step 204, based on the word slot combinations contained in the above word slot combination list, match and replace more than one entity word contained in the above input sentence to obtain more than one replacement corpus;

[0088] In the embodiment of the present application, the ...

Embodiment 3

[0133] Embodiment 3 of the present application provides a device for generating corpus, which can be integrated into smart devices, such as Figure 4 As shown, the corpus generating device 400 in the embodiment of the present application includes:

[0134] A receiving unit 401, configured to receive an input sentence;

[0135] A recognition unit 402, configured to perform entity word recognition on the above-mentioned input sentence;

[0136] The obtaining unit 403 is used to obtain a preset word slot combination list if there is more than one entity word in the input sentence. The above word slot combination list contains more than one word slot combination, and each word slot combination includes at least one word slot ;

[0137] The generating unit 404 is configured to match and replace more than one entity word contained in the above-mentioned input sentence based on the word-slot combination contained in the above-mentioned word-slot combination list, to obtain more tha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a corpus generation method, a corpus generation device, intelligent equipment and a computer readable storage medium. The method comprises the steps: receiving an input statement; performing entity word recognition on the input statement; if more than one entity words exist in the input statement, a preset word slot combination list is obtained, the word slot combination list comprises more than one word slot combination, and each word slot combination at least comprises one word slot; and based on word slot combinations contained in the word slot combination list, matching and replacing more than one entity words contained in the input statement to obtain more than one replacement corpus. According to the scheme, word slot replacement is limited through the presetword slot combination list, it is avoided that word slot replacement is conducted on the input statement through invalid word slot combinations, generation of wrong replacement corpora can be reducedto a certain extent, and the subsequent replacement corpus processing efficiency is improved.

Description

technical field [0001] The present application belongs to the technical field of artificial intelligence, and in particular relates to a method for generating corpus, a device for generating corpus, an intelligent device, and a computer-readable storage medium. Background technique [0002] Now more and more smart devices have human-computer interaction functions. The above-mentioned smart devices often perform word slot extraction and replacement operations on the user's input sentences, and then perform intent recognition based on the replaced corpus. However, when the input sentence contains many entity words, it will often result in a large amount of corpus to be replaced, which will affect the efficiency of subsequent intent recognition. Contents of the invention [0003] In view of this, the present application provides a corpus generation method, a corpus generation device, an intelligent device, and a computer-readable storage medium, which can reduce the generati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295
Inventor 黄日星熊友军
Owner UBTECH ROBOTICS CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products