
Instance extension method, apparatus, device and medium

An instance extension method and random seed technology, applied in text database query, unstructured text data retrieval, etc., can solve the problems of a single sentence pattern in extended instances and the resulting limited improvement in training sequence labeling models.

Active Publication Date: 2021-02-19
BEIJING BAIDU NETCOM SCI & TECH CO LTD
10 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, because only the key information in the instance to be extended is replaced, every generated extended instance has the same sentence pattern as the instance to be extended, so the generated extended instances share a single sentence pattern. Extended instances with a single sentence pattern bring only limited improvement to the training of sequence labeling models.
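
The limitation described above can be illustrated with a minimal sketch of replacement-based expansion. The template, slot names, and slot values are hypothetical and do not come from the patent; the point is only that every generated instance collapses back to the same sentence pattern.

```python
from itertools import product

# Hypothetical annotated instance: slots are marked with {name} placeholders.
TEMPLATE = "what's the weather like {date} in {city}"

# Hypothetical slot dictionaries used for replacement (assumed for illustration).
SLOT_VALUES = {
    "date": ["today", "tomorrow"],
    "city": ["Beijing", "Shanghai"],
}


def expand_by_replacement(template, slot_values):
    """Generate instances by substituting every combination of slot values."""
    names = sorted(slot_values)
    results = []
    for combo in product(*(slot_values[n] for n in names)):
        results.append(template.format(**dict(zip(names, combo))))
    return results


def sentence_pattern(instance, slot_values):
    """Recover the pattern by masking known slot values back to placeholders."""
    pattern = instance
    for name, values in slot_values.items():
        for value in values:
            pattern = pattern.replace(value, "{" + name + "}")
    return pattern


expanded = expand_by_replacement(TEMPLATE, SLOT_VALUES)
patterns = {sentence_pattern(s, SLOT_VALUES) for s in expanded}
```

Here `expanded` holds four instances, but `patterns` contains only the original template, which is the single-pattern problem the patent sets out to address.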

Examples

Embodiment 1

[0047] Figure 1 is a flow chart of the instance extension method provided by Embodiment 1 of the present invention. This embodiment applies to expanding instances from a small number of provided instances. The method can be executed by an instance extension apparatus, which can be implemented in software and/or hardware. Referring to Figure 1, the instance extension method provided in this embodiment includes:

[0048] S110. Obtain an instance rule to be extended that includes keyword information.

[0049] The keyword information may be any information describing the extended instance, and the instance rule to be extended constrains any part of the extended instance. Specifically, the keyword information may be a keyword or a relationship between keywords. Typically, it is at least one of a text intent, a slot, and a slot sequence.
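
One way to picture such a rule is as a small record holding an intent, a set of required slots, and a required slot order. The sketch below is an assumption made for illustration: the class name `InstanceRule`, the function `satisfies`, and the intent and slot names are all hypothetical, and the patent does not specify any data structure.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class InstanceRule:
    """Hypothetical container for the keyword information in a rule."""
    intent: Optional[str] = None          # text intent, if constrained
    slots: Tuple[str, ...] = ()           # slots that must appear
    slot_order: Tuple[str, ...] = ()      # slots that must appear in this order


def satisfies(rule: InstanceRule, intent: str, slot_sequence: Sequence[str]) -> bool:
    """Check whether a labeled instance conforms to the rule."""
    if rule.intent is not None and rule.intent != intent:
        return False
    if not set(rule.slots).issubset(slot_sequence):
        return False
    # Subsequence test: each ordered slot must be found after the previous one.
    remaining = iter(slot_sequence)
    return all(name in remaining for name in rule.slot_order)


# Hypothetical rule: a ticket-booking intent where "departure" precedes "destination".
rule = InstanceRule(
    intent="book_ticket",
    slots=("departure", "destination"),
    slot_order=("departure", "destination"),
)
ok = satisfies(rule, "book_ticket", ["departure", "destination", "date"])
wrong_order = satisfies(rule, "book_ticket", ["destination", "departure"])
```

With these inputs, `ok` is `True` and `wrong_order` is `False`, because the second instance contains both slots but in the reverse order.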

[0050] For example, the instance rule to be extended may be: "from" is followed by a st...

Embodiment 2

[0067] Figure 2 is a flow chart of the instance extension method provided by Embodiment 2 of the present invention. This embodiment is an optional solution based on the foregoing embodiment. Referring to Figure 2, the instance extension method provided in this embodiment includes:

[0068] S210. Determine the instance rule to be extended associated with the instance to be extended.

[0069] Specifically, determining the instance rule to be extended associated with the instance to be extended includes:

[0070] Performing text analysis on the instance to be extended, and extracting the instance rule to be extended from it according to the text analysis result.
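
As a rough sketch of this step, suppose the text analysis result is already available as an intent label plus per-token BIO tags. That input format, the function name `extract_rule`, and the dictionary layout of the rule are assumptions for illustration; the patent does not state how the analysis result is represented.

```python
def extract_rule(intent, tags):
    """Derive a rule (intent, slot set, slot order) from BIO tags.

    `tags` is assumed to hold one BIO tag per token, e.g. "B-date", "I-date", "O".
    """
    slot_order = []
    for tag in tags:
        # Only the beginning of each slot span contributes to the slot sequence.
        if tag.startswith("B-"):
            slot_order.append(tag[2:])
    return {
        "intent": intent,
        "slots": sorted(set(slot_order)),
        "slot_order": slot_order,
    }


# The query from the background section, with a hypothetical tag set.
tokens = ["what's", "the", "weather", "like", "tomorrow"]
tags = ["O", "O", "O", "O", "B-date"]
rule = extract_rule("weather_query", tags)
```

For this instance, `rule` records the intent `weather_query` and a single `date` slot, which could then serve as the rule paired with the original sentence.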

[0071] Determining the instance rule to be extended in this way has the following effect: an instance extension model that meets the user's needs can be trained from only the small number of instances to be extended that the user provides.

[0072] However,...

Embodiment 3

[0085] Figure 3a is a flow chart of the instance extension method provided by Embodiment 3 of the present invention. This embodiment is an optional solution based on the foregoing embodiments. Referring to Figure 3a, the instance extension method provided in this embodiment includes:

[0086] Offline model generation and online instance generation.

[0087] Specifically, referring to Figure 3b, offline model generation includes:

[0088] Extracting several instances to be extended from a database that stores a large amount of data;

[0089] Extracting several instance rules to be extended from the instances to be extended;

[0090] The instance rules to be extended include keywords or relationships between keywords, for example, that keyword A must precede keyword B.

[0091] An instance rule to be extended and an instance to be extended that conforms to it are used as a pair of training samples and put into a machine learning model for t...
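
The pairing of a rule with a conforming instance can be sketched as building (source, target) text pairs. The text-to-text format, the serialization scheme, and the names `serialize_rule` and `build_training_pairs` are assumptions made for illustration; the patent excerpt does not specify the model type or the input encoding.

```python
def serialize_rule(rule):
    """Flatten a rule dictionary into a single source string (assumed format)."""
    parts = []
    if rule.get("intent"):
        parts.append("intent=" + rule["intent"])
    if rule.get("slot_order"):
        parts.append("order=" + ">".join(rule["slot_order"]))
    return " ; ".join(parts)


def build_training_pairs(samples):
    """Turn (rule, conforming instance) samples into (source, target) pairs."""
    return [(serialize_rule(rule), instance) for rule, instance in samples]


# Hypothetical samples: each rule is paired with an instance that conforms to it.
samples = [
    (
        {"intent": "weather_query", "slot_order": ["date"]},
        "what's the weather like tomorrow",
    ),
    (
        {"intent": "weather_query", "slot_order": ["date"]},
        "tomorrow, will it rain",
    ),
]
pairs = build_training_pairs(samples)
```

Because the same rule can be paired with instances of different sentence patterns, a model trained on such pairs has the chance to produce varied phrasings for one rule, which matches the stated goal of richer sentence patterns.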

Abstract

The embodiments of the invention disclose an instance extension method, apparatus, device and medium, and relate to the technical field of natural language processing. The method comprises: obtaining an instance rule to be extended that includes keyword information; and inputting the obtained rule into an instance extension model to generate an extended instance. The generated extended instances have richer sentence patterns than the instance to be extended.

Description

Technical field

[0001] Embodiments of the present invention relate to the technical field of natural language processing, and in particular to an instance extension method, apparatus, device and medium.

Background

[0002] For the task of understanding a search term (query), a common approach is to parse the query into intents and slots: the key information in the query is marked as slots, and the purpose of the query is marked as the intent. For example, for "what's the weather like tomorrow", the intent is weather query and the slot information is "tomorrow".

[0003] In machine learning, the query is usually understood and answered with a sequence labeling model. However, training a sequence labeling model requires a large amount of instance data labeled with intent and slot information as training samples. At present, the main method of obtaining instance data is: identifying a small number of manually-labeled instances to be extended, and rep...
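
To make the labeled training data concrete, the example query can be written as tokens with per-token tags. The BIO tagging scheme, the whitespace tokenization, and the function name `to_bio` are assumptions for illustration; the text only says that instances carry intent and slot labels.

```python
def to_bio(tokens, slots):
    """Convert slot spans into BIO tags.

    `slots` is a list of (start, end_exclusive, name) spans over token indices.
    """
    tags = ["O"] * len(tokens)
    for start, end, name in slots:
        tags[start] = "B-" + name
        for i in range(start + 1, end):
            tags[i] = "I-" + name
    return tags


# The query from paragraph [0002]; the label names are hypothetical.
tokens = "what's the weather like tomorrow".split()
labeled_instance = {
    "intent": "weather_query",
    "tokens": tokens,
    "tags": to_bio(tokens, [(4, 5, "date")]),
}
```

Each training sample for a sequence labeling model would then consist of one such token list, its tag list, and the intent label.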

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/33
Inventors: 王一鸣, 姜文斌, 孙珂
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD