Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for obtaining text extraction model

A text extraction and model technology, applied in the field of machine learning, can solve problems such as huge amount of corpus data, low efficiency of manual labeling, high labor cost and time cost, and achieve the effect of reducing labor cost and time cost

Active Publication Date: 2019-03-08
TENCENT TECH (SHENZHEN) CO LTD +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The training text set is completely obtained by manual labeling. Due to the huge amount of corpus data required to obtain the text extraction model and the low efficiency of manual labeling, the training process of the text extraction model will consume a lot of labor and time costs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for obtaining text extraction model
  • Method and device for obtaining text extraction model
  • Method and device for obtaining text extraction model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0025] figure 1 It is a schematic diagram of an implementation environment for obtaining a text extraction model provided by an embodiment of the present invention. see figure 1 , the implementation environment includes:

[0026] At least one server 101, at least one chat robot 102, at least one terminal 103 (eg, mobile terminal and desktop computer). Wherein, the server 101 is used to acquire the first text extraction model, if the extraction accuracy of the first text extraction model is lower than the preset threshold, then acquire the second training text set, and acquire the second text extraction model according to the acquired training text set , applying the acquired text extraction model to the chat robot 102 or the termina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method and apparatus for obtaining a text extraction model, which relate to the technical field of machine learning. The method comprises: obtaining a first text extraction model, the first text extraction model being obtained according to a manually-marked first training text collection; if the extraction accuracy of the first text extraction model is lower than a preset threshold, obtaining a second training text collection, the second training text collection comprising multiple first training corpora and multiple first target texts extracted from the multiple first training corpora by means of the first text extraction model; and obtaining a second text extraction model according to the first training text collection and the second training text collection. The second training text collection is obtained by means of the first text extraction model, so that the process of obtaining the text extraction model tends to be automated, and accordingly, labor costs and time costs are reduced.

Description

technical field [0001] The invention relates to the technical field of machine learning, in particular to a method and device for acquiring a text extraction model. Background technique [0002] Machine learning technology refers to the technology of improving the performance of computers by summarizing data such as text or pictures, and is widely used in data mining, computer vision, natural language processing and robotics. For example, in order to enable chatbots to understand the meaning of natural language and interact with users, machine learning techniques are usually used to obtain text extraction models, and the text extraction models are applied to chatbots, so that chatbots can learn from the user's corpus Extract the text that expresses the user's needs, and respond to the text. [0003] Generally, when obtaining a text extraction model, it is necessary to obtain a large amount of corpus, and manually mark the text expressing the user's needs from each corpus, a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/332G06F17/27
CPCG06F16/3329G06F40/279
Inventor 陈益
Owner TENCENT TECH (SHENZHEN) CO LTD