Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Label automatic generating method and system, computer readable storage medium and equipment

An automatic generation and labeling technology, applied in computing, natural language data processing, special data processing applications, etc., can solve the problems of lack of manual labeling, few labels, no labels, etc.

Active Publication Date: 2018-12-07
SHANGHAI ADVANCED RES INST CHINESE ACADEMY OF SCI
View PDF13 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] In view of the shortcomings of the prior art described above, the purpose of the present invention is to provide a method, system, computer-readable storage medium and equipment for automatically generating tags, which are used to solve the problem of no tags and few tags for text data such as Internet online content in the prior art. , Manual labeling lacks a unified standard, and different users may label similar texts as different labels

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Label automatic generating method and system, computer readable storage medium and equipment
  • Label automatic generating method and system, computer readable storage medium and equipment
  • Label automatic generating method and system, computer readable storage medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] This embodiment provides a method for automatically generating labels, including:

[0046] Create an initial label set for the training text with labels and the text to be generated with labels;

[0047] Mining the training text and the label to be generated text;

[0048] Train a label discriminative model;

[0049] According to the label discriminant model, a text label corresponding to the text to be generated for the label is searched for.

[0050] The method for automatically generating labels provided by this embodiment will be described in detail below with reference to illustrations. The label automatic generation method described in this embodiment is used to realize more accurate label annotation of text data, and help users obtain desired information more accurately and efficiently.

[0051] see figure 1 , is shown as a schematic flowchart of an automatic label generation method in an embodiment. Such as figure 1 As shown, the label automatic generation...

Embodiment 2

[0092] This embodiment provides an automatic label generation system, including:

[0093] Create a module for creating an initial label set for the text to be generated for the label;

[0094] Mining module, used for mining tags to be generated text;

[0095] A training module, for training a label discriminant model;

[0096] The label generation module is used to find the text label corresponding to the text to be generated according to the label discrimination model.

[0097] The label automatic generation system provided by this embodiment will be described in detail below with reference to figures. It should be noted that it should be understood that the following division of each module of the automatic label generation system is only a division of logical functions, and may be fully or partially integrated into a physical entity or physically separated during actual implementation. And these modules can all be implemented in the form of calling software through proce...

Embodiment 3

[0122] This embodiment provides a device, the device includes: a processor, a memory, a transceiver, a communication interface, and a system bus; the memory and the communication interface are connected to the processor and the transceiver through the system bus and complete mutual communication, and the memory uses The computer program is stored, the communication interface is used to communicate with other devices, the processor and the transceiver are used to run the computer program, so that the device executes the steps of the above automatic label generation method.

[0123] The system bus mentioned above may be a Peripheral Component Interconnect (PCI for short) bus or an Extended Industry Standard Architecture (EISA for short) bus or the like. The system bus can be divided into address bus, data bus, control bus and so on. For ease of representation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus. The comm...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a label automatic generating method and system, a computer readable storage medium and equipment. The label automatic generating method comprises the steps of establishing an initial label set aiming at a training text with a label and a text with a to-be-generated label; performing mining on the training text with the label and the text with the to-be-generated label; training a label judging model; and according to the label judging model, searching a text label corresponding to the text with the to-be-generated label. According to the invention, the text analysis technology, machine learning and the deep learning algorithm are adopted, and information mining is carried out on the text data to be labeled on the basis of the original label set constructed by the multiple methods; based on the text topic analysis method, the distribution situation of words in the text is combined, so that similarity calculation of the text label theme of the multi-model fusion isrealized, the problems that text data such as internet online content are not labeled, and the labels are few are solved, and the problems that manual labeling lacks a unified standard, and differentusers can mark similar texts as different labels can be solved. Finally, a user can obtain expected information more accurately and more efficiently.

Description

technical field [0001] The invention belongs to the technical fields of natural oracle processing, text analysis, machine learning, and deep learning, and relates to a generation method and system, in particular to an automatic label generation method, system, computer-readable storage medium, and equipment. Background technique [0002] Crawler technology is a program of "automated web browsing", which automatically grabs the information needed by users on the World Wide Web according to certain rules. With the development of the Internet, the network has become the carrier of a large amount of information. Crawler technology has also become an important part of data collection and is the most basic step in big data analysis. [0003] Text analysis technology refers to the representation of text and the selection of feature items, which is a basic problem in text mining and information retrieval. It converts the unstructured original text into structured information that ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
CPCG06F40/216
Inventor 李梅于景洋王煜宁德军
Owner SHANGHAI ADVANCED RES INST CHINESE ACADEMY OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products