Artificial intelligence-based multi-label classification method and system of multi-level text

An artificial intelligence and multi-label technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of unrealistic large-scale application, high collection cost, strong assumptions, etc., and achieve the effect of good scalability

Active Publication Date: 2018-05-25
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF3 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The problem faced by the first method is that the category labeling information of fine-grained text is very little, the collection cost is high, and large-scale application is unrealistic; the problem faced by the second method is that the assumptions in the traditional multi-instance learning method are too strong and cannot be well modeling actual data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Artificial intelligence-based multi-label classification method and system of multi-level text
  • Artificial intelligence-based multi-label classification method and system of multi-level text
  • Artificial intelligence-based multi-label classification method and system of multi-level text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail together with the accompanying drawings.

[0054] Suppose we have online review texts, each review text has a category label given by the user, and the category label is divided into two types: positive and negative. The following explains in detail how to use the multi-level text multi-label classification model and system of the present invention to extract sentences with favorable comments and negative comments in reviews.

[0055] 1. Build a multi-level text multi-label classification model

[0056] 1) Determine the text level: The set level includes document level, sentence level (text level to predict category) and word level.

[0057] 2) Determine text construction assumptions: document construction is based on sentences, using weighted combination assumptions; sentence construction is based on word...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an artificial intelligence-based multi-label classification method and system of multi-level text. The method includes: 1) utilizing a neural network to construct a multi-label classification model of the multi-level text, and obtaining text class prediction results of training text according to the model; 2) carrying out learning on parameters of the multi-label classification model of the multi-level text according to existing text class labeling information in the training text and the text class prediction results, which are of the training text and are obtained inthe step 1), to obtain a multi-label classification model of the multi-level text with determined parameters; and 3) utilizing the multi-label classification model of the multi-level text with the determined parameters to classify to-be-classified text. The method infers labels of the formed text simply through the document-level labeling information, and can be well applied to scenes where labels of formed text are difficult to collect; compared with traditional multi-instance learning (MIL) methods, the method of the invention introduces minimal assumptions, and can better fit actual data;and the method of the invention has good scalability.

Description

technical field [0001] The present invention relates to the fields of artificial intelligence, text classification, and content visualization, in particular to an artificial intelligence-based multi-level text multi-label classification method and system. Background technique [0002] The understanding and analysis of text content is the research goal of natural language processing. Most of the text content exists in the form of documents, and each document corresponds to a file. Typical file formats include TXT, HMTL, WORD, PDF, etc. At present, with the vigorous development of the Internet and mobile Internet, the number of documents to be analyzed has increased sharply. How to classify texts with different granularities (such as sentences, paragraphs, and documents) is of great significance to information discovery, information browsing, and analysis. For example, a large number of e-commerce websites provide user comments, such as "This juicer is very delicate for baby...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/353G06F16/355G06F18/2155G06F18/2431
Inventor 李鹏王斌郭莉梅钰
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products