Science subject extraction method based on multi-view learning

A scientific topic extraction method using multi-view learning, applied in the fields of special data processing applications, instruments, and electrical digital data processing, that requires only simple data preprocessing.

Active Publication Date: 2014-01-22
ZHEJIANG UNIV

AI Technical Summary

Problems solved by technology

[0004] To overcome the shortcoming of existing scientific topic extraction methods, which consider only one aspect of the data in a paper and ignore other potentially useful data, the present invention proposes a scientific t...


Examples

Embodiment Construction

[0034] The present invention is further described below with reference to the accompanying drawings:

[0035] A method for scientific topic extraction based on multi-view learning:

[0036] 1. The method comprises the following steps:

[0037] 1) Obtain paper data from a paper database to serve as the target documents for scientific topic extraction;

[0038] 2) For each target document, extract data from multiple views of the document as the basis for scientific topic extraction;

[0039] 3) Apply simple data preprocessing to the data of each view, according to the content characteristics of that view;

[0040] 4) For each view, represent the data of all target documents as a data matrix in which each target document's data is one row vector;

[0041] 5) Using a multi-view learning method and the data from the multiple views, the target documents are clustered, a...
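Steps 2) to 5) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration: the toy documents, the two view names ("text" and "authors"), the tokenizer, and above all the clustering step, which simply concatenates the normalised view matrices and runs plain k-means. That naive fusion is a stand-in and is not the multi-view learning method of the invention, whose details are not given in the text above.

```python
import math
import re
from collections import Counter

# Hypothetical toy documents; the view names "text" and "authors" are assumptions.
docs = [
    {"text": "Multi-view clustering of documents", "authors": "wang chen"},
    {"text": "Spectral clustering for document views", "authors": "wang bu"},
    {"text": "Protein folding with molecular dynamics", "authors": "li zhao"},
    {"text": "Molecular dynamics and protein structures", "authors": "li sun"},
]

def tokenize(s):
    # Step 3 (simple preprocessing): lowercase, keep alphabetic tokens only.
    return re.findall(r"[a-z]+", s.lower())

def view_matrix(docs, view):
    # Step 4: one L2-normalised row vector of term counts per document.
    vocab = sorted({t for d in docs for t in tokenize(d[view])})
    rows = []
    for d in docs:
        counts = Counter(tokenize(d[view]))
        row = [float(counts[t]) for t in vocab]
        norm = math.sqrt(sum(x * x for x in row)) or 1.0
        rows.append([x / norm for x in row])
    return vocab, rows

def sqdist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(rows, k, iters=20):
    # Plain k-means with deterministic farthest-point initialisation.
    centers = [rows[0]]
    while len(centers) < k:
        centers.append(max(rows, key=lambda r: min(sqdist(r, c) for c in centers)))
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: sqdist(r, centers[j])) for r in rows]
        for j in range(k):
            members = [r for r, l in zip(rows, labels) if l == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# One data matrix per view (step 4).
matrices = [view_matrix(docs, v)[1] for v in ("text", "authors")]

# Naive stand-in for step 5: concatenate each document's rows across views.
fused = [sum(parts, []) for parts in zip(*matrices)]
labels = kmeans(fused, k=2)
print(labels)
```

In this toy data the two clustering papers share a term and an author, as do the two protein papers, so each pair should land in its own cluster. A real multi-view method would combine the views in a more principled way than concatenation.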


Abstract

The invention provides a scientific topic extraction method based on multi-view learning. The method includes the following steps: paper data are obtained from a paper database to serve as the target documents from which scientific topics are to be extracted; data from multiple views of the target documents are extracted to serve as the basis for scientific topic extraction; simple data preprocessing is carried out on the data of each view; for each view, the data of all target documents are expressed as a data matrix, in which the data of each target document form one row vector; the target documents are clustered by means of multi-view learning, so that target documents in the same cluster correspond to the same scientific topic; and the scientific topic of each cluster is extracted and expressed as a set of keywords. The method compensates for the shortcoming of traditional methods, which consider only one aspect of the data: it makes good use of data from multiple aspects and obtains better scientific topic extraction results by exploiting the complementary relationships among the views and their consistency with respect to the latent topics to assist clustering.
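The final step, expressing each cluster's topic as several keywords, could look roughly like the following Python sketch. The scoring rule (a term's count within a cluster, weighted down when the term also occurs in other clusters) is an assumption chosen for illustration; the text does not say how the invention selects keywords. The function name `cluster_keywords`, the toy token lists, and the labels are likewise hypothetical.

```python
import math
from collections import Counter

def cluster_keywords(token_lists, labels, top_n=3):
    # Aggregate term counts over all documents in each cluster.
    per_cluster = {}
    for tokens, label in zip(token_lists, labels):
        per_cluster.setdefault(label, Counter()).update(tokens)
    n_clusters = len(per_cluster)
    # Number of clusters in which each term occurs.
    cluster_freq = Counter(t for counts in per_cluster.values() for t in counts)
    keywords = {}
    for label, counts in per_cluster.items():
        # Assumed scoring: in-cluster count times a smoothed inverse cluster frequency.
        scores = {
            t: c * math.log((1 + n_clusters) / cluster_freq[t])
            for t, c in counts.items()
        }
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords[label] = ranked[:top_n]
    return keywords

# Hypothetical preprocessed documents and cluster labels.
token_lists = [
    ["multi", "view", "clustering", "topic"],
    ["clustering", "topic", "model"],
    ["protein", "folding", "model"],
    ["protein", "dynamics"],
]
labels = [0, 0, 1, 1]
keywords = cluster_keywords(token_lists, labels, top_n=2)
print(keywords)
```

Terms that appear in every cluster, such as "model" here, receive a low weight and so tend not to be chosen as topic keywords.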

Description

Technical field

[0001] The invention relates to the technical fields of text clustering and scientific topic extraction, and in particular to text clustering and topic extraction based on multi-view learning.

Background

[0002] Every article has its own specific topic, and this is especially true of academic papers. Before conducting research and writing papers, scholars and researchers need to survey the existing scientific topics, and when reading an article they first want to know its topic. Experienced researchers often have a clear understanding of the scientific topics in their field: they can find papers related to their research, clarify the relationships between papers, and predict the popularity and development trends of particular scientific topics. This information plays a vital role in the research work of scholars and in the development of the entire research field. With the rapid development of the Internet, information has begun to explode, a...

Application Information

IPC(8): G06F17/30
CPC: G06F16/93
Inventor: 王灿, 王哲, 卜佳俊, 陈纯, 于智
Owner ZHEJIANG UNIV