Unlock instant, AI-driven research and patent intelligence for your innovation.

Feature extraction method and device

A technology for feature extraction and extraction unit, which is applied in the field of feature extraction methods and devices, and can solve problems such as reducing the accuracy of feature extraction.

Inactive Publication Date: 2017-05-24
NEUSOFT CORP
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current text feature extraction method is based on word frequency, that is, it is extracted according to the frequency of words appearing in the text. This extraction method only considers the importance of a single word in the text, thereby reducing the accuracy of feature extraction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature extraction method and device
  • Feature extraction method and device
  • Feature extraction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0047] see figure 1 , which shows a flow chart of the feature extraction method provided by the embodiment of the present invention, which is used to extract words related to the target topic as features of the text to be processed, so as to improve the accuracy of feature extraction. Specifically, the feature extraction method provided by t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a feature extraction method and device. According to the method, all words can be extracted from a to-be-processed text, at least one target subject is selected from all subjects contained in the to-be-processed text, the relevancy between each word and the to-be-processed text is obtained according to the relevancy between each word and each target subject, and then at least one word is selected from all the words to serve as features of the to-be-processed text according to the relevancy between each word and the to-be-processed text. For instance, the words, having the relevancy with the to-be-processed text higher than the relevancy with the to-be-processed text of the other words, in a preset number are selected to serve as the features of the to-be-processed text according to the relevancy between each word and the to-be-processed text, so that the selected features are relevant to main content of the to-be-processed text, that is, when the features of the to-be-processed text are extracted, the significance of the words as well as the relevancy between the words and the main content of the to-be-processed text are considered, therefore, the words irrelevant with the main content are filtered out of the extracted words, and feature extraction accuracy is improved.

Description

technical field [0001] The invention belongs to the technical field of text mining, and more specifically, relates to a feature extraction method and device. Background technique [0002] With the increasing popularity of the Internet, text information expands rapidly. For example, hundreds of thousands of web pages are updated every day on the Internet (network), and millions of new web pages are added, making the information on the Internet rich and complex. How to effectively organize and manage these information, and quickly, accurately, and comprehensively mine the information needed by users from a large number of text information is a major challenge in the field of text mining. [0003] In the field of text mining, text feature extraction is a key link in the field of text mining, and words, as the understanding unit of natural language, will be extracted as text features. The current text feature extraction method is based on word frequency, that is, it is extract...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/284
Inventor 董超
Owner NEUSOFT CORP