Text feature extraction method, device, equipment and readable storage medium

A text feature extraction technology, applied in the field of information processing, that addresses problems such as overfitting and the curse of dimensionality, yielding more domain-specific features and fewer redundant ones.

Status: Inactive. Publication Date: 2018-06-29
北京中关村科金技术有限公司
5 Cites, 6 Cited by

AI Technical Summary

Problems solved by technology

[0004] In existing approaches, overfitting and the curse of dimensionality often occur during feature extraction.

Examples

Detailed Description of the Embodiments

[0045] To help those skilled in the art better understand the solution of the present invention, the invention is described in further detail below with reference to the accompanying drawings and specific embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0046] Referring to Figure 1, which is a flowchart of a text feature extraction method in an embodiment of the present invention, the method includes the following steps:

[0047] S101, setting a target keyword set corresponding to the target field;

[0048] In this embodiment, when text features are to be extracted for the target field, a set of target keywords may be set for the target field. W...

Abstract

The invention discloses a text feature extraction method. The method comprises: setting a target keyword set corresponding to a target domain; obtaining an original article set corresponding to the target keyword set; preprocessing the articles in the original article set to obtain a target article set; performing word segmentation on each article in the target article set to obtain a vocabulary set; and calculating the information gain value of each word in the vocabulary set to determine a text feature set. The method can obtain domain-specific text features, facilitate the understanding and visualization of data, reduce computation and storage requirements, and the like. The invention further discloses a text feature extraction device, equipment and a readable storage medium, which have corresponding technical effects.
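
The final step of the abstract, scoring each word by information gain and keeping the best-scoring words as features, can be sketched as follows. This is a minimal illustration rather than the patent's implementation: it assumes the articles are already segmented into tokens, that each article carries a class label (for example, target domain versus other), and that a word's information gain is measured on its presence or absence in an article. The function names and the `top_k` cutoff are hypothetical.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (in bits) of a distribution given as raw counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def information_gain(docs, labels):
    """Return {word: information gain} for every word in the corpus.

    docs   -- list of token lists (assumed already word-segmented)
    labels -- class label of each document (assumed to be available)
    """
    n = len(docs)
    class_counts = Counter(labels)
    base = entropy(class_counts.values())

    # For each word, count the documents of each class that contain it.
    present = {}
    for tokens, label in zip(docs, labels):
        for word in set(tokens):
            present.setdefault(word, Counter())[label] += 1

    gains = {}
    for word, with_word in present.items():
        n_with = sum(with_word.values())
        n_without = n - n_with
        without_word = [class_counts[c] - with_word[c] for c in class_counts]
        # Conditional entropy of the class given presence/absence of the word.
        conditional = (n_with / n) * entropy(with_word.values()) \
            + (n_without / n) * entropy(without_word)
        gains[word] = base - conditional
    return gains


def select_features(docs, labels, top_k):
    """Keep the top_k words by information gain (ties broken alphabetically)."""
    gains = information_gain(docs, labels)
    return sorted(gains, key=lambda w: (-gains[w], w))[:top_k]
```

A word that appears only in articles of one class scores high, while a word spread evenly across classes scores near zero, which is how redundant features get filtered out. How the patent assigns classes or chooses the cutoff is not stated in the visible text, so both are left as parameters here.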

Description

Technical field

[0001] The present invention relates to the technical field of information processing, in particular to a text feature extraction method, device, equipment and readable storage medium.

Background technique

[0002] With the rapid development of artificial intelligence technology, the era of robots has arrived. In machine learning, feature extraction is an important problem in feature engineering.

[0003] In practical applications, data and features determine the upper limit of machine learning, while models and algorithms only approach this upper limit. It can be seen that feature engineering, especially feature selection, occupies a very important position in machine learning. The reason why feature selection needs attention is that with the development of science and technology, tens of thousands of feature variables can be collected in many fields, but the sample size that can be used as a training set is often far smaller than the number of features. ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22, G06F17/27
CPC: G06F40/194, G06F40/232, G06F40/284
Inventor: 李界鹏, 王能
Owner: 北京中关村科金技术有限公司