Method for text classification using diverse text features

A text classification technique based on diverse sample features, applied in special data processing applications, instruments, electronic digital data processing, etc., that addresses problems such as the inability of existing text representations to mine the internal structure of texts well
CN108664633A · Active · Publication Date: 2018-10-16 · NANJING UNIV

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
NANJING UNIV
Publication Date
2018-10-16


Abstract

The invention discloses a method for text classification using diverse text features. The method comprises: generating multiple sets of different text feature representations using a multi-dimensional text representation algorithm, which yields a multi-dimensional text feature representation longitudinally; generating multiple sets of different text feature representations using several different text representation algorithms, which yields a multi-dimensional text feature representation transversely; and combining the different feature representation vectors of each sample into a new feature vector for that sample, thereby obtaining a new feature representation of the data set. The method improves on existing text representation algorithms by using more text representations that have lower dimensions and larger mutual differences to mine different internal structures of texts. This enhances text representation ability, improves performance on tasks such as text categorization, and greatly reduces the dimension of the text features.
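The combination step described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the abstract does not name the representation algorithms, so the sketch assumes two stand-ins, a hashed bag-of-words and a hashed character trigram count. It also assumes one possible reading of "longitudinally" (the same algorithm run at several small dimensions) and "transversely" (a different algorithm added alongside). The function names `hashed_bow`, `hashed_char_ngrams`, and `diverse_features`, and all dimension values, are hypothetical.

```python
import zlib


def hashed_bow(text, dim, seed=0):
    """Stand-in algorithm A: hashed bag-of-words with `dim` buckets."""
    vec = [0.0] * dim
    for token in text.lower().split():
        # crc32 is deterministic across runs, unlike the built-in hash() for str.
        bucket = zlib.crc32(f"{seed}:{token}".encode("utf-8")) % dim
        vec[bucket] += 1.0
    return vec


def hashed_char_ngrams(text, dim, n=3):
    """Stand-in algorithm B: hashed character n-gram counts with `dim` buckets."""
    vec = [0.0] * dim
    s = text.lower()
    for i in range(len(s) - n + 1):
        bucket = zlib.crc32(s[i:i + n].encode("utf-8")) % dim
        vec[bucket] += 1.0
    return vec


def diverse_features(text, bow_dims=(4, 8), ngram_dim=8):
    """Concatenate several low-dimensional representations of one sample."""
    # Assumed "longitudinal" step: one algorithm, several dimensions
    # (a different seed per dimension so the views differ from each other).
    parts = [hashed_bow(text, d, seed=d) for d in bow_dims]
    # Assumed "transverse" step: a different representation algorithm.
    parts.append(hashed_char_ngrams(text, ngram_dim))
    # Combination step: join all views into one new feature vector.
    return [x for part in parts for x in part]


docs = ["the cat sat", "stock prices fell sharply today"]
features = [diverse_features(d) for d in docs]
```

Each row of `features` has 4 + 8 + 8 = 20 entries and could be passed to any standard classifier. In practice the stand-in algorithms would be replaced by whatever text representation methods the full description specifies.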

Description

Technical field

[0001] The invention belongs to the field of text representation, and in particular relates to a method for classifying text by using diversified text features.

Background art

[0002] In recent years, with the rapid development of computer technology and the Internet, society has entered the information age. Massive amounts of data, especially text data of various kinds, contain important information and great value. Sorting and summarizing this text data sensibly makes it easier to put such large-scale text collections to use, and text classification is a very effective way of doing so.

[0003] Text classification has always been a very important basic research direction in the field of machine learning and artificial intelligence, and it is also widely used in the industry. The effectiveness of text classification depends largely on the quality of text feature representation. Plain text that can be read by humans cannot be directly recognized and utilized by mach...

Claims
