A text classification method and text classification device

A text classification and text technology, applied in the field of text processing, can solve the problems of limited space for algorithm improvement, sparse data, difficult to achieve domain migration, etc., and achieve the effect of good domain adaptability
CN108304468BActive Publication Date: 2021-12-07CHINA UNIONPAY

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
CHINA UNIONPAY
Publication Date
2021-12-07

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to a text classification method and a text classification device. The method comprises the following steps: an NLP preprocessing step, which analyzes the user dialogue text with a natural language processing method, and obtains a word set and semantic annotation results about the user dialogue text; a multidimensional feature selection step, for the word set and Semantic annotation results are combined according to various rules to obtain a vectorized representation of the semantic information contained in the user dialogue text; and a classification step is to calculate a probability estimation value for the user dialogue classification obtained in the multi-dimensional feature selection step. According to the text classification method and text classification system of the present invention, the advantages of statistics and deep learning methods can be integrated, and a text classification solution oriented to customer needs can be realized through multi-dimensional feature selection.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to text processing technology, in particular to a text classification method and a text classification device. Background technique

[0002] At present, the implementation schemes of text classification technology are mainly divided into statistical learning methods and deep learning methods. The former is mainly based on the feature selection method. The word and sentence-level features of the text are selected through indicators such as TF-IDF, PMI, and chi-square value, and the feature vector representing the text is obtained, and the feature vector is obtained by machine learning. The probability of each label is used as the final classification standard; the latter is mainly based on model construction, using the discrete information of the text as input, through the serial and parallel structure of the multi-layer neural network, supplemented by the back propagation algorithm to update the network weight , directly get the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More