Unlock instant, AI-driven research and patent intelligence for your innovation.

Text word weight calculation method and system, storage medium and terminal

A weight calculation and word technology, applied in the text word weight calculation method and system, storage medium and terminal field, can solve the problem of low importance, and achieve the effect of good flexibility and wide application scenarios

Active Publication Date: 2019-11-12
中智关爱通(上海)科技股份有限公司
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, the existing word weight calculation methods either assume that the greater the DF of all words in the text set, the higher their importance, or assume that the larger the DF of all words in the text set, the lower their importance, which cannot meet the simultaneous requirements of text Concentration, the larger the DF of some words, the lower the importance, and the larger the DF of other words, the higher the importance of the scene

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text word weight calculation method and system, storage medium and terminal
  • Text word weight calculation method and system, storage medium and terminal
  • Text word weight calculation method and system, storage medium and terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.

[0058] It should be noted that the diagrams provided in the following embodiments are only schematically illustrating the basic ideas of the present invention, and only the components related to the present invention are shown in the diagrams rather than the number, shape and shape of the compo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text word weight calculation method and system, a storage medium and a terminal. The method comprises the following steps of obtaining a text set and a keyword set; carrying out word segmentation processing on all texts of the text set to obtain a word vector T formed by a plurality of words and a corpus formed by texts consisting of words and separators; calculating the TF value of the jth word of the word vector T in the ith text of the corpus; calculating the DF value of the jth word of the word vector T relative to the corpus; calculating an IDF value of the jth word of the word vector T relative to the corpus based on the keyword set; combining the DF value and the IDF value of the jth word of the word vector T to obtain a DFIDF value; and calculating the weight of the jth word of the word vector T in the ith text of the corpus. According to the text word weight calculation method and system, the storage medium and the terminal, word weight calculation requirements under different application scenes can be met, and the flexibility is good.

Description

technical field [0001] The present invention relates to a weight calculation method, in particular to a text word weight calculation method and system, a storage medium and a terminal. Background technique [0002] In the field of text classification, there are two commonly used word weight calculation methods as follows: [0003] (1) TFIDF (Term Frequency-Inverse Document Frequency) is a commonly used weighting technique for information retrieval and data mining. TF means Term Frequency, and IDF means Inverse Document Frequency. The guiding ideology of TFIDF is based on such a basic assumption: a word that appears many times in one text will also appear many times in another similar text, and vice versa. Therefore, if the feature space coordinate system takes TF word frequency as a measure, it can reflect the characteristics of similar texts. In addition, the ability of words to distinguish different categories should also be considered. The TFIDF method believes that th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F17/27
CPCG06F16/3346G06F16/35
Inventor 温国华温艳鸿
Owner 中智关爱通(上海)科技股份有限公司