Text key word extracting method

An extraction method and keyword technology, applied in the field of computer automatic text keyword extraction, can solve the problem of insufficient keyword accuracy

Inactive Publication Date: 2007-11-07
SHANGHAI UNIV
View PDF0 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The accuracy of keyword extraction by TF-IDF method is not high enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text key word extracting method
  • Text key word extracting method
  • Text key word extracting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] A preferred embodiment of the present invention is described in detail as follows in conjunction with accompanying drawing:

[0043] The existing keyword extraction method is to calculate the weight of meaningful content words (verbs, nouns) in a single text in the text through the TF-IDF formula, and filter the keywords of a single text by sorting the weights in descending order.

[0044] In the TF-IDF formula, the absolute word frequency is the frequency with which words appear in the text. The relative word frequency is the normalized word frequency (that is, the weight of the word), and its calculation method is the TF-IDF formula, that is

[0045] W ( t , d → ) = tf ( t , d → ) ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This invention relates to an improved TF-IDF pick-up method for text key words, which picks up key words of one text by a text frequency modification method to increase accuracy for picking up key words from a single text and picks up key words of common fields in a set of texts of a same kind by a word frequency modification method or a comparison selection method.

Description

Technical field: [0001] The invention relates to a method for automatically extracting text keywords by a computer, and more specifically, relates to several improved methods for extracting text keywords from TF-IDF formulas. Background technique: [0002] One of the basic units of text knowledge acquisition and representation is text keywords. The accuracy of text keyword automatic acquisition directly affects the performance of text knowledge acquisition and the quality of text ontology establishment. [0003] The class keywords co-occurring in multiple texts belonging to a field present the lowest level knowledge of the texts in this field, and are one of the basic units for the representation and acquisition of textual knowledge in this field. The accuracy of automatic acquisition of keywords in text domain directly affects the performance of text domain knowledge acquisition and the effect of domain knowledge ontology establishment, thus affecting the quality and effec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 方宁骆祥峰徐炜民
Owner SHANGHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products