Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for quickly sorting single text keywords

A technology of quick sorting and keywords, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems such as the iterative process is not so applicable, and the time complexity is high.

Active Publication Date: 2018-11-06
NANJING UNIV OF POSTS & TELECOMM
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In tasks that do not need to provide keyword vector values, but only need to provide keyword sequences, or even only need to provide keyword sets, this type of method requires high time complexity, so that the iterative process becomes less complicated. Be applicable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for quickly sorting single text keywords
  • Method for quickly sorting single text keywords

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention.

[0024] For the existing power method, with ||P(t)-P(t-1)|| 1 ≤ε as a judgment condition is a quantitative numerical comparison of the value of the iterative vector itself, which can be called a quantitative analysis of the vector P(t) sequence, only when the sum of the absolute values ​​of the changes of each element in this sequence is less than a certain threshold ε , to determine the end of the iteration.

[0025] As for the vector P(t), the "size relationship" among the various elements is likely to be determined long before the quantitative analysis is qualified. The "size relationship" between elements, that is, the sequence stability of the vector will not change due ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a method for quickly sorting single text keywords. The method is characterized in comprising the following steps: S1: selecting a single text and converting the singletext into a corresponding graph model structure, and then according to the graph model structure, generating a candidate word adjacency matrix; S2: by using the power method iteration, generating an approximate value of a feature vector corresponding to a feature value when the value of the candidate word adjacency matrix is 1; S3: using qualitative analysis in step S2, performing qualitative analysis on the feature vector generated in each power method iteration, and generating a local sorting vector; and S4: setting a judgment threshold, calculating an inverse order value between the sortingvectors generated in the two successive iterations, and comparing the magnitude of the reverse order value with the magnitude of the reverse order value corresponding to the previous two successive iterations, and comparing the magnitude of the reverse order value of the previous two successive iterations with the magnitude of the judgment threshold. According to the method disclosed by the present invention, the iterative process can converge quickly, the time complexity of the calculation can be effectively reduced, and the method has the characteristics of high extraction precision and high sorting correctness.

Description

technical field [0001] The invention belongs to the field of natural language processing, mainly includes the process of extracting and sorting single texts, and in particular relates to a method for quickly sorting single text keywords. Background technique [0002] One of the goals of natural language processing tasks is to simplify and organize massive documents for people to find and research. Specifically, how to express a document with a paragraph, a sentence or even a few words, and how to quickly respond to the demand and give more accurate response content after people put forward the search demand. The basic task of this goal is the keyword extraction task of the document. After processing the document, the algorithm generates a keyword sequence, and the keywords in the sequence form a representation and dependency relationship with the corresponding document in turn. There are two types of data related to this task: A vector representation of keyword sort results...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/284
Inventor 徐小龙柳林青孙雁飞李云李洋徐佳王俊昌朱洁
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products