Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Method of Quick Sort Single Text Keyword

A quick sorting and keyword technology, applied in text database query, unstructured text data retrieval, electronic digital data processing, etc., can solve problems such as inappropriate iterative process and high time complexity

Active Publication Date: 2022-04-05
NANJING UNIV OF POSTS & TELECOMM
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In tasks that do not need to provide keyword vector values, but only need to provide keyword sequences, or even only need to provide keyword sets, this type of method requires high time complexity, so that the iterative process becomes less complicated. Be applicable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method of Quick Sort Single Text Keyword
  • A Method of Quick Sort Single Text Keyword
  • A Method of Quick Sort Single Text Keyword

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention.

[0024] For the existing power method, with ||P(t)-P(t-1)|| 1 ≤ε as a judgment condition is a quantitative numerical comparison of the value of the iterative vector itself, which can be called a quantitative analysis of the vector P(t) sequence, only when the sum of the absolute values ​​of the changes of each element in this sequence is less than a certain threshold ε , to determine the end of the iteration.

[0025] As for the vector P(t), the "size relationship" among the various elements is likely to be determined long before the quantitative analysis is qualified. The "size relationship" between elements, that is, the sequence stability of the vector will not change due ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for quickly sorting single-text keywords, which is characterized in that the method includes the following steps: S1: selecting a single text and converting the single text into a corresponding graph model structure, and then according to the graph model structure to generate the adjacency matrix of candidate words; S2: use the power method to iterate to generate the approximate value of the eigenvector corresponding to the eigenvalue of the adjacency matrix of the candidate words with a value of 1; S3: use qualitative analysis in step S2 to generate Perform qualitative analysis on the eigenvectors to generate a local sorting vector; S4: Set a judgment threshold, calculate the reverse sequence value between the sort vectors generated by two adjacent iterations, compare the reverse sequence value with the reverse sequence value corresponding to the previous iteration, Simultaneously compare the reverse sequence value of the last round of iterations with the size of the judgment threshold; the method of the invention can quickly converge the iterative process, can effectively reduce the time complexity of calculation, and has the characteristics of high extraction accuracy and high sorting accuracy.

Description

technical field [0001] The invention belongs to the field of natural language processing, mainly includes the process of extracting and sorting single texts, and in particular relates to a method for quickly sorting single text keywords. Background technique [0002] One of the goals of natural language processing tasks is to simplify and organize massive documents for people to find and research. Specifically, how to express a document with a paragraph, a sentence or even a few words, and how to quickly respond to the demand and give more accurate response content after people put forward the search demand. The basic task of this goal is the keyword extraction task of the document. After processing the document, the algorithm generates a keyword sequence, and the keywords in the sequence form a representation and dependency relationship with the corresponding document in turn. There are two types of data related to this task: A vector representation of keyword sort results...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284G06F16/332G06F16/33G06F16/338
CPCG06F40/284
Inventor 徐小龙柳林青孙雁飞李云李洋徐佳王俊昌朱洁
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products