Unlock instant, AI-driven research and patent intelligence for your innovation.

Markov matrix off-line correction method of text keywords

A Markov matrix, keyword technology, applied in special data processing applications, instruments, electrical digital data processing and other directions, can solve the problems of user extraction and analysis, the accuracy of keyword extraction and user satisfaction, etc. The effect of search efficiency

Inactive Publication Date: 2013-10-02
SHANGHAI UNIV
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the above method extracts domain keywords, it does not extract and analyze the user's historical records. Therefore, the accuracy of keyword extraction and user satisfaction are not high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Markov matrix off-line correction method of text keywords
  • Markov matrix off-line correction method of text keywords
  • Markov matrix off-line correction method of text keywords

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0042] Such as figure 1 As shown, a Markov matrix offline correction method for text keywords, this method extracts each keyword by analyzing the user's historical records, each intersection keyword is represented by a Markov matrix, and the correction word of each keyword is established According to the selection rules, the correction word is selected to correct the keyword entered by the user next time. The operation steps are as follows:

[0043] (1), each text that the user searches and downloads each time is recorded as the historical text collection that the user searches, and is recorded as M;

[0044] (2), extract the set of keywords with intersection in the historical text set searched by the user, the detailed steps are as follows:

[0045] (2-1), obtain all the texts M in the historical text collection searched by the user;

[00...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Markov matrix off-line correction method of text keywords. The Markov matrix off-line correction method comprises the following steps that (1) each text searched and loaded by users in each time is marked as a historical text set of the user searching; (2) the keyword sets with the intersection in the historical text set of the user searching are extracted; (3) the extracted keywords with the intersection in the historical text set are shown by adopting the Markov matrix; (4) a correction word selecting ruler is built, and correction words are selected from keywords with the intersection according to the correction word selecting rule; and (5) when the users input new keywords and carry out next new searching, the corresponding correction words are found, the correction is carried out, and correction results are returned. The method has the advantages that sources of the extracted keywords are historical records of the users and self behavior records of the users, the Markov matrix showing is adopted, the domain knowledge structure can be accurately analyzed and is corrected, and the user searching efficiency is effectively improved.

Description

technical field [0001] The invention relates to a method for automatically extracting text keywords by a computer and providing off-line correction for user input, more specifically, relates to a Markov matrix off-line correction method for text keywords. Background technique [0002] A "method for extracting text keywords" is also disclosed in the Chinese patent specification (patent application number: 200710041150. 7), which points out that "on the basis of extracting text keywords by the TF-IDF method, the text frequency correction method Extract keywords from a single text to improve the accuracy of keyword extraction from a single text; extract common field keywords in similar text collections by word frequency correction or comparison selection method, this method can avoid a keyword Frequent occurrences in documents lead to high absolute word frequency and are counted as domain keywords. It can effectively improve the keyword extraction accuracy of a single text, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 陈雪高英虎汤文清
Owner SHANGHAI UNIV