Unlock instant, AI-driven research and patent intelligence for your innovation.

A Markov matrix offline correction method for text keywords

A Markov matrix and keyword technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of user extraction and analysis, extraction keyword accuracy and low user satisfaction, and improve The effect of search efficiency

Inactive Publication Date: 2016-04-27
SHANGHAI UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the above method extracts domain keywords, it does not extract and analyze the user's historical records. Therefore, the accuracy of keyword extraction and user satisfaction are not high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Markov matrix offline correction method for text keywords
  • A Markov matrix offline correction method for text keywords
  • A Markov matrix offline correction method for text keywords

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0042] Such as figure 1 As shown, a Markov matrix offline correction method for text keywords, this method extracts each keyword by analyzing the user's historical records, each intersection keyword is represented by a Markov matrix, and the correction word of each keyword is established According to the selection rules, the correction word is selected to correct the keyword entered by the user next time. The operation steps are as follows:

[0043] (1), each text that the user searches and downloads each time is recorded as the historical text collection that the user searches, and is recorded as M;

[0044] (2), extract the set of keywords with intersection in the historical text set searched by the user, the detailed steps are as follows:

[0045] (2-1), obtain all the texts M in the historical text collection searched by the user;

[004...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Markov matrix off-line correction method of text keywords. The Markov matrix off-line correction method comprises the following steps that (1) each text searched and loaded by users in each time is marked as a historical text set of the user searching; (2) the keyword sets with the intersection in the historical text set of the user searching are extracted; (3) the extracted keywords with the intersection in the historical text set are shown by adopting the Markov matrix; (4) a correction word selecting ruler is built, and correction words are selected from keywords with the intersection according to the correction word selecting rule; and (5) when the users input new keywords and carry out next new searching, the corresponding correction words are found, the correction is carried out, and correction results are returned. The method has the advantages that sources of the extracted keywords are historical records of the users and self behavior records of the users, the Markov matrix showing is adopted, the domain knowledge structure can be accurately analyzed and is corrected, and the user searching efficiency is effectively improved.

Description

technical field [0001] The invention relates to a method for automatically extracting text keywords by a computer and providing off-line correction for user input, more specifically, relates to a Markov matrix off-line correction method for text keywords. Background technique [0002] A "text key word extraction method" (patent application number: 200710041150.7) is also disclosed in the Chinese patent specification, which points out that "on the basis of text key words extracted by the TF-IDF method, text frequency correction method is used to extract single Keywords in a single text, improve the accuracy of keyword extraction from a single text; use the word frequency correction method or comparative selection method to extract common field keywords in similar text collections", this method can avoid a keyword in a document Frequent occurrences lead to a high absolute word frequency and are counted as domain keywords. It can effectively improve the keyword extraction accu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 陈雪高英虎汤文清
Owner SHANGHAI UNIV