A Data Mining Method

A data mining method, applied in the fields of network data indexing, network data retrieval, and other database retrieval. It addresses highly repetitive search results, missing information, and redundant information, with the effects of reducing search time, resolving information duplication, and improving processing efficiency.

Status: Inactive
Publication date: 2018-10-12
ANHUI HUAZHEN INFORMATION SCI & TECH
Cites: 5 | Cited by: 0

AI Technical Summary

Problems solved by technology

[0002] Society has entered an era of high-speed information dissemination. While this brings convenience, it also creates problems: the results of existing search engines are highly repetitive and contain much redundant information that does not meet expectations, and searches are slow and inefficient.
[0003] Because content on the Internet is reprinted at a high rate, search engines such as Baidu and Google take a long time to search in pursuit of recall, and their results are highly repetitive, which makes it hard for users to find valuable content quickly.
In addition, some industry search engines target only industry websites, which improves search efficiency, but their recall rate is low and omissions are likely.
[0004] Commercial competitiveness depends largely on how well an enterprise has mastered the latest information; in other words, how an enterprise updates and analyzes industry information determines its potential. Enterprises often cannot afford the cost of running their own information searches. Meanwhile, enterprise-customized search engines often search only industry websites and do not cover the entire Internet, which is likely to cause information omissions.

Examples

Detailed Description of Embodiments

[0027] Referring to Figure 1, the data mining method proposed by the present invention mines data through fixed-point collection and automatic discovery, and performs unified analysis and storage of the mined data. Industry websites include well-known sites, forums, blogs, and the like. Fixed-point collection focuses on these important websites, which keeps track of industry trends while reducing the time spent searching for websites. Automatic discovery supplements fixed-point collection: it gathers data from other, less well-known websites so that target data is not missed. Unified data analysis effectively removes duplicate information, which solves the problem of frequently reprinted and duplicated network data.
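One way the duplicate-removal part of unified data analysis could work is sketched below. This is an illustration only: the text does not say how duplicates are detected, so the normalization rule, the use of a SHA-256 fingerprint, and the names `fingerprint` and `remove_duplicates` are all assumptions. The sketch catches only reprints that are identical after normalization, not paraphrased copies.

```python
import hashlib
import re


def fingerprint(text: str) -> str:
    """Hash the text after lowercasing and collapsing punctuation and whitespace.

    The normalization rule is an assumption, not taken from the patent.
    """
    normalized = re.sub(r"[\W_]+", " ", text.lower()).strip()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def remove_duplicates(documents):
    """Keep the first document seen for each fingerprint, preserving order."""
    seen = set()
    unique = []
    for doc in documents:
        key = fingerprint(doc["text"])
        if key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique


# Hypothetical mined pages: the second is a reprint of the first.
docs = [
    {"url": "https://example.com/a", "text": "New  product launched today!"},
    {"url": "https://example.org/reprint", "text": "new product launched today"},
    {"url": "https://example.net/b", "text": "Quarterly results announced."},
]
unique_docs = remove_duplicates(docs)
print([d["url"] for d in unique_docs])
```

Keeping the first copy seen is an arbitrary choice; a real system might instead keep the copy from the source with the highest credibility weight.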

[0028] Referring to Figure 2, fixed-point collection includes the following steps:

[0029] Preset websites in the industry as data sources, and set credibility weights f...

Abstract

The invention discloses a data mining method that addresses the high repetition and redundancy of network information. The method mines data quickly, achieves a good recall ratio, and produces good results. It comprises three steps: fixed-point collection, automatic discovery, and data analysis and storage. In fixed-point collection, industry websites are preset as data sources, a reliability weight is set for each data source, a data collection mode is set according to the data source, and data is mined from the data sources regularly or irregularly. In automatic discovery, a network probe automatically finds websites with high similarity; these websites are used as collection-point websites and added to a collection-point website base, a reliability weight is set for each collection-point website, a data extraction mode is set according to the collection-point website, and data is mined from the data sources regularly or irregularly. In data analysis and storage, the mined data is encoded in a unified format, duplicate information is removed, the data is screened, cluster analysis is performed on the screened data, the amount of information on each topic is calculated, a topic attention weight is labeled, the data is stored, and indexes are established.
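The automatic discovery step can be illustrated with a minimal sketch. The abstract does not define how similarity is measured or what it is measured against, so the sketch assumes Jaccard similarity between a candidate site's terms and a term profile of the industry. The class names, the `threshold` and `default_credibility` parameters, the 0 to 1 weight scale, and the `mode` label are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class CollectionPoint:
    url: str
    credibility: float  # reliability weight; the 0..1 scale is an assumption
    mode: str = "html"  # hypothetical label for the data extraction mode


@dataclass
class CollectionPointBase:
    points: dict = field(default_factory=dict)

    def add(self, point: CollectionPoint) -> None:
        # Do not overwrite a site that is already registered.
        self.points.setdefault(point.url, point)


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def discover(base, industry_terms, candidates, threshold=0.3, default_credibility=0.5):
    """Register candidates whose term overlap with the industry profile meets the threshold."""
    added = []
    for url, terms in candidates.items():
        if url in base.points:
            continue  # already a collection point
        if jaccard(industry_terms, terms) >= threshold:
            base.add(CollectionPoint(url, default_credibility))
            added.append(url)
    return added


# Hypothetical data: one preset source and two sites found by the probe.
base = CollectionPointBase()
base.add(CollectionPoint("https://industry.example/news", credibility=0.9))
industry_terms = {"sensor", "automation", "robotics", "plc"}
candidates = {
    "https://blog.example/robots": {"robotics", "automation", "sensor", "review"},
    "https://cooking.example": {"recipe", "oven", "sensor"},
    "https://industry.example/news": {"sensor", "automation"},
}
added = discover(base, industry_terms, candidates)
print(added)
```

In this sketch a newly discovered site starts with a lower credibility weight than the preset source, on the assumption that preset industry websites are more trusted; the abstract only says that a weight is set for each collection-point website.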

Description

Technical field

[0001] The invention relates to the technical field of data mining, and in particular to a data mining method.

Background

[0002] Society has entered an era of high-speed information dissemination. While this brings convenience, it also creates problems: the results of existing search engines are highly repetitive and contain much redundant information that does not meet expectations, and searches are slow and inefficient.

[0003] Because content on the Internet is currently reprinted at a high rate, search engines such as Baidu and Google take a long time to search in pursuit of recall, and their results are highly repetitive, which makes it hard for users to find valuable content quickly. In addition, some industry search engines target only industry websites, which improves search efficiency, but their recall rate is low and omissions are likely.

[0004] The current co...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
CPC: G06F16/951
Inventor: 贾岩
Owner: ANHUI HUAZHEN INFORMATION SCI & TECH