Big data-based public information association method and mining engine

A big data technology for associating and mining public information, applicable to electrical digital data processing, special data processing applications, instruments, etc.

Inactive Publication Date: 2015-06-03
BEIJING DEDA INFORMATION TECH
4 Cites, 13 Cited by

AI Technical Summary

Problems solved by technology

At present, there is no mature method or mining engine for associating and mining public information about a designated non-natural person object.




Embodiment Construction

[0024] The present invention will be further described below in conjunction with the accompanying drawings.

[0025] (1) According to the information model of the designated non-natural person object, determine where its public information is distributed on the Internet, for example government websites, portal websites, professional media, and specialized agencies. Then, according to the nature of each information source, determine whether it calls for direct collection or certified collection, the technical means to use, and the data elements that can be collected;

[0026] The engine collects all public information on the Internet, covering commercial, proprietary and public data sets. While complying with the original access rules of each data set, it uses direct collection and certified collection to extend its coverage of domains and public information data sources as far as possible.
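The source-planning step in (1) can be illustrated with a minimal sketch. The source does not define how "direct" and "certified" collection are distinguished, so the sketch assumes that certified collection means the source requires authenticated access. All names here (`InformationSource`, `requires_login`, `plan_collection`, and the sample sources) are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class CollectionMode(Enum):
    DIRECT = "direct"        # assumed: source can be read without credentials
    CERTIFIED = "certified"  # assumed: source requires authenticated access


@dataclass
class InformationSource:
    name: str
    category: str            # e.g. "government website", "portal website"
    requires_login: bool     # hypothetical stand-in for "nature of the source"
    data_elements: List[str] = field(default_factory=list)

    @property
    def mode(self) -> CollectionMode:
        # Assumption: authentication is what separates the two modes.
        if self.requires_login:
            return CollectionMode.CERTIFIED
        return CollectionMode.DIRECT


def plan_collection(
    sources: List[InformationSource],
) -> Dict[CollectionMode, List[InformationSource]]:
    """Group sources by the collection mode they call for."""
    plan: Dict[CollectionMode, List[InformationSource]] = {
        mode: [] for mode in CollectionMode
    }
    for source in sources:
        plan[source.mode].append(source)
    return plan


# Illustrative sources only; the names and data elements are made up.
SOURCES = [
    InformationSource("registry", "government website", False,
                      ["registered name", "registration date"]),
    InformationSource("industry-db", "specialized agency", True,
                      ["credit rating"]),
    InformationSource("news-portal", "portal website", False,
                      ["headline", "publication date"]),
]

PLAN = plan_collection(SOURCES)
```

Keeping the collectable data elements on each source record means a later extraction stage can look up what to pull from a source without re-deriving it.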

[0027] (2) Different sources of information (websites, Weibo, WeChat, mobile applications), the corresponding styles...



Abstract

The invention discloses a big data-based public information association method and a mining engine. The method includes the following steps:
(1) collecting Internet public information sources, gathering the data sources related to mass public information according to whether they call for direct collection or certified collection;
(2) having a multi-source matching system match information types according to the different data sources;
(3) having a multi-format information extraction system extract the specified data and elements according to the different formats of the information carriers;
(4) having a multi-dimensional association integration and analysis system integrate and analyze the gathered data through operations such as deduplication, denoising, removal of false information, and clustering, according to an association algorithm based on public information models;
(5) having an expert correction system correct the deep learning algorithms and the systems used in the steps above, on the basis of the indexes and the quality assessment model acquired;
(6) having a visual display system display the specified public information visually and as an integrated whole, organized as a time series.
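The integration and display steps (deduplication, denoising, clustering, time-series ordering) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented association algorithm: deduplication uses exact matching on whitespace- and case-normalized text, denoising is a hypothetical minimum-length filter, and "clustering" is reduced to grouping records by entity name. The `Record` type and all function names are invented for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date
from typing import Dict, List


@dataclass(frozen=True)
class Record:
    entity: str      # the designated non-natural person object, e.g. a company
    published: date
    source: str
    text: str


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(text.lower().split())


def deduplicate(records: List[Record]) -> List[Record]:
    """Keep the first record for each (entity, normalized text) pair."""
    seen = set()
    unique = []
    for record in records:
        key = (record.entity, normalize(record.text))
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


def denoise(records: List[Record], min_length: int = 10) -> List[Record]:
    """Assumed noise rule: drop records whose text is too short to be useful."""
    return [r for r in records if len(normalize(r.text)) >= min_length]


def build_timelines(records: List[Record]) -> Dict[str, List[Record]]:
    """Group cleaned records by entity and sort each group by date."""
    timelines: Dict[str, List[Record]] = defaultdict(list)
    for record in denoise(deduplicate(records)):
        timelines[record.entity].append(record)
    for items in timelines.values():
        items.sort(key=lambda r: r.published)
    return dict(timelines)


if __name__ == "__main__":
    sample = [
        Record("Acme Ltd", date(2014, 5, 1), "portal",
               "Acme Ltd opens a new office in Beijing"),
        Record("Acme Ltd", date(2014, 5, 2), "weibo",
               "acme ltd  opens a new office in beijing"),
        Record("Acme Ltd", date(2013, 1, 10), "government",
               "Acme Ltd registered as a company"),
    ]
    for entity, items in build_timelines(sample).items():
        for item in items:
            print(entity, item.published.isoformat(), item.source)
```

A production engine would likely replace the exact-match key with a near-duplicate measure and the entity grouping with a real clustering step. The overall shape (clean, group, then order by time) follows the sequence the abstract describes.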

Description

Technical field

[0001] The present invention relates to the technical field of big data-based public information association methods and mining engines, and specifically relates to an association analysis method for full-cycle data in the development process of a designated non-natural person object and an implementation technology of a mining engine.

Background art

[0002] In the Internet era, data and information have become important corporate resources. Valuable information can be quickly extracted from the ever-changing massive data. At the same time, the information on the Internet is complex and scattered. General search engines have become a necessary tool for people to obtain information. They automatically index content and provide query services. When a user enters a keyword query, the search engine returns all URLs containing the keyword and provides links to the information. At present, there are many search engine systems on the Internet, but there ar...


Application Information

IPC(8): G06F17/30
Inventor: Not disclosed
Owner: BEIJING DEDA INFORMATION TECH