Unlock instant, AI-driven research and patent intelligence for your innovation.

Data collecting method and device

A data collection and data collection technology, applied in the Internet field, can solve the problems of low data value density, large amount of basic data, and low accuracy, and achieve the effect of improving accuracy and data value density

Inactive Publication Date: 2017-03-22
KINGDEE SOFTWARE(CHINA) CO LTD
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] This method has certain shortcomings. The amount of basic data is too large, and the proportion of non-related data is relatively high. It is often difficult to correctly select data closely related to the subject, and the accuracy is low.
In the era of big data, the value density of presented data is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data collecting method and device
  • Data collecting method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to enable those skilled in the art to better understand the solution of the present invention, the present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0046] The embodiment of the present invention provides a data collection method, which can be applied to an application scenario where a search engine provides a search service for a user. A search engine refers to a system that collects information from the Internet, organizes and processes the information, provides search services for users, and displays relevant information related to user retrieval to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data collecting method and device. The data collecting method comprises the following steps that a target theme and a target theme collecting website are determined; target webpage links corresponding to the target theme are determined in a plurality of webpage links included in the target theme collecting website; content in a webpage corresponding to each target webpage link is collected, and a plurality of pieces of collected data are obtained; a result data set is determined according to the matching degree of the target theme and each piece of collected data. According to the technical scheme, the target webpage links corresponding to the target theme are determined in a targeted mode, so that less content is collected from the webpage corresponding to each target webpage link, correlation with the target theme is large, and the precision of data collection and data value density are improved.

Description

Technical field [0001] The present invention relates to the field of Internet technology, in particular to a data collection method and device. Background technique [0002] With the rapid development of Internet technology, there are more and more applications of big data. In the big data scenario, the demand for data collection is gradually increasing. [0003] In the prior art, when data on a certain topic is needed, most of the data is obtained from the Internet through non-targeted crawlers, and then based on the obtained massive data, through complex data matching algorithms, filter out related topics The data. [0004] This method has certain shortcomings. The amount of basic data is too large, and the proportion of non-relevant data is relatively high. It is often difficult to correctly select data closely related to the topic, and the accuracy is low. In the era of big data, the data value density presented is low. Summary of the invention [0005] The purpose of the pres...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/951G06F16/955G06F16/9558G06F16/957G06F40/284
Inventor 陈桓蔡晓胜张良杰
Owner KINGDEE SOFTWARE(CHINA) CO LTD