Crawler retrieval and big data intelligent recommendation optimization processing method based on open source framework

A big data and crawler technology, applied in the field of big data platform and resource acquisition, can solve the problems of bulky Java language, increase the difficulty of data crawling, poor asynchronous support, etc., achieve high flexibility and scalability, and improve network resources Acquisition capabilities and intelligent recommendation algorithm functions, saving labor and time costs
CN111428112AInactive Publication Date: 2020-07-17上海浩方信息技术有限公司

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
上海浩方信息技术有限公司
Publication Date
2020-07-17
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
Patent Text Reader

Abstract

The invention relates to a crawler retrieval and big data intelligent recommendation optimization processing method based on an open source framework. The method comprises the steps: carrying out theresource crawler through the open source framework, and obtaining a needed target service resource; performing word segmentation on the obtained target service resources according to an NPL word segmentation technology to realize information word segmentation matching; and performing information screening and recommendation according to preset keywords, fields and weight values. The crawler retrieval and big data intelligent recommendation optimization processing method based on the open source framework is adopted; the network resource collection capability and the intelligent recommendationalgorithm function of the target user are improved; a web crawler technology is realized by combining an open source HttpClient technology and a python algorithm packet, so that part of labor investment and time cost are greatly reduced or even directly saved, and crawler resource management has relatively high flexibility and expandability; and intelligent recommendation algorithm scheduling is executed for the target user, so that on-demand filtering is realized and effective information is screened out.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the field of big data platforms, in particular to the field of resource acquisition, and specifically refers to a method for crawler retrieval and big data intelligent recommendation optimization processing based on an open source framework. Background technique

[0002] With the advent of the era of network big data and the rapid development of business data of various enterprises and companies, in order to deal with the ever-increasing huge data, for the research and analysis of big data, various units need to continuously invest a lot of manpower, material resources and time costs, which are specifically reflected in the following aspects point:

[0003] 1) Continuously invest a lot of human resources and time costs in order to obtain data resources;

[0004] 2) Due to the continuous growth of the acquired data, it is necessary to extract and analyze effective resources to invest too much labor cost;

[0005] 3) Due to the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More