Web page element collection method and device, terminal and computer readable storage medium

A technology of webpage elements and collection methods, applied in the field of network communication, can solve the problems of high technical threshold and difficulty in quickly collecting webpage data, and achieve the effect of simple operation and low threshold

Active Publication Date: 2018-02-23
深圳数阔信息技术有限公司
View PDF11 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For individual webmasters and individual store owners who are non-technical personnel, the technical threshold is very high, and it is difficult to quickly collect web page data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web page element collection method and device, terminal and computer readable storage medium
  • Web page element collection method and device, terminal and computer readable storage medium
  • Web page element collection method and device, terminal and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] see figure 1 , the present embodiment provides a web page element collection method, the method includes the following steps:

[0043] S10: Obtain the URL of the webpage to be collected input by the user, and open the webpage in the built-in browser.

[0044] S20: Obtain the webpage element clicked by the user, and display a function option group corresponding to the webpage element clicked by the user, where the function option group includes at least one selectable function option.

[0045]Wherein, the webpage element is a constituent element of a webpage, including various types such as pictures, texts, videos, and audios. In an HTML / XML web page, a web page element includes multiple sub-nodes, and each sub-node contains different information, so that the web page element becomes a node with complete information. When a user clicks on a web page element, the web page element will be fetched. According to the obtained webpage elements, different function option gro...

Embodiment 2

[0062] see image 3 , this embodiment provides a web page element collection device 100, the device includes:

[0063] The webpage opening module 110 is used to obtain the website address of the webpage that needs to be collected input by the user, and open the webpage in the built-in browser;

[0064] The option display module 120 is configured to display a corresponding function option group according to the web page element clicked by the user, and the function option group includes at least one selectable function option;

[0065] Function option determination module 130, configured to determine the function option selected by the user;

[0066] The operation generation and execution module 140 is configured to generate an XPath path expression corresponding to the webpage element, and generate execution steps corresponding to the function options or perform operations corresponding to the function options, and the execution steps are used to is executed to realize the c...

Embodiment 3

[0069] see Figure 4 This embodiment provides a terminal 200, the terminal 200 includes a memory 210 and a processor 220, the memory 210 is used to store a computer program, and the processor 220 executes the computer program so that the terminal 200 implements the method for collecting webpage elements described above.

[0070] Wherein, the terminal 200 includes terminal devices (such as computers, servers, etc.) that do not have mobile communication capabilities, and also includes mobile terminals (such as smart phones, tablet computers, vehicle-mounted computers, smart wearable devices, etc.).

[0071] The memory 210 may include an area for storing programs and an area for storing data. Wherein, the storage program area can store the operating system, at least one application program required by the function (such as sound playback function, image playback function, etc.); the storage data area can store data created according to the use of the terminal 200 (such as audio d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a web page element collection method. The method comprises the steps that a web page website which is input by a user and needs to be collected is acquired, and the web page is opened in a built-in browser; a web page element which is clicked by the user is acquired, and a function option group corresponding to the web page element which is clicked by the user is displayed; a function option which is selected by the user is determined; an Xpath path expression corresponding to the web page element is generated, and an execution step corresponding to the function optionis generated or the operation corresponding to the function option is executed. According to the web page element collection method and device, a terminal and a computer readable storage medium, theXpath path expression is adopted for positioning and selecting an operation mode in a user oriented mode, and the achievement threshold that nontechnical personnel quickly collect web page data is lowered.

Description

technical field [0001] The invention belongs to the technical field of network communication, and specifically relates to a web page element collection method, device, terminal and computer-readable storage medium. Background technique [0002] With the development of the Internet, especially the rise of C2C e-commerce, a large number of personal websites and online stores have emerged. In order to quickly realize website data or fill product information, individual website owners or individual store owners began to fill their own websites or stores by collecting similar information from other websites. Web page data acquisition has become an increasingly widely used Internet technology. [0003] At present, the general method of webpage data collection is to extract the source code of the entire webpage through network packet capture, then analyze the source code of the webpage, and match the source code of the webpage through regular expressions, and finally obtain the de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/955G06F16/9558G06F16/9566G06F16/958
Inventor 刘宝强肖云飞
Owner 深圳数阔信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products