Network commodity information extraction method

A product information and network technology, which is applied in the field of network product information extraction, can solve problems such as heavy workload, high technical difficulty, and poor accuracy, and achieve the effect of reducing the quality of personnel, facilitating related operations, and reducing the number of templates

Active Publication Date: 2012-06-13
浙江盘兴数智科技股份有限公司
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Manual extraction has good accuracy, but heavy workload, low efficiency, and high cost; fully automatic extraction has low cost, high efficiency, but poor accuracy, and high technical difficulty; semi-automatic extraction is based on a small amount of manual labeling, and the workload is small. The accuracy of human intervention is better guaranteed, and it is a more feasible way

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network commodity information extraction method
  • Network commodity information extraction method
  • Network commodity information extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0024] see Figure 1 ~ Figure 3 , in this embodiment, the whole process of commodity information extraction is described in detail by taking the "food category" of "Taobao" as an example.

[0025] 1. Use a template generation tool to generate an initial template for network commodity information extraction. The template generation tool is a browser plug-in tool designed by the applicant of the present invention. The procedure of this step is as follows:

[0026] (1), the user browses the webpage at will in the browser until the webpage where information needs to be extracted;

[0027] (2) Click the "template generation plug-in" icon in the browser toolbar to start the extraction tool;

[0028] (3) Click the "Start Collection" button to start the extraction process. At this time, when the mouse moves to each part of the webpage, a blue frame wi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a network commodity information extraction method. The network commodity information extraction method includes steps of (1), generating an initial network commodity information extraction template by the aid of a template generating tool; and (2), applying the initial template to extract commodity information of websites. By the aid of the template generating tool, the template is generated during information extraction, and is processed and modified, the information is extracted semi-automatically, and required specified information, such as names of commodities, image URL (uniform resource locator) of the commodities and prices, can be accurately and quickly extracted from web pages and labeled. The network commodity information extraction method leads operation to be visual and brings convenience for relevant operation, error rate is reduced, and work efficiency is improved.

Description

technical field [0001] The invention relates to a method for extracting network commodity information. Background technique [0002] In recent years, with the rapid development of e-commerce, all kinds of enterprises and individuals have carried out marketing activities through the Internet, which has brought together a large amount of commodity information and has become the largest source of commodity information. There is no shortage of information of great commercial value such as price, origin, dealer, sales volume, customer evaluation, etc. in this information. [0003] Classifying and analyzing these data and displaying them in an appropriate way can bring certain help to the business decision-making of enterprises. For example, for a company that manufactures and sells pressure cookers, how to position its own product prices, how to grasp the ever-changing industry market prices in the market, especially the price changes of competitors, how to know the scope of sal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 刘崟吴浩苗
Owner 浙江盘兴数智科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products