Product database forming method based on Internet data and system

An Internet and database technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as low efficiency, difficulty in product information collation, and difficulty in product information comprehensive collation, so as to meet real-time needs and ensure real-time performance Effect

Inactive Publication Date: 2013-09-25
广州市尊网商通资讯科技有限公司
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this way, due to the non-uniform format of product release standards, for the product demand side, the demand standards are various, and because the product description formats of major websites are not uniform, it is difficult to comprehensively sort out product information, and it is impossible to know the products that meet the demand standards. For more comprehensive product information, if product selection is carried out according to demand standards, for the case of large-volume multi-model product selection, it is often necessary to read a large number of web pages, which is inefficient
[0003] To sum up, due to the lack of a unified product description standard in related technologies, there is a technical problem that it is difficult to sort out product information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Product database forming method based on Internet data and system
  • Product database forming method based on Internet data and system
  • Product database forming method based on Internet data and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0065] One of the methods is based on the classification rule: if most of the k most similar samples in the feature space (that is, the nearest neighbors in the feature space) of a sample belong to a certain category, then the sample also belongs to this category.

[0066] In the classification decision, this method only determines the category of the sample to be divided according to the category of the nearest one or several samples.

[0067] The specific algorithm steps are as follows:

[0068] Re-describing the training text vector according to the set of feature items;

[0069] After the current text arrives, the current text is segmented according to the feature words, and the vector representation of the current text is determined;

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a product database forming method based on Internet data and a system. The method includes the steps of capturing webpage data with the theme relevance higher than a preset threshold value by the adoption of the focused crawler technology, performing structuralized storage on the captured webpage data, automatically classifying the structuralized storage webpage data according to the categories which products belong to, performing statistics on the frequency and the time of occurrence of attributes of the products in the webpage data after the automatic classification, performing weighting calculation on the frequency and the time of occurrence of the attributes of the products according to preset weighting, acquiring the decision value of the attributes of the products, and determining the sort order of the attributes of the products according to the decision value of the attributes of the products. The system comprises a data capturing module, a structuralized storage module, a data classifying module and an attribute deciding module. According to the product database forming method based on the Internet data and the system, a user can acquire comprehensive and summarized information without needing to collect and sort product information on the Internet, real-time performance of data is ensured, and real-time requirements of the user are met.

Description

technical field [0001] The invention relates to the technical field of Internet data processing, in particular to a method and system for forming a product database based on Internet data. Background technique [0002] At present, the product catalogs of some mainstream websites are formed by using fixed product release templates for various industries to form a product description. Moreover, each website adopts different standards for describing the same product. In this way, due to the non-uniform format of product release standards, for the product demand side, the demand standards are various, and because the product description formats of major websites are not uniform, it is difficult to comprehensively sort out product information, and it is impossible to know the products that meet the demand standards. For more comprehensive product information, if product selection is carried out according to demand standards, it is often necessary to read a large number of web pa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 张丽
Owner 广州市尊网商通资讯科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products