Method and system for forming product database based on Internet data
An Internet and database technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as difficulty in comprehensive product information, low efficiency, and difficult product information
Inactive Publication Date: 2016-11-30
广州市尊网商通资讯科技有限公司
View PDF5 Cites 0 Cited by
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
In this way, due to the non-uniform format of product release standards, for the product demand side, the demand standards are various, and because the product description formats of major websites are not uniform, it is difficult to comprehensively sort out product information, and it is impossible to know the products that meet the demand standards. For more comprehensive product information, if product selection is carried out according to demand standards, for the case of large-volume multi-model product selection, it is often necessary to read a large number of web pages, which is inefficient
[0003] To sum up, due to the lack of a unified product description standard in related technologies, there is a technical problem that it is difficult to sort out product information
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreImage
Smart Image Click on the blue labels to locate them in the text.
Smart ImageViewing Examples
Examples
Experimental program
Comparison scheme
Effect test
Embodiment approach
[0067] One of the methods is based on the classification rules as follows:
[0068] In the classification decision, this method only determines the category of the sample to be divided according to the category of the nearest one or several samples.
[0069] The specific algorithm steps are as follows:
[0070] Re-describing the training text vector according to the set of feature items;
[0071] After the current text arrives, the current text is segmented according to the feature words, and the vector representation of the current text is determined;
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More PUM
Login to View More Abstract
The invention discloses a method and a system for forming a product database based on Internet data. The method is as follows: using theme crawler technology to capture webpage data whose relevance to the topic is higher than a preset threshold; storing the captured webpage data in a structured manner; automatically classifying the structured stored webpage data according to the category to which the product belongs; Count the occurrence times and occurrence times of product attributes in the automatically classified webpage data, perform weighted calculations on the occurrence times and occurrence times of product attributes according to the preset weights, obtain product attribute decision values, and determine the order of product attributes according to the product attribute decision values . The system includes a data capture module, a structured storage module, a data classification module and an attribute decision module. This method and system for forming a product database based on Internet data allows users to obtain relatively comprehensive comprehensive information without collecting and sorting out product information on the Internet; it ensures the real-time nature of the data and meets the real-time needs of users.
Description
technical field [0001] The invention relates to the technical field of Internet data processing, in particular to a method and system for forming a product database based on Internet data. Background technique [0002] At present, the product catalogs of some mainstream websites are formed by using fixed product release templates for various industries to form a product description. Moreover, each website adopts different standards for describing the same product. In this way, due to the non-uniform format of product release standards, for the product demand side, the demand standards are various, and because the product description formats of major websites are not uniform, it is difficult to comprehensively sort out product information, and it is impossible to know the products that meet the demand standards. For more comprehensive product information, if product selection is carried out according to demand standards, it is often necessary to read a large number of web pa...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More Application Information
Patent Timeline
Login to View More Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 张丽
Owner 广州市尊网商通资讯科技有限公司
