Commodity automatic classification method based on binary word segmentation and support vector machine

A support vector machine and automatic classification technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as unsatisfactory results, long training time for automatic product classification methods, and difficulty in building product feature information databases. Achieve the effects of solving unsatisfactory results, improving distinguishability, and improving convenience

Active Publication Date: 2016-08-31
乐乐启航(北京)教育科技有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to provide an automatic commodity classification method based on binary word segmentation and support vector machine, to solve the difficulty in constructing product feature information database and the training of automatic commodity classification method due to the structure of feature space in the prior art. The problem of long time and unsatisfactory effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Commodity automatic classification method based on binary word segmentation and support vector machine
  • Commodity automatic classification method based on binary word segmentation and support vector machine
  • Commodity automatic classification method based on binary word segmentation and support vector machine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Certain terms are used, for example, in the description and claims to refer to particular components. Those skilled in the art should understand that hardware manufacturers may use different terms to refer to the same component. The specification and claims do not use the difference in name as a way to distinguish components, but use the difference in function of components as a criterion for distinguishing. As mentioned throughout the specification and claims, "comprising" is an open term, so it should be interpreted as "including but not limited to". "Approximately" means that within an acceptable error range, those skilled in the art can solve the technical problem within a certain error range and basically achieve the technical effect. In addition, the term "coupled" herein includes any direct and indirect electrical coupling means. Therefore, if it is described that a first device is coupled to a second device, it means that the first device may be directly elect...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic merchandise classifying method on the basis of binary word segmentation and a support vector machine. The method mainly includes: subjecting all merchandise titles in a training set to binary word segmentation processing to construct a feature word library; constructing merchandise classification sets, expressing the merchandise titles as specific vectors according to the feature word library, generating training data by the aid of the specific vectors and the merchandise classification sets, and performing parameter optimization on the training data by a sequential dual method to obtain optimal classification vectors; calculating inner products of the optimal classification vectors and the specific vectors expressed by titles of merchandises to be classified, and selecting the classification corresponding to the maximum inner product as classification which the merchandises belong to. The automatic merchandise classifying method solves the problems that a product feature information base is hard to construct, and an automatic merchandise classifying method is long in training time and unsatisfactory in effect due to a feature space construction in the prior art.

Description

technical field [0001] The present invention relates to the field of data mining, in particular to an automatic commodity classification method based on binary word segmentation and Support Vector Machine (Support Vector Machine, SVM, an automatic learning type classification algorithm). Background technique [0002] Data mining generally refers to the process of automatically searching for information with special relationships hidden in a large amount of data. Classification is an important part of data mining. [0003] With the rapid development of electronic information technology, data mining has penetrated into various fields, especially in the field of e-commerce, and an efficient automatic classification method of commodities is very important to manage the massive commodity information in e-commerce. At present, there are many automatic classification methods for commodities, such as: decision tree method based on logical rules, naive Bayesian or Bayesian network m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 许大伦毛颖张立群
Owner 乐乐启航(北京)教育科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products