
Invoice trade name classification method, system and equipment and computer readable storage medium

A classification method for invoice commodity names, applied in computer parts, computing, instruments, etc. It addresses three problems: high labor consumption, results that are not necessarily accurate, and the large number of commodity names.

Inactive Publication Date: 2018-04-03
SERVYOU SOFTWARE GRP
Cites: 4 | Cited by: 16

AI Technical Summary

Problems solved by technology

However, product names are numerous and complex, and preparing keywords requires a lot of manpower. For each special category, a dedicated keyword library must be prepared in advance.
In addition, matching characters alone misses the semantic relationship between names, so results that rely on keyword matching are not necessarily accurate.



Examples


Embodiment Construction

[0051] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] The embodiment of the present invention discloses a method for classifying invoice commodity names. As shown in Figure 1, the method includes:

[0053] Step S11: According to preset rules, determine the product name vector of the product name of the invoice to be classified.

[0054] It should be noted that since the method for classifying invoice product names disclosed in the embodiment of the present application measures the semantic similarity of ...
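The source does not spell out the preset rules used in step S11. As a minimal sketch of one possible rule, the Python below averages per-token embeddings to get a product name vector and compares vectors by cosine similarity. The `EMBEDDINGS` table, its values, and the function names are hypothetical and exist only for illustration.

```python
import math
from typing import Dict, List

# Hypothetical toy embedding table (assumption, not from the source).
# A real system would load vectors trained on a large corpus.
EMBEDDINGS: Dict[str, List[float]] = {
    "rice": [0.9, 0.1, 0.0],
    "grain": [0.8, 0.2, 0.0],
    "mobile": [0.0, 0.1, 0.9],
    "phone": [0.1, 0.0, 0.9],
}
DIM = 3


def name_to_vector(name: str) -> List[float]:
    """Average the embeddings of known tokens; zero vector if none are known."""
    tokens = [t for t in name.lower().split() if t in EMBEDDINGS]
    if not tokens:
        return [0.0] * DIM
    return [sum(EMBEDDINGS[t][i] for t in tokens) / len(tokens) for i in range(DIM)]


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity; 0.0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


rice_vec = name_to_vector("Daohuaxiang rice")
phone_vec = name_to_vector("Xiaomi mobile phone")
grain_vec = name_to_vector("grain")
```

With this toy table, `cosine(rice_vec, grain_vec)` is much larger than `cosine(phone_vec, grain_vec)`, which is the kind of semantic closeness that plain character matching cannot express.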


Abstract

The application discloses an invoice trade name classification method, system, and equipment, and a computer-readable storage medium. The method comprises the following steps: determining the trade name vector of a to-be-classified invoice trade name based on preset rules; and inputting the trade name vector into a pre-created target trained model to obtain the invoice trade name type output by the model, in order to judge whether the type is a target type. The target trained model is obtained by using target training samples to train a to-be-trained model created from the Extreme Gradient Boosting (XGBoost) model, which is based on gradient boosting decision trees. The target training samples include training samples marked with target invoice trade name types and training samples that are obtained based on the Rocchio algorithm and are not marked with trade name types. The method can reduce labor cost and noticeably improve the accuracy of classification results.
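The abstract does not say how the Rocchio algorithm is applied to the unmarked samples. One common use in learning from positive and unlabeled data is to build a positive and a negative prototype vector and treat unlabeled samples closer to the negative prototype as likely negatives. The sketch below illustrates only that step, under that assumption; it does not train XGBoost. The function names and the weights `alpha=16`, `beta=4` are conventional choices, not values from the source.

```python
import math
from typing import List, Tuple

Vector = List[float]


def normalize(v: Vector) -> Vector:
    """Scale a vector to unit length; return a copy if it has zero length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n else list(v)


def mean(vectors: List[Vector]) -> Vector:
    """Component-wise mean of equally sized vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a: Vector, b: Vector) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def rocchio_split(
    positive: List[Vector],
    unlabeled: List[Vector],
    alpha: float = 16.0,  # assumed weight, not from the source
    beta: float = 4.0,    # assumed weight, not from the source
) -> Tuple[List[Vector], List[Vector]]:
    """Split unlabeled vectors into (likely negatives, remaining)."""
    p_mean = mean([normalize(v) for v in positive])
    u_mean = mean([normalize(v) for v in unlabeled])
    # Rocchio prototypes: each class mean pushed away from the other.
    proto_pos = [alpha * p - beta * u for p, u in zip(p_mean, u_mean)]
    proto_neg = [alpha * u - beta * p for p, u in zip(p_mean, u_mean)]
    likely_neg, rest = [], []
    for v in unlabeled:
        if cosine(v, proto_neg) > cosine(v, proto_pos):
            likely_neg.append(v)
        else:
            rest.append(v)
    return likely_neg, rest


marked = [[1.0, 0.0], [0.9, 0.1]]
unmarked = [[0.95, 0.05], [0.0, 1.0], [0.1, 0.9]]
likely_negatives, remaining = rocchio_split(marked, unmarked)
```

In a full pipeline, the marked samples and the samples selected this way would then be passed together to the gradient boosting model for training.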

Description

Technical field

[0001] The invention relates to the technical field of automatic classification, in particular to a method, system, equipment and computer-readable storage medium for classifying commodity names of invoices.

Background technique

[0002] There are many types of invoices, each of which has a specific scope of use. Classifying invoices is very common in practice, and among the many items of information on an invoice, such as product name, unit price, purchasing company, and billing company, the product name is a relatively common basis for classification. For example, according to the product name on the invoice, "Daohuaxiang rice" is classified into the grain category, and "Xiaomi mobile phone" is classified under electronic products.

[0003] In existing solutions, categories are generally judged based on keyword matching. For example, for wine, some words such as "wine", "Maotai", and "alcohol" are prepared in advance. Words that coincide...
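The keyword-matching baseline described in the background can be sketched as follows. The keyword set comes from the example in the text; the function name and the test product names are hypothetical and chosen only to show where pure character matching goes wrong.

```python
# Keywords taken from the example in the background section.
WINE_KEYWORDS = {"wine", "maotai", "alcohol"}


def is_wine_by_keyword(product_name: str) -> bool:
    """Return True if any prepared keyword occurs in the name (substring match)."""
    name = product_name.lower()
    return any(keyword in name for keyword in WINE_KEYWORDS)


# Illustrative product names (assumptions, not from the source).
hit = is_wine_by_keyword("Kweichow Maotai 500ml")              # True
missed = is_wine_by_keyword("Wuliangye baijiu")                # False: a liquor, but no keyword matches
false_alarm = is_wine_by_keyword("alcohol disinfectant wipes") # True: keyword matches, wrong category
```

The second and third calls show the two failure modes: a relevant product that no prepared keyword covers, and an unrelated product that happens to contain a keyword.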


Application Information

IPC(8): G06K9/62
CPC: G06F18/2148, G06F18/24
Inventor: 史源源, 陈宏仁, 王培勇, 姜巍
Owner: SERVYOU SOFTWARE GRP