Invoice and goods classifying method based on heuristic rule

A heuristic, cargo technology, applied in invoicing/invoicing, special data processing applications, instruments, etc., can solve the problems of relying on a large number of classified instances, difficult to control the total number of categories and coverage levels, etc., achieving less human input and faster classification Fast and good learning ability

Active Publication Date: 2016-11-09
XI AN JIAOTONG UNIV
View PDF9 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, there are the following problems when the methods described in the above documents are applied to the classification of invoice goods: the classification categories of the classification method of do...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Invoice and goods classifying method based on heuristic rule
  • Invoice and goods classifying method based on heuristic rule
  • Invoice and goods classifying method based on heuristic rule

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The method for categorizing invoice goods based on heuristic rules of the present invention will be further described below with reference to the accompanying drawings and specific embodiments.

[0054] like figure 1 As shown, in the embodiment of this application, the process of classifying invoice goods includes:

[0055] Preprocess the name of the goods in the invoice. The preprocessing includes the following steps: delete the special symbols in the name of the goods in the invoice, normalize the separator in the name of the goods in the invoice, and use the number plus the quantifier in the name of the goods in the invoice as the specification model from the name of the goods in the invoice delete.

[0056] In this example, the name of the goods on a value-added tax invoice is "Huiyuan Qinlin Purified Water 550ml". After preprocessing, 550ml is detected as a number plus quantifier and then deleted from the name of the goods. Therefore, after preprocessing, The na...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an invoice and goods classifying method based on a heuristic rule. Non-standardized goods names on invoices are classified into a subcategory in national economy industry categories, and the classification result can directly reflect the operational information of an invoice issuing enterprise. On one hand, whether issued invoices are legal or not can be judged by comparing the invoices with the industry code of the enterprise during register; on the other hand, more visual information and more careful monitoring strength are provided for tax payment risk monitoring of the enterprise. The invoice goods names are classified into customhouse codes first through the case matching of customhouse historical records, the mapping relation from the customhouse codes to national economy industry category codes is artificially established, and finally the national economy industry category codes of the invoices are obtained. Thus, the classification problem that under classification case lacking condition, a large number of non-standardized goods names are classified into the national economy industry categories is solved, and a foundation is laid for subsequent tax analysis and risk monitoring.

Description

technical field [0001] The invention belongs to a method for classifying invoice goods, in particular to a method for classifying invoice goods based on heuristic rules. Background technique [0002] An important data source for tax analysis is the value-added tax invoice of the enterprise. The filling of the goods name mainly includes three items: brand, name and specification model. However, the filling of the goods name in the invoice has not yet been standardized, resulting in uneven quality of the goods data in the invoice. , There are even substandard formats and intentional staggering, which seriously cover up the business information that can be reflected in the invoices, provide a hotbed for tax evasion, and also affect the accuracy of tax analysis. Classifying a large number of invoice goods names has become an urgent problem to be solved in tax analysis and risk monitoring. [0003] At present, there is no classification method from invoice goods to national econ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06Q30/04
CPCG06Q30/04G06F40/242
Inventor 郑庆华蔚文达阮建飞董博
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products