Electronic invoice content analysis method and system
A technology for electronic invoices and content, which is applied in electronic digital data processing, special data processing applications, instruments, etc., and can solve the problem that the invoice management system cannot meet the requirements.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] Embodiment 1. A method for analyzing the content of an electronic invoice.
[0041] Combine below figure 1 The method of the first embodiment will be described in detail.
[0042] figure 1 It is a flow chart of the method for analyzing the content of the electronic invoice in Embodiment 1 of the present invention, such as figure 1 As shown, the electronic invoice described in the embodiment of the present invention is based on the layout file format, including a position analysis module, a text merging module and a text association identification module, including the following steps:
[0043] Step S101 , the location analysis module invokes the format file analysis engine module to perform location analysis on the content of the electronic invoice, and obtain a set of location information in units of characters.
[0044] Specifically, the location parsing module parses the location information of each character in the electronic invoice. Preferably in the embodimen...
Embodiment 2
[0063] Embodiment 2, the processing flow of the text merging module in the method for analyzing the content of the electronic invoice.
[0064] Combine below figure 2 The method of this embodiment will be described in detail.
[0065] figure 2 It is the processing flowchart of the text merging module in the method for analyzing the content of the electronic invoice in the second embodiment of the present invention, such as figure 2 As shown, the method of the present embodiment includes the following steps:
[0066] Step S201, sort the character sets in the position information set from top to bottom and from left to right.
[0067] Step S202 , using the character gap threshold to preliminarily merge characters in the same text field in the same line.
[0068] Step S203, using the tag dictionary to set the type attribute of each text field text line.
[0069] In the embodiment of the present invention, the tag dictionary defines the face elements of the electronic invo...
Embodiment 3
[0073] Embodiment 3, the processing flow of the text association identification module in the method for analyzing the content of the electronic invoice.
[0074] image 3 It is a processing flowchart of the text association recognition module in the method for analyzing the content of the electronic invoice according to the third embodiment of the present invention.
[0075] Step S301, according to the label dictionary, traverse the text line set, and match a commodity line label.
[0076] Step S302: Find all commodity row labels according to the row gap threshold and the matched commodity row labels.
[0077] Step S303, start traversing the text line set at the end of the product line label, and determine the start and end positions of the product line content.
[0078] Step S304, judging the attribute type of the currently indexed text.
[0079] Step S305, if the attribute type of the currently indexed text is a text type, continue traversing, and return to step S304 to ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com