Unlock instant, AI-driven research and patent intelligence for your innovation.

Vehicle insurance electronic policy text recognition and extraction method and system

An electronic insurance policy and text recognition technology, applied in the direction of electrical digital data processing, instruments, calculations, etc., can solve the problems that cannot meet the needs of the insurance industry for the extraction of vehicle insurance electronic insurance policies, and achieve high accuracy, wide application, and high extraction effects

Pending Publication Date: 2021-06-04
道和云科技(天津)有限公司
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the obvious industry characteristics of the vehicle insurance electronic insurance policy in the insurance industry, the data attributes, data format, and data content of the electronic insurance policy have business characteristics and rules, but the PDF format is diverse, not only using table style to display policy data, but also using flow layout If the policy data is displayed sequentially, the general PDF information extraction technology cannot meet the extraction needs of the insurance industry's vehicle insurance electronic policy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vehicle insurance electronic policy text recognition and extraction method and system
  • Vehicle insurance electronic policy text recognition and extraction method and system
  • Vehicle insurance electronic policy text recognition and extraction method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] see figure 1 and figure 2 , figure 1 A schematic diagram of the steps of a vehicle insurance electronic policy text recognition and extraction method provided by an embodiment of the present invention is as follows:

[0035] Step S100, constructing a vehicle insurance electronic policy data model library in the insurance industry;

[0036] Specifically, train and establish a vehicle insurance electronic policy document identification rule base, train and establish an insurance company identification rule base, train and establish an insurance company's vehicle insurance product identification rule base, establish an insurance company's vehicle insurance product data set, train and establish an insurance company Vehicle insurance product data analysis model library, the analysis model includes data positioning model, data interception model, and data formatting model.

[0037] Step S110, extracting and processing the coordinates of each character in the PDF file in t...

Embodiment 2

[0084] see image 3 , image 3 A schematic diagram of a vehicle insurance electronic policy text recognition and extraction system module provided in an embodiment of the present invention, which is as follows:

[0085] Build database module 10, be used for building insurance industry vehicle insurance electronic policy data model library;

[0086] The extracting module 20 is used to extract the coordinates of each character in the PDF file in the data model library and process it to obtain text data;

[0087] Filtering module 30, is used for filtering text data, obtains the vehicle insurance electronic policy;

[0088] The processing module 40 is used to match the data set to be extracted from the vehicle insurance electronic policy, and extract the data information on the vehicle insurance electronic policy according to the analytical model;

[0089] An output module 50, configured to output structured data and write editable documents.

[0090] It also includes a memory...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a vehicle insurance electronic policy text recognition and extraction method and system, and relates to the technical field of digital image processing. The vehicle insurance electronic policy text recognition and extraction method comprises the steps of constructing an insurance industry vehicle insurance electronic policy data model library; extracting coordinates of each character in the PDF file from the data model library and processing the coordinates to obtain text data; filtering the text data to obtain a vehicle insurance electronic policy; matching a to-be-extracted data set of the vehicle insurance electronic policy, and extracting data information on the vehicle insurance electronic policy according to the analytical model; and outputting the structured data and writing the structured data into the editable document. The method can extract the electronic insurance policy of the non-vehicle insurance in the insurance industry, and are wider in application. In addition, the invention also provides a vehicle insurance electronic policy text identification and extraction system. The system comprises a database construction module, an extraction module, a filtering module, a processing module and an output module.

Description

technical field [0001] The present invention relates to the technical field of digital image processing, in particular to a method and system for recognizing and extracting vehicle insurance electronic policy text. Background technique [0002] The PDF (Portable Document Format, Portable Document Format) file format can encapsulate text, fonts, formats, colors, and graphics and images independent of devices and resolutions in one file, which is cross-platform, highly integrated, and highly secure. advantages such as sex. In the digitization process of the insurance industry, electronic vehicle insurance policies are generated and stored in PDF file format. In many cases, we need to extract policy data information from these documents for statistics and analysis, and it is not convenient to convert data information into readable and writable information from PDF format documents. [0003] In the prior art, there are some general PDF information extraction technologies, such...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/151G06F40/103G06F40/111G06F40/126
CPCG06F40/151G06F40/103G06F40/111G06F40/126
Inventor 卢瑞瑞杨勇志张成东郭大朋龙金泉
Owner 道和云科技(天津)有限公司