A system and method for automatic text extraction and recognition of low-resolution medical bill images
A low-resolution, automatic extraction technology, applied in character and pattern recognition, instruments, computing, etc., can solve the problems of text area pollution, character recognition rate reduction, and character recognition accuracy rate, and achieve the effect of improving the recognition rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0060] Specific Embodiment 1: In this embodiment, a Windows-based medical bill recognition system is developed for the huge bill business in the medical industry. The main functions are the input and recognition of medical bill images and the collection of image feature information.
[0061] According to the characteristics of low resolution and various types of interference of medical bill images, this embodiment designs a device including four modules: image preprocessing, field segmentation, single character segmentation, and character recognition, in which:
[0062] The functions that the image preprocessing module needs to realize are: reduce the noise on the original receipt image to improve the recognition rate of individual characters, such as the shading of the background, and remove elements that do not need to be recognized in the original receipt image, such as seals, barcodes, and borders around the edge of the image. Large areas of noise, etc. In this embodiment,...
specific Embodiment approach 2
[0066] Specific Embodiment 2: This embodiment provides a method for automatic text extraction and recognition of low-resolution medical bill images. The overall processing flow is divided into the following four steps: preprocessing of bill images, field area recognition, character string segmentation and Character recognition and verification.
[0067] Step 1. Preprocessing of bill image
[0068] General description of the implementation: In principle, the method of processing the elements that do not need to be recognized in the original bill image is to use the method of filling the background color of the bill image. Since the noise position on the edge of the original bill image is relatively fixed, this area can be filled with the background Color to achieve the effect of noise removal, and in the feasibility analysis stage, by analyzing the color parameters of the color pixels that make up the stamps and form lines, you can use the range rules of its color parameters to...
specific Embodiment approach 3
[0149] Specific embodiment three: the bill image processed in this embodiment is "Beijing Medical Outpatient Charge Bill", such as Figure 5 shown.
[0150] In the specific implementation process, the scanning device is required to be the current mainstream flatbed scanner when collecting images, and a scanner with automatic image cropping function is recommended, such as the Fujitsu fi-5220c high-speed scanner. When scanning, try to make the four sides of the check image Parallel to the scanning frame of the scanner, the receipt image generated by scanning needs to have the following characteristics:
[0151] 1. Color images with image resolution above 200dpi;
[0152] 2. The width of the image is greater than 1500 pixels, and the height is greater than 650 pixels (the default image size and coordinates in the following text are pixels);
[0153] 3. The image storage format is one of 24-bit JPG format, tiff format, and 256-color bmp format;
[0154] 4. All bill faces in th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com