Unlock instant, AI-driven research and patent intelligence for your innovation.
A character denoising method and terminal based on binarization
What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A binarization and character technology, applied in the field of data processing, can solve problems such as the inability to identify connected domains of area noise points, poor denoising effect, etc.
Active Publication Date: 2021-08-20
厦门商集网络科技有限责任公司
View PDF15 Cites 0 Cited by
Summary
Abstract
Description
Claims
Application Information
AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology
Problems solved by technology
However, the above method can only identify the connected domain of noise points with a smaller area, but cannot identify the connected domain of noise points with a larger area, and the denoising effect is poor.
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more
Image
Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
Click on the blue label to locate the original text in one second.
Reading with bidirectional positioning of images and text.
Smart Image
Examples
Experimental program
Comparison scheme
Effect test
Embodiment 1
[0160] Such as figure 1 As shown, the present embodiment provides a binarization-based character denoising method, including:
[0162] Among them, image binarization is to set the gray value of the pixel on the image to 0 or 255, that is, the process of presenting an obvious black and white effect to the entire image. For example, in this embodiment, a single character in a character image is set to black, and the background of the character image is set to white. Binarizing the character image first can effectively distinguish the character from the background, and improve the efficiency of subsequent noise removal.
[0163] S2. Detect connected domains of the single-character image to obtain a first set of connected domains.
[0164] Among them, the connected domain refers to a collection of all connected points, and the connected points form a region, while the disconnected ...
Embodiment 2
[0285] Such as Figure 12 As shown, this embodiment provides a terminal, including one or more processors 1 and a memory 2, the memory 2 stores a program, and is configured to perform the following steps by the one or more processors 1:
[0286] S1. Binarize a character image of a single character to obtain a single character image.
[0287] Among them, image binarization is the process of setting the gray value of the pixels on the image to 0 or 255, that is, the process of presenting an obvious black and white effect to the entire image. For example, in this embodiment, a single character in a character image is set to black, and the background of the character image is set to white. Binarizing the character image first can effectively distinguish the character from the background, and improve the efficiency of subsequent noise removal.
[0288] S2. Detect connected domains of the single-character image to obtain a first set of connected domains.
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
PUM
Login to View More
Abstract
The invention relates to a binarization-based character denoising method and a terminal, belonging to the field of data processing. The present invention uses the phenomenon that the maximum number of vertical crossings of all numbers and letters is 3, firstly recognizes the main connected domain of numbers or alphabetic characters from the single-character image, and then connects other connected domains in the single-character image with the main connected domain in turn. Domain as a whole, if the main connected domain and a connected domain other than the main connected domain in the single-character image are taken as a whole, the maximum vertical crossing number is greater than 3, which means that the connected domain cannot be combined with the main connected domain Form a number or letter, the connected domain is a noisy connected domain, which should be removed. Improved the accuracy of denoising connected domains for English and numeric character images.
Description
technical field [0001] The invention relates to a binarization-based character denoising method and a terminal, belonging to the field of data processing. Background technique [0002] In order to improve the accuracy of character recognition, it is necessary to denoise the character image before recognizing characters to reduce interference. A commonly used method for denoising a character image is specifically to search for small invalid connected regions in the binarized character image and delete them. For example, a connected region with an area smaller than 5 pixels is automatically regarded as an isolated noisy connected domain, and the isolated noisy connected domain is deleted to reduce interference information. However, the above method can only identify the connected domain of noise points with a smaller area, but cannot identify the connected domain of noise points with a larger area, and the denoising effect is poor. Contents of the invention [0003] The te...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
Application Information
Patent Timeline
Application Date:The date an application was filed.
Publication Date:The date a patent or application was officially published.
First Publication Date:The earliest publication date of a patent with the same application number.
Issue Date:Publication date of the patent grant document.
PCT Entry Date:The Entry date of PCT National Phase.
Estimated Expiry Date:The statutory expiry date of a patent right according to the Patent Law, and it is the longest term of protection that the patent right can achieve without the termination of the patent right due to other reasons(Term extension factor has been taken into account ).
Invalid Date:Actual expiry date is based on effective date or publication date of legal transaction data of invalid patent.