A Method for Extracting Text Lines of Handwritten Documents Based on Instance Segmentation
An extraction method and technology of text lines, applied in the field of image processing, can solve problems such as inaccurate extraction, and achieve an effect that is easy to implement and has good practical value
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0060] This embodiment provides a method for extracting text lines of handwritten documents based on instance segmentation, which is specifically performed according to the following steps:
[0061] Step 1. Scale and zoom the images in the handwritten document dataset to finally obtain the training set;
[0062] Scale the images in the ICDAR2013HandSegmCont handwritten document dataset. Assume that the height and width of a picture are h and w respectively, if h≤max_size, w≤max_size, no scaling operation will be performed, otherwise, use the nearest neighbor interpolation method in the Image library to reduce the picture and label to height and images whose widths are h×scale and w×scale respectively, where, Among them, the value of max_size is 800, and the values of h×scale and w×scale need to be rounded. Perform the above operations on each picture and its label in the training set to obtain the final training set.
Embodiment 2
[0086] This embodiment provides a method for extracting text lines of handwritten documents based on instance segmentation, which is specifically performed according to the following steps:
[0087] Step 1. Scale and zoom the images in the handwritten document dataset to finally obtain the training set;
[0088] Scale the images in the ICDAR2013HandSegmCont handwritten document dataset. Assume that the height and width of a picture are h and w respectively, if h≤max_size, w≤max_size, no scaling operation will be performed, otherwise, use the nearest neighbor interpolation method in the Image library to reduce the picture and label to height and images whose widths are h×scale and w×scale respectively, where, Among them, the value of max_size is 1000, and the values of h×scale and w×scale need to be rounded. Perform the above operations on each picture and its label in the training set to obtain the final training set.
[0089] Step 2, train the data set in the training s...
Embodiment 3
[0112] This embodiment provides a method for extracting text lines of handwritten documents based on instance segmentation, which is specifically performed according to the following steps:
[0113] Step 1. Scale and zoom the images in the handwritten document dataset to finally obtain the training set;
[0114] Scale the images in the ICDAR2013HandSegmCont handwritten document dataset. Assume that the height and width of a picture are h and w respectively, if h≤max_size, w≤max_size, no scaling operation will be performed, otherwise, use the nearest neighbor interpolation method in the Image library to reduce the picture and label to height and images whose widths are h×scale and w×scale respectively, where, Among them, the value of max_size is 600, and the values of h×scale and w×scale need to be rounded. Perform the above operations on each picture and its label in the training set to obtain the final training set.
[0115] Step 2, train the data set in the training se...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


