Text conversion method and system for assisting blind in reading

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A conversion method and a conversion system technology, which are applied in the field of text conversion methods and systems for assisting blind people to read, can solve problems such as low efficiency, large text size restrictions, logical errors, etc., and achieve the effects of high precision and fast detection and recognition speed.

Active Publication Date: 2019-02-22

成都快眼科技有限公司

View PDF11 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, this type of reader has the following problems: the blind cannot see, and the finger is in the wrong place; or the direction of movement is wrong, and there will be logic errors, so that the blind do not know why; the blind need to constantly change the finger placement, which is inefficient; There is a large limit on the size of the text, and some texts that cannot be touched cannot be read, so the practicality is greatly limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0047] Such as figure 1 As shown, a text conversion method for assisting blind people to read, including the following process:

[0048] Step 1, train the text position detection network and the recognition network respectively;

[0049] Step 2, use the trained text position detection network to detect the position of the text to be read, and guide the blind person to move the field of vision through the voice guidance algorithm to obtain text information at different positions;

[0050] Step 3: Use the trained text position recognition network to recognize the text in different positions, and use the text splicing algorithm to splice the recognition results in different positions into complete semantic content and then convert it into voice reading.

[0051] The text conversion method for assisting the blind to read in Example 1 guides the blind to move their field of vision to obtain text information at different positions through voice interaction on the basis of detecting...

Embodiment 2

[0053] Preferably, on the basis of Example 1, such as figure 2 As shown, the specific process of the voice guidance algorithm is:

[0054] A. Use the camera image acquisition module to acquire video frames with a size of 640 pixels*480 pixels in real time, and output a stable frame frequency; perform text position detection on video frames with a size of 640 pixels*480 pixels, calculate text features and get Positioning boxes of all text line regions within the video frame, each positioning box contains the coordinates of its 4 vertices.

[0055]B. Perform post-processing on the positioning frame of the detection output. The post-processing includes removing text boxes whose short side length is smaller than a certain threshold. In this specific embodiment, 20 pixels are used. Post-processing also includes sequentially judging the 4 vertices of each positioning frame, and if there is a vertex that is less than 50 pixels away from the edge of the input image, the correspondin...

Embodiment 3

[0061] Preferably, on the basis of embodiment 1 and embodiment 2, the specific method steps of described text splicing algorithm are:

[0062] a. Initialize a string array all_text to store the splicing result, which is empty in the initial state;

[0063] b. Send the current video frame and the corresponding detection and positioning frame to the recognition network to obtain the recognized multi-line text result, and store the result in a string array text.

[0064] c. Extract the first 5 characters of each character string in the character string array text, and obtain the character string array compare_text to be compared.

[0065] d. Compare each string in the string array compare_text to be compared with the 5-character substring of each string in the result string array all_text one by one, if a certain string in compare_text If the similarity with a certain substring of a certain string in all_text is greater than 0.7, record the position information of the substring....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention, which relates to the field of auxiliary reading, discloses a text conversion method for assisting the blind in reading. The method comprises the following steps: training a character position detection network and an identification network separately; detecting the position of a to-be-read character by using the trained character position detection network and guiding a blind user to move the view based on a voice guidance algorithm to obtain character information at different positions; and identifying characters at different positions by using the trained character position detection network, splicing identification results of different positions into a complete semantic content based on a character splicing algorithm, and then converting the content into voice reading. Therefore, character detection and recognition are carried out by using deep learning and the speed is fast. The high precision is kept in a complex scene. The complete content of the whole page is spliced automatically by means of voice prompting and voice playing is carried out; the limitation on the page size is eliminated; and a phenomenon that the blind feel puzzles due to the incomplete reading information is avoided. In addition, the invention also discloses a text conversion system for assisting the blind in reading.

Description

technical field [0001] The invention relates to the technical field of reading assistance, in particular to a text conversion method and system for assisting blind people to read. Background technique [0002] Existing printed books are designed for normal people. Blind people cannot read because of their visual impairment, so they can only read some books translated into Braille or audiobooks to obtain information and learn knowledge. However, the number of these reading materials is very limited, and the illiteracy rate remains high due to the difficulty of reading for the blind. They have lost the most intuitive way to obtain information, so that they are marginalized, resulting in the serious consequence of being unable to integrate into society. With the development of computer science, many people have designed a series of products to solve the problem that blind people cannot read printed materials like normal people. The more representative method is the blind reader...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G09B21/00G06K9/00G06F16/9032

CPCG09B21/006G06V30/40

Inventor李宏亮孙旭

Owner成都快眼科技有限公司

Text conversion method and system for assisting blind in reading

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology