A text conversion method and system for assisting blind people to read

A text conversion method for the blind, applied in the field of assisted reading, that addresses problems such as low efficiency, strict limits on text size, and logic errors, and achieves high precision with fast detection and recognition.

Active Publication Date: 2021-02-09

成都快眼科技有限公司

11 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

The existing readers for the blind described in the background have the following problems. Because the user cannot see, the finger may be placed in the wrong position or moved in the wrong direction, which produces logic errors whose cause the user cannot determine. The user must constantly reposition the finger, which is inefficient. Text size is also heavily restricted: text that cannot be touched cannot be read, which greatly limits practicality.

Examples

Embodiment 1

[0047] As shown in Figure 1, a text conversion method for assisting blind people to read includes the following process:

[0048] Step 1: Train the text position detection network and the recognition network separately.

[0049] Step 2: Use the trained text position detection network to detect the position of the text to be read, and use the voice guidance algorithm to guide the blind user to move the field of view so that text information is obtained at different positions.

[0050] Step 3: Use the trained recognition network to recognize the text at the different positions, use the text splicing algorithm to splice the recognition results into complete semantic content, and then convert that content into speech for reading aloud.

[0051] The text conversion method of Embodiment 1 guides the blind user, through voice interaction, to move their field of view and obtain text information at different positions on the basis of detecting...
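The control flow of Steps 2 and 3 can be sketched as a loop over video frames. This is only an illustrative skeleton, not the patent's implementation: the function name `read_page` and the injected callables `detect`, `recognize`, `splice`, and `speak` are hypothetical stand-ins for the trained networks, the splicing algorithm, and the speech output. Training (Step 1) and the voice guidance itself are left out.

```python
from typing import Any, Callable, Iterable, List


def read_page(
    frames: Iterable[Any],
    detect: Callable[[Any], list],
    recognize: Callable[[Any, list], List[str]],
    splice: Callable[[List[str], List[str]], List[str]],
    speak: Callable[[str], None],
) -> List[str]:
    """Run detection, recognition, and splicing per frame, then speak the result.

    All five parameters are hypothetical placeholders supplied by the caller.
    """
    all_text: List[str] = []  # spliced result, empty at the start
    for frame in frames:
        boxes = detect(frame)                # Step 2: locate text lines
        lines = recognize(frame, boxes)      # Step 3: recognize located text
        all_text = splice(all_text, lines)   # Step 3: merge into the running result
    speak(" ".join(all_text))                # convert the full content to speech
    return all_text
```

Passing the stages in as callables keeps the sketch runnable with simple fakes and avoids implying any particular network architecture.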

Embodiment 2

[0053] Preferably, on the basis of Embodiment 1 and as shown in Figure 2, the voice guidance algorithm proceeds as follows:

[0054] A. Use the camera image acquisition module to acquire 640×480-pixel video frames in real time at a stable frame rate. Perform text position detection on each frame: compute text features and obtain positioning boxes for all text-line regions within the frame. Each positioning box contains the coordinates of its 4 vertices.

[0055] B. Post-process the positioning boxes output by detection. Post-processing removes text boxes whose short side is shorter than a threshold (20 pixels in this embodiment). It also checks the 4 vertices of each positioning box in turn, and if any vertex is less than 50 pixels from the edge of the input image, the correspondin...
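The two post-processing checks in step B can be sketched as follows. The frame size, the 20-pixel short-side threshold, and the 50-pixel edge margin come from the text. The function names, the box representation (4 vertices in order around the quadrilateral), and the idea of reporting which edges a box is near are assumptions made for illustration. What the method does once a box is found near an edge is cut off in the text, so the sketch only reports the condition.

```python
import math
from typing import List, Sequence, Set, Tuple

Point = Tuple[float, float]
Box = Sequence[Point]  # assumed: 4 vertices, in order around the quadrilateral

FRAME_W, FRAME_H = 640, 480  # frame size stated in the text
MIN_SHORT_SIDE = 20          # short-side threshold from the text, in pixels
EDGE_MARGIN = 50             # edge margin from the text, in pixels


def short_side(box: Box) -> float:
    """Length of the shortest of the 4 sides of the box."""
    return min(math.dist(box[i], box[(i + 1) % 4]) for i in range(4))


def drop_small_boxes(boxes: List[Box]) -> List[Box]:
    """Remove boxes whose short side is below the threshold."""
    return [b for b in boxes if short_side(b) >= MIN_SHORT_SIDE]


def near_edges(box: Box) -> Set[str]:
    """Return the image edges that at least one vertex is closer than EDGE_MARGIN to.

    Naming the edges is an assumption; the text only says "the edge".
    """
    edges: Set[str] = set()
    for x, y in box:
        if x < EDGE_MARGIN:
            edges.add("left")
        if FRAME_W - x < EDGE_MARGIN:
            edges.add("right")
        if y < EDGE_MARGIN:
            edges.add("top")
        if FRAME_H - y < EDGE_MARGIN:
            edges.add("bottom")
    return edges
```

A box that survives `drop_small_boxes` and yields an empty set from `near_edges` lies fully inside the margin on every side.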

Embodiment 3

[0061] Preferably, on the basis of Embodiment 1 and Embodiment 2, the text splicing algorithm proceeds as follows:

[0062] a. Initialize a string array all_text to store the splicing result; it is initially empty.

[0063] b. Send the current video frame and its detected positioning boxes to the recognition network to obtain the recognized multi-line text, and store the result in a string array text.

[0064] c. Extract the first 5 characters of each string in text to obtain the string array compare_text to be compared.

[0065] d. Compare each string in compare_text, one by one, with every 5-character substring of each string in all_text. If the similarity between a string in compare_text and a substring of a string in all_text is greater than 0.7, record the position information of that substring....
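Steps c and d can be sketched as below. The array names `text`, `all_text`, and `compare_text`, the 5-character window, and the 0.7 threshold come from the text. The similarity measure is not specified in the visible text, so `difflib.SequenceMatcher.ratio()` is used here purely as an assumed stand-in. The function name `find_overlaps` and the `(i, j, k)` tuple layout are hypothetical. How the recorded positions are then used to splice is cut off in the text and is not attempted.

```python
from difflib import SequenceMatcher
from typing import List, Tuple

PREFIX_LEN = 5       # number of leading characters compared, from the text
SIM_THRESHOLD = 0.7  # similarity threshold, from the text


def similarity(a: str, b: str) -> float:
    """Assumed similarity measure; the text does not say which one is used."""
    return SequenceMatcher(None, a, b).ratio()


def find_overlaps(text: List[str], all_text: List[str]) -> List[Tuple[int, int, int]]:
    """Record where the prefix of a new line resembles a window of a spliced line.

    Returns tuples (i, j, k): text[i]'s prefix matches all_text[j] at offset k.
    """
    compare_text = [s[:PREFIX_LEN] for s in text]  # step c
    matches: List[Tuple[int, int, int]] = []
    for i, prefix in enumerate(compare_text):      # step d
        if not prefix:
            continue  # nothing to compare for an empty recognition result
        for j, line in enumerate(all_text):
            last_start = max(len(line) - PREFIX_LEN, 0)
            for k in range(last_start + 1):
                window = line[k:k + PREFIX_LEN]
                if similarity(prefix, window) > SIM_THRESHOLD:
                    matches.append((i, j, k))
    return matches
```

With this particular similarity measure, windows one character to either side of an exact match can also exceed 0.7, so a caller may want to keep only the best-scoring offset per pair of lines.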

Abstract

The invention relates to the field of assisted reading and discloses a text conversion method for assisting blind people to read. The method includes the following process: train the text position detection network and the recognition network separately; use the trained text position detection network to detect the position of the text to be read, and use the voice guidance algorithm to guide the blind user to move the field of view so that text information is obtained at different positions; use the trained recognition network to recognize the text at the different positions, splice the recognition results into complete semantic content with the text splicing algorithm, and convert that content into speech for reading aloud. The solution uses deep learning for text detection and recognition, which is fast and remains accurate in complex scenes. Guided by voice prompts, it automatically splices together the complete content of the entire page and plays it back as speech. There is no limit on page size, and the confusion that incomplete information would cause blind readers is avoided. The invention also discloses a text conversion system for assisting blind people to read.

Description

Technical field

[0001] The invention relates to the technical field of reading assistance, in particular to a text conversion method and system for assisting blind people to read.

Background

[0002] Existing printed books are designed for sighted readers. Blind people cannot read them because of their visual impairment, so they can only obtain information and learn from books translated into Braille or from audiobooks. However, the number of such reading materials is very limited, and the illiteracy rate among blind people remains high because reading is so difficult for them. Having lost the most intuitive way to obtain information, they are marginalized, with the serious consequence that they cannot integrate into society. With the development of computer science, many people have designed products to address the problem that blind people cannot read printed materials as sighted people do. The more representative method is the blind reader...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G09B21/00, G06K9/00, G06F16/9032

CPC: G09B21/006, G06V30/40

Inventor: 李宏亮, 孙旭

Owner: 成都快眼科技有限公司
