
A text conversion method and system for assisting blind people to read

A text conversion method for blind users, applied in the field of assisted reading. It addresses problems of existing readers such as low efficiency, tight limits on text size, and logic errors, and it achieves high accuracy with fast text detection and recognition.

Active Publication Date: 2021-02-09
成都快眼科技有限公司
11 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

Existing readers for the blind that rely on finger placement have the following problems: because the user cannot see, the finger may be put in the wrong place or moved in the wrong direction, which causes logic errors that the user cannot diagnose; the user must keep repositioning the finger, which is inefficient; and the text size is tightly restricted, so text that cannot be touched cannot be read, which greatly limits practicality.



Examples


Embodiment 1

[0047] As shown in Figure 1, a text conversion method for assisting blind people to read includes the following steps:

[0048] Step 1: Train the text position detection network and the text recognition network separately.

[0049] Step 2: Use the trained text position detection network to detect the position of the text to be read, and use the voice guidance algorithm to guide the blind user to move the field of view so that text information at different positions is captured.

[0050] Step 3: Use the trained text recognition network to recognize the text at each position, use the text splicing algorithm to splice the recognition results from the different positions into complete, semantically coherent content, and then convert that content to speech for reading aloud.

[0051] In Embodiment 1, the text conversion method guides the blind user, through voice interaction, to move the field of view and obtain text information at different positions, on the basis of detecting...
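
The flow of Steps 2 and 3 can be sketched as a simple loop over captured frames. This is only an illustrative outline, not the patented implementation: the function `read_aloud` and its parameters `detect`, `recognize`, `splice`, and `speak` are hypothetical names introduced here, the trained networks from Step 1 are assumed to be supplied as plain callables, and the voice guidance that decides where the user should move next is left out.

```python
from typing import Any, Callable, Iterable, List


def read_aloud(
    frames: Iterable[Any],
    detect: Callable[[Any], list],
    recognize: Callable[[Any, list], List[str]],
    splice: Callable[[List[str], List[str]], List[str]],
    speak: Callable[[str], None],
) -> List[str]:
    """Run detection, recognition and splicing on each frame, then speak the result.

    All five arguments are hypothetical stand-ins for components the patent
    describes only at a high level.
    """
    all_text: List[str] = []  # accumulated page content, empty at the start
    for frame in frames:
        boxes = detect(frame)              # Step 2: locate text lines
        lines = recognize(frame, boxes)    # Step 3: recognize text in those boxes
        all_text = splice(all_text, lines) # Step 3: merge into the running result
    speak("\n".join(all_text))             # convert the complete content to speech
    return all_text
```

Passing the components in as callables keeps the outline independent of any particular detection or recognition model.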

Embodiment 2

[0053] Preferably, building on Embodiment 1 and as shown in Figure 2, the voice guidance algorithm proceeds as follows:

[0054] A. Use the camera image acquisition module to capture 640 × 480 pixel video frames in real time at a stable frame rate. Run text position detection on each 640 × 480 pixel frame, compute the text features, and obtain positioning boxes for all text-line regions in the frame. Each positioning box is given by the coordinates of its 4 vertices.

[0055] B. Post-process the positioning boxes output by the detector. Post-processing removes text boxes whose short side is shorter than a threshold, which is 20 pixels in this embodiment. It also checks the 4 vertices of each positioning box in turn, and if any vertex is less than 50 pixels from the edge of the input image, the correspondin...
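
The two checks described in step B can be illustrated with a short sketch. It assumes each positioning box is a list of four (x, y) vertices given in order around the quadrilateral, and it measures distance to the edge against a 640 × 480 frame. The names `short_side`, `near_edge`, and `postprocess` are hypothetical. Because the source text is cut off before it says what happens to a box near the edge, the sketch only flags such boxes and takes no further action.

```python
import math

FRAME_W, FRAME_H = 640, 480   # frame size from step A
MIN_SHORT_SIDE = 20           # threshold used in this embodiment, in pixels
EDGE_MARGIN = 50              # distance to the image edge, in pixels


def short_side(box):
    """Length of the shortest side of a 4-vertex box (vertices in order)."""
    return min(math.dist(box[i], box[(i + 1) % 4]) for i in range(4))


def near_edge(box, width=FRAME_W, height=FRAME_H, margin=EDGE_MARGIN):
    """True if any vertex lies less than `margin` pixels from the image edge."""
    return any(min(x, y, width - x, height - y) < margin for x, y in box)


def postprocess(boxes):
    """Drop boxes that are too thin and flag the remaining ones near the edge.

    Returns a list of (box, is_near_edge) pairs. What the method does with
    a flagged box is not stated in the visible text, so nothing more is done.
    """
    kept = [b for b in boxes if short_side(b) >= MIN_SHORT_SIDE]
    return [(b, near_edge(b)) for b in kept]
```

For example, a 200 × 10 pixel box would be discarded by the short-side check, while a 190 × 40 pixel box whose left edge sits at x = 10 would be kept and flagged.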

Embodiment 3

[0061] Preferably, building on Embodiments 1 and 2, the text splicing algorithm consists of the following steps:

[0062] a. Initialize a string array all_text to hold the splicing result; it is empty in the initial state.

[0063] b. Send the current video frame and its detected positioning boxes to the recognition network to obtain the recognized multi-line text, and store the result in a string array text.

[0064] c. Extract the first 5 characters of each string in the array text to obtain the string array compare_text to be compared.

[0065] d. Compare each string in compare_text, one by one, with every 5-character substring of each string in the result array all_text. If the similarity between a string in compare_text and a substring of a string in all_text is greater than 0.7, record the position information of that substring....
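
Steps c and d can be sketched as a sliding-window comparison. The source does not say how similarity is computed, so this sketch assumes the ratio from Python's `difflib.SequenceMatcher` as a stand-in measure. The function names `similarity` and `find_overlaps` are hypothetical, and the sketch stops at recording match positions because the remaining steps are not visible in the text.

```python
from difflib import SequenceMatcher

PREFIX_LEN = 5        # number of leading characters compared (step c)
SIM_THRESHOLD = 0.7   # similarity threshold (step d)


def similarity(a, b):
    """Assumed similarity measure: difflib's ratio in the range [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def find_overlaps(text, all_text):
    """Return (i, j, k) triples: line i of `text` matches line j of
    `all_text` at character offset k."""
    compare_text = [s[:PREFIX_LEN] for s in text]   # step c
    matches = []
    for i, prefix in enumerate(compare_text):       # step d
        for j, line in enumerate(all_text):
            # Slide a 5-character window over the stored line; a line
            # shorter than 5 characters is compared once as a whole.
            for k in range(max(len(line) - PREFIX_LEN + 1, 1)):
                window = line[k:k + PREFIX_LEN]
                if similarity(prefix, window) > SIM_THRESHOLD:
                    matches.append((i, j, k))
    return matches
```

With a threshold of 0.7, neighboring windows can both pass, so a caller would likely need to choose among several recorded positions for the same line.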


Abstract

The invention relates to the field of assisted reading and discloses a text conversion method for assisting blind people to read. The method includes the following steps: train the text position detection network and the text recognition network separately; use the trained text position detection network to detect the position of the text to be read, and use the voice guidance algorithm to guide the blind user to move the field of view so that text information at different positions is captured; use the trained text recognition network to recognize the text at each position, splice the recognition results from the different positions into complete, semantically coherent content with the text splicing algorithm, and then convert that content to speech for reading aloud. The solution uses deep learning for text detection and recognition, which is fast and stays accurate in complex scenes. Guided by voice prompts, it automatically splices together the complete content of the whole page and plays it back as speech. There is no limit on page size, which avoids the confusion that incomplete information causes blind readers. The invention also discloses a text conversion system for assisting blind people to read.

Description

Technical field

[0001] The invention relates to the technical field of reading assistance, and in particular to a text conversion method and system for assisting blind people to read.

Background

[0002] Existing printed books are designed for sighted readers. Because of their visual impairment, blind people cannot read them and must rely on books translated into Braille or on audiobooks to obtain information and learn. The supply of such material is very limited, and because reading is so difficult for blind people, their illiteracy rate remains high. Having lost the most direct way of obtaining information, they are marginalized and can find it hard to integrate into society. With the development of computer science, a range of products has been designed to address the fact that blind people cannot read printed material as sighted people do. The more representative method is the blind reader...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G09B21/00, G06K9/00, G06F16/9032
CPC: G09B21/006, G06V30/40
Inventor: 李宏亮, 孙旭
Owner: 成都快眼科技有限公司