Video image-text courseware text extraction method and device, equipment and medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A text extraction and courseware technology, applied in the field of artificial intelligence, can solve the problems of unavailable extraction, text fragmentation, irregular text, etc., and achieve good transferability

Active Publication Date: 2021-01-29

PINGAN INT SMART CITY TECH CO LTD

View PDF7 Cites 5 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In addition, video files and ppt files used for recognition usually contain a large amount of non-standard text, such as oblique text, vertical text, and animation effects will also cause the failure of recognition

Therefore, although the existing OCR technology can recognize the text contained in video, ppt, and graphics, the extracted text is fragmented, incomplete, and text fragments lacking structural information, and cannot be further extracted and used

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0061] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0062] figure 1 It is a flow chart of video, graphic and text courseware text extraction according to the present invention, such as figure 1 As shown, the text extraction method includes:

[0063] Step S1, classifying the video-graphic courseware, using a text recognition method to identify the pictures in the video-graphic courseware and all existing text boxes on the pictures, and obtaining the structured information of the pictures;

[0064] Step S2, performing regularization processing on the picture, and updating the coordinates of each text box on the picture;

[0065] Step S3, constructing a neural network language model, the input of the neural network language model is a plurality of text fields, and the output is the rationality of splicing the plurality of text fields;

[0066] Step S4, splicing the tex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to artificial intelligence, and discloses a video image-text courseware text extraction method. The method comprises the steps of classifying video image-text courseware, identifying pictures in the video image-text courseware and all textboxes existing on the pictures by adopting a text identification method, and obtaining structured information of the pictures; performing regularization processing on the picture to update the coordinate of each textbox on the picture; constructing a neural network language model for training, wherein the input of the neural network language model is a plurality of character fields, and the output of the neural network language model is the splicing rationality of the plurality of character fields; splicing the textboxes, performingreasonability judgment on splicing by adopting a trained neural network language model, and if the reasonability conforms to a preset value, extracting texts after the textboxes are spliced. In addition, the invention also relates to a blockchain technology, and the video image-text courseware can be stored in the blockchain. According to the invention, complete readable, processable and structured texts can be provided from images and videos.

Description

technical field [0001] The invention relates to artificial intelligence, in particular to a method, device, electronic equipment and computer-readable storage medium for extracting text from video, graphic and text courseware. Background technique [0002] With the rapid development of artificial intelligence technology and mobile networks, online education has become more and more popular. The teaching of online education is usually carried out in the form of video, ppt, or graphic html, and there is a large amount of course knowledge stored in the form of pictures. However, effective, accurate and structured textual information is often required to make use of this knowledge. Therefore, there is an urgent need for structured text extraction of these video graphic data. [0003] In the prior art, the text extraction of videos and pictures is based on OCR (Optical Character Recognition, Optical Character Recognition), but no matter what OCR recognition algorithm is, it is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06K9/00G06K9/20G06K9/32G06N3/04G06N3/08

CPCG06N3/08G06V30/413G06V30/414G06V10/22G06V10/25G06N3/045

Inventor王异秀

OwnerPINGAN INT SMART CITY TECH CO LTD

Video image-text courseware text extraction method and device, equipment and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology