Data processing method, electronic equipment and computer readable medium

A data processing method applied in the computer field. It addresses problems that reduce the overall processing efficiency of neural network models, namely low training efficiency and low data processing efficiency, and thereby improves processing efficiency and the efficiency of constructing training samples.

Status: Inactive
Publication Date: 2020-10-30
BEIJING YIZHEN XUESI EDUCATION TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] As the above shows, existing neural network models suffer from one of two problems: low training efficiency, because training data is collected and labeled manually; or low data processing efficiency, because of the large amount of data to be processed. Either problem reduces the overall processing efficiency of the neural network model.


Examples

Embodiment 1

[0024] Referring to Figure 1, a flowchart of the steps of a data processing method according to Embodiment 1 of the present invention is shown.

[0025] This embodiment describes the data processing scheme provided by the embodiment of the present invention from the perspective of training sample construction. The data processing method of this embodiment includes the following steps:

[0026] Step S102: Perform text detection on the first text image, and obtain information of text regions in the first text image.

[0027] In this embodiment, the first text image may be any appropriate image containing text, including but not limited to: text images of plain text, images of various scenes containing text, video frame images with subtitles, and the like.

[0028] Text detection is a technology that detects a text area in an image and marks its boundary, that is, a text box. The text detection of the first text image can also be implemented in any appropriate way. At present, there are m...

Embodiment 2

[0050] Referring to Figure 2, a flowchart of the steps of a data processing method according to Embodiment 2 of the present invention is shown.

[0051] This embodiment takes video as the application scenario. It describes the data processing method of the embodiment of the present invention in terms of subtitle recognition on video frame images, combined with the acquisition of speech data and the further construction of training samples for a speech recognition model.

[0052] The data processing method of the present embodiment includes the following steps:

[0053] Step S201: Obtain a video frame image sequence from a video.

[0054] Wherein, the video may be a complete video or a video segment, each of which includes a series of video frame images with a time sequence relationship. In this embodiment, the sequence of video frame images refers to a plurality of video frame images having a time sequence relationship.
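As an illustration of a video frame image sequence with a time sequence relationship, the following minimal sketch pairs each sampled frame index with its timestamp. The names `FrameRef` and `frame_sequence`, and the `step` sampling parameter, are hypothetical and do not come from the patent; decoding the actual frame pixels is left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class FrameRef:
    """Reference to one video frame (hypothetical helper type)."""
    index: int        # position of the frame in the source video
    timestamp: float  # seconds from the start of the video or segment


def frame_sequence(total_frames: int, fps: float, step: int = 1) -> List[FrameRef]:
    """List every `step`-th frame in time order, with its timestamp.

    Assumes a constant frame rate; `step` is an illustrative sampling knob.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    if step < 1:
        raise ValueError("step must be at least 1")
    return [FrameRef(i, i / fps) for i in range(0, total_frames, step)]


# Example: a 4-second segment at 25 fps, sampled once per second.
sequence = frame_sequence(total_frames=100, fps=25.0, step=25)
```

Keeping the timestamp next to each frame is what later allows recognized subtitles to be aligned with the speech data of the same video.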

[0055] Step S203: Perform text detection on each video frame...

Embodiment 3

[0075] Referring to Figure 3, a flowchart of the steps of a data processing method according to Embodiment 3 of the present invention is shown.

[0076] This embodiment takes video as the application scenario. It describes the data processing method of the embodiment of the present invention from the perspective of text recognition on text images, that is, subtitle recognition in video frame images, and the further construction of training samples for a speech recognition model.
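One way to picture how recognized subtitles could become speech recognition training samples is sketched below. The grouping rule (consecutive frames with identical subtitle text form one segment) is an assumption made for illustration, not the method claimed in the patent; `SpeechSample` and `subtitle_segments` are hypothetical names, and the audio itself is represented only by its time span.

```python
from dataclasses import dataclass
from itertools import groupby
from typing import List, Tuple


@dataclass(frozen=True)
class SpeechSample:
    """Time span of audio plus the subtitle text used as its transcript."""
    start: float      # seconds
    end: float        # seconds
    transcript: str


def subtitle_segments(
    frames: List[Tuple[float, str]], frame_duration: float
) -> List[SpeechSample]:
    """Merge runs of consecutive frames that share the same subtitle.

    `frames` holds (timestamp, recognized subtitle) pairs in time order;
    an empty string means no subtitle was recognized in that frame.
    """
    samples: List[SpeechSample] = []
    for text, run in groupby(frames, key=lambda frame: frame[1]):
        run_frames = list(run)
        if not text:
            continue  # frames without subtitles yield no training sample
        start = run_frames[0][0]
        end = run_frames[-1][0] + frame_duration
        samples.append(SpeechSample(start, end, text))
    return samples


recognized = [(0.0, "hello"), (0.5, "hello"), (1.0, ""), (1.5, "world")]
samples = subtitle_segments(recognized, frame_duration=0.5)
```

Each resulting span can then be used to cut the matching audio from the video's speech track, with the subtitle text serving as the label.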

[0077] The data processing method of the present embodiment includes the following steps:

[0078] Step S202: Obtain a video frame image sequence from the video.

[0079] Wherein, the video may be a complete video or a video segment, each of which includes a series of video frame images with a time sequence relationship. In this embodiment, the sequence of video frame images refers to a plurality of video frame images having a time sequence relationship.

[0080] Step S204: performing text detection on ea...

Abstract

The embodiment of the invention discloses a data processing method, electronic equipment, and a computer-readable medium. The method comprises the following steps: performing text detection on a first text image to obtain information on the text region in the first text image; cropping the first text image according to the information on the text region to obtain a corresponding first cropped image that contains no text; obtaining a plurality of text sentences and fusing each of them with the first cropped image to obtain a plurality of second text images; and constructing training samples for a text recognition model, using the second text images as sample images and the content of the text sentence corresponding to each second text image as its text annotation. The embodiment of the invention improves the efficiency of constructing training samples for text recognition models.
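The steps in the abstract can be sketched roughly as follows. This is a dependency-free illustration under stated assumptions, not the patented implementation: images are plain nested lists of grayscale pixels, text boxes are assumed to be axis-aligned, the text-free crop is taken as the first horizontal band that no box touches, and `fuse` is a stand-in that darkens one pixel column per character instead of rendering real glyphs. All names (`TextBox`, `crop_text_free_band`, `fuse`, `build_samples`) are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

Image = List[List[int]]  # grayscale pixels, row-major (stand-in image type)


@dataclass(frozen=True)
class TextBox:
    """Axis-aligned text region, as a text detector might report it."""
    left: int
    top: int
    right: int   # exclusive
    bottom: int  # exclusive


def crop_text_free_band(image: Image, boxes: List[TextBox], band_height: int) -> Image:
    """Return a copy of the first band of rows that no text box overlaps."""
    for top in range(0, len(image) - band_height + 1):
        bottom = top + band_height
        if all(box.bottom <= top or box.top >= bottom for box in boxes):
            return [row[:] for row in image[top:bottom]]
    raise ValueError("no text-free band of the requested height")


def fuse(background: Image, sentence: str) -> Image:
    """Stand-in for drawing `sentence` on `background`.

    A real system would render glyphs with a font; here each character
    simply darkens one pixel column so the example needs no libraries.
    """
    fused = [row[:] for row in background]
    width = len(fused[0]) if fused else 0
    for col in range(min(len(sentence), width)):
        for row in fused:
            row[col] = 0
    return fused


def build_samples(
    image: Image, boxes: List[TextBox], sentences: List[str], band_height: int = 2
) -> List[Tuple[Image, str]]:
    """Pair each fused image with its sentence as the text annotation."""
    background = crop_text_free_band(image, boxes, band_height)
    return [(fuse(background, sentence), sentence) for sentence in sentences]


first_image = [[255] * 8 for _ in range(6)]
detected = [TextBox(left=0, top=0, right=8, bottom=3)]  # text occupies rows 0-2
training_samples = build_samples(first_image, detected, ["ab", "xyz"])
```

Because the annotation is the sentence that was fused into the image, no manual labeling step is needed, which is the source of the efficiency gain the abstract describes.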

Description

Technical field

[0001] The embodiments of the present invention relate to the technical field of computers, and in particular, to a data processing method, electronic equipment, and a computer-readable medium.

Background

[0002] With the development of machine learning technology, neural network models have made great progress in various applications. For example, neural network models are currently widely used in speech recognition, text recognition, and so on.

[0003] In many respects, the recognition accuracy of neural network models based on machine learning technology is already quite high. However, machine learning has inherent limitations: for example, a large amount of training data is required to train the neural network model, a large amount of data processing is required, and so on. At present, the commonly used method of obtaining training data is to collect data manually and label it manually to form training data. The larger the scal...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/32, G06K9/34, G06K9/62, G06N3/04, G10L15/26
CPC: G10L15/26, G06V20/40, G06V20/635, G06V30/153, G06V30/10, G06N3/045, G06F18/214
Inventor: 秦勇, 李兵
Owner: BEIJING YIZHEN XUESI EDUCATION TECH CO LTD