Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Neural network training method, document image understanding method, device and equipment

A document image and training method technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problem of unsatisfactory performance of document scenes with highly matched graphic information, and achieve the effect of enhancing interactivity

Active Publication Date: 2022-03-08
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF9 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Common graphic-text interaction tasks perform well in conventional multimodal scenarios, but the performance in document scenarios with highly matching graphic information is not satisfactory

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network training method, document image understanding method, device and equipment
  • Neural network training method, document image understanding method, device and equipment
  • Neural network training method, document image understanding method, device and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0034] In the present disclosure, unless otherwise stated, using the terms "first", "second", etc. to describe various elements is not intended to limit the positional relationship, temporal relationship or importance relationship of these elements, and such terms are only used for Distinguishes one element from another. In some examples, the first element and the second ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a neural network training method, a document image understanding method, a device and equipment, and relates to the field of artificial intelligence, in particular to a computer vision technology, an image processing technology, a character recognition technology, a natural language processing technology and a deep learning technology. The training method comprises the steps of obtaining text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a sample image comprising a plurality of first regions and a real label indicating whether each first region is a replaced region; acquiring image comprehensive features of the plurality of first areas; inputting the text comprehensive features of the plurality of first texts and the image comprehensive features of the plurality of first regions into a neural network model at the same time to obtain text representation features of the plurality of first texts; determining a prediction tag based on the text representation features of the plurality of first texts; and training a neural network model based on the real tag and the predicted tag.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence, in particular to computer vision technology, image processing technology, character recognition technology, natural language processing technology and deep learning technology, and in particular to a training method for a neural network model for document image understanding, a method using Method for document image understanding of neural network model, training device for neural network model for document image understanding, device for document image understanding using neural network model, electronic equipment, computer-readable storage medium and computer program products. Background technique [0002] Artificial intelligence is a discipline that studies the use of computers to simulate certain human thinking processes and intelligent behaviors (such as learning, reasoning, thinking, planning, etc.), both at the hardware level and at the software level. Artificial intelligen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06V30/41G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06V10/82G06V30/413G06V30/1444G06V30/19147
Inventor 彭启明罗斌曹宇慧冯仕堃陈永锋
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products