System and method for automating information abstraction process for documents

A document processing technology, applied in the field of document processing automation, that addresses the inadequacy of traditional computer systems for automated information abstraction.

Active Publication Date: 2016-12-21
ACCENTURE GLOBAL SERVICES LTD
5 Cites, 25 Cited by

AI Technical Summary

Problems solved by technology

When document structures are considered for automated information abstraction of documents, conventional computer systems may not be adequate.

Embodiment Construction

[0029] The principles described herein can be implemented in many different forms. Not all depicted components may be required, however, and some implementations may include additional components. Changes may be made in the arrangement and type of components without departing from the spirit or scope of the claims presented herein. Additional, different, or fewer components may be provided.

[0030] References throughout this specification to "one example," "an example," "examples," "one embodiment," "an embodiment," "example embodiments," etc., in the singular or plural, mean that one or more specific features, structures, or characteristics described in conjunction with the embodiment or example are included in at least one embodiment or example of the present disclosure. Thus, throughout this specification the phrases "in one embodiment," "in an embodiment," "in an example embodiment," "in one example," "in an example," in the singular or in the plural, appear in vario...

Abstract

Embodiments of the invention generally relate to a system and a method for automating the information abstraction process for documents, and in particular to a computer-implemented method, a processing pipeline, and a system that create a hierarchical semantic map of a document and extracted information. The method includes: apportioning the document into major sections by accessing the document, recognizing a hierarchical structure of the document, and dividing the document into the major sections by using a data analyzer and a machine learning module; classifying the major sections and mapping the major sections to key elements in one of the multiple levels; searching one major section and identifying sub-sections from that major section to achieve a maximum confidence score indicating that the sub-sections are associated with the key element; extracting the information from the identified sub-sections by using sequence modelers and linguistic characteristics provided by the data analyzer; generating the hierarchical semantic map of the document by using the extracted information; and displaying drop-down selections of the key elements in a user interface.
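The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the heading regex stands in for the data analyzer and machine learning module, a keyword-overlap ratio stands in for the confidence score, and a digit regex stands in for the sequence modelers. The names `KEY_ELEMENTS`, `SAMPLE`, `split_major_sections`, `confidence`, and `build_semantic_map` are hypothetical and do not come from the source.

```python
import re
from typing import Dict, List

# Hypothetical key elements and cue words; the source does not list any.
KEY_ELEMENTS: Dict[str, List[str]] = {
    "rent": ["rent", "payment", "monthly"],
    "termination": ["terminate", "termination", "notice"],
}

# Hypothetical sample document with two numbered major sections.
SAMPLE = (
    "1. Rent\n"
    "The tenant shall pay monthly rent of 1000 USD. Payment is due on the first day.\n"
    "2. Termination\n"
    "Either party may terminate with 30 days notice. The deposit is refundable.\n"
)


def split_major_sections(text: str) -> Dict[str, str]:
    """Apportion the text into major sections on headings like '1. Title'.

    Assumption: a regex replaces the structure recognition described in the abstract.
    """
    parts = re.split(r"(?m)^(\d+\.\s+.+)$", text)
    # parts is [preamble, heading1, body1, heading2, body2, ...]
    return {parts[i].strip(): parts[i + 1].strip() for i in range(1, len(parts) - 1, 2)}


def confidence(text: str, cues: List[str]) -> float:
    """Toy confidence score: the fraction of cue words that occur in the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return sum(cue in words for cue in cues) / len(cues)


def build_semantic_map(text: str) -> Dict[str, dict]:
    """Build a nested map: key element -> section -> best sub-section -> values."""
    semantic_map: Dict[str, dict] = {}
    for heading, body in split_major_sections(text).items():
        # Classify the major section: pick the key element with the highest score.
        key = max(KEY_ELEMENTS, key=lambda k: confidence(heading + " " + body, KEY_ELEMENTS[k]))
        # Treat sentences as sub-sections and keep the one with maximum confidence.
        sentences = re.split(r"(?<=\.)\s+", body)
        best = max(sentences, key=lambda s: confidence(s, KEY_ELEMENTS[key]))
        # In this toy version a later section with the same key element
        # overwrites an earlier one.
        semantic_map[key] = {
            "section": heading,
            "sub_section": best,
            "confidence": confidence(best, KEY_ELEMENTS[key]),
            # Stand-in for sequence-model extraction: collect the numbers.
            "values": re.findall(r"\d+", best),
        }
    return semantic_map


if __name__ == "__main__":
    for key, entry in build_semantic_map(SAMPLE).items():
        print(key, entry)
```

The top-level keys of the returned dictionary play the role of the key elements that a user interface could offer as drop-down selections; each entry records the major section, the highest-scoring sub-section, and the values extracted from it.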

Description

[0001] Cross References to Related Applications

[0002] This application claims the benefit of Indian Provisional Application No. 2902/CHE/2015, filed June 10, 2015, which is hereby incorporated by reference in its entirety.

Technical Field

[0003] The present disclosure relates to the field of document processing automation, and more particularly to systems and methods for automating information abstraction processing of large documents.

Background

[0004] Computer systems can be used to process text documents containing information. A computer system can create summaries that preserve the emphasis of the original document. When document structures are considered for automated information abstraction of documents, traditional computer systems may not be adequate. Because of this, there are technical problems to be solved to automatically abstract concrete, well-defined information from documents by using computer systems and data processing techniques. Cont...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F40/00
CPC: G06F16/35, G06F16/36, G06F16/345, G06F40/151, G06F40/258, G06V30/416, G06V30/142, G06V30/2504, G06F18/285, G06F18/2113, G06V30/413, G06V30/414
Inventor: S·森古普塔, A·K·莫哈默德拉席德, C·拉克施米纳拉希姆汉, M·卡珀, J·乔治, M·斯里瓦斯塔瓦, V·萨曼斯, R·G·纳塔拉简, S·斯瓦米
Owner: ACCENTURE GLOBAL SERVICES LTD