Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

110 results about "Document structuring" patented technology

Document Structuring is a subtask of Natural language generation, which involves deciding the order and grouping (for example into paragraphs) of sentences in a generated text. It is closely related to the Content determination NLG task.

A method and a terminal for creating paper document structured data based on a deep learning model

The invention relates to a method and a terminal for creating paper document structured data based on a deep learning model. The method comprises the following steps: training a sample set through a preset document; wherein each sample in the document sample set comprises a paper document OCR recognition result and a labeled document corresponding to the paper document OCR recognition result; wherein the labeled document records position information and category information of each key field in the OCR recognition result of the paper document; training a preset first deep learning model by using the training sample set to obtain a second deep learning model; enabling the second deep learning model to analyze a first paper document OCR recognition result to obtain position information and category information of each key field in the first paper document OCR recognition result; and creating a structured document corresponding to the first paper document OCR recognition result accordingto the position information and the category information of each key field in the first paper document OCR recognition result. The accuracy of converting the OCR result of the paper document into thestructured document is improved.
Owner:厦门商集网络科技有限责任公司

Financial document information processing method and device, electronic equipment and storage medium

The embodiment of the invention discloses a financial document information processing method and device, electronic equipment and a storage medium. The financial document information processing methodcomprises the steps: enabling a to-be-audited financial document to generate document structural data through a document processing module; generating financial subject structured data based on the document structured data; inputting the document structured data into a text error correction model, and outputting an error correction result; inputting the document structured data into a manager information casual inspection and verification module to generate a verification result of manager information; respectively inputting financial subject structured data into a financial index formula calculation module, a financial subject change verification module and a financial statement extraction verification module; respectively generating a verification result of the financial index formula,a verification result of financial subject change and a verification result of financial subject data and corresponding reference data; and displaying all verification results and error correction results. According to the technical scheme provided by the embodiment of the invention, the financial document auditing efficiency can be improved.
Owner:DATAGRAND TECH INC

Document structured data embedding method and system

The invention relates to the field of computer knowledge management. The invention relates to the field of data embedding, in particular to a document structured data embedding method and system. Thesystem comprises a template generator, a document editor, a structured data collector, a data authentication processor, a structured data controller, a template library and a data extraction and conversion interface, the method specifically comprises the following steps of: constructing a document structured framework template; pre-loading into a document editor; editing of the structured data label and the extensible semi-structured data label is completed, the edited document data is extracted and converted into xml structural body data and document attribute fields, the structural body datais embedded into a target format file, and the structured data and the extensible semi-structured data in the structural body data are extracted. By means of the method, the documents can meet the requirements for manual reading, understanding, using and filing, automatic collection and processing of the documents embedded with the structural data can be achieved, and the requirements for the standardization degree and data precision of the documents can be effectively controlled.
Owner:XINING NINGGUANG ENG CONSULTATION +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products