A method for automatic abstracting for electronic official documents of enterprises

A technology of electronic official documents and automatic summaries, which is applied in the fields of electronic digital data processing, special data processing applications, and natural language data processing. clear effect

Inactive Publication Date: 2017-02-15
STATE GRID FUJIAN ELECTRIC POWER CO LTD +1
View PDF6 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the actual use process, we found that the operation results of the above algorithms are relatively unstable, and cannot continue to obtain satisfactory results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for automatic abstracting for electronic official documents of enterprises
  • A method for automatic abstracting for electronic official documents of enterprises
  • A method for automatic abstracting for electronic official documents of enterprises

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The technical solution of the present invention will be specifically described below in conjunction with the accompanying drawings.

[0044] A method for automatic summarization of enterprise electronic official documents according to the present invention comprises the following steps,

[0045] S1. Document preprocessing: obtain the title of the document, and extract the plain text stream from corporate documents in various formats; then, based on the plain text stream of the document, punctuation marks representing the end of the sentence include periods, semicolons, and exclamation points as a delimiter, divide the document into sentences, and obtain all sentence structures of the document;

[0046] S2. Normalized representation: document normalization is to represent documents with mathematical vectors and matrices, and adjust word segmentation weights, which are used in the subsequent sentence sorting process;

[0047] S3. Preliminary sorting of sentences: use the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for automatic abstracting for electronic official documents of enterprises. Different from common mainstream abstract extracting algorithms, the method is mainly used for automatic abstracting for electronic official documents of enterprises, so that the characteristics (strong topicality of documents and abundant information of the titles of the documents) of electronic official documents of enterprises can be fully utilized and the conventional algorithms can be creatively modified and combined according to the characteristics. According to tests, the method can effectively improve the effect of automatic abstracting for electronic official documents of enterprises.

Description

technical field [0001] The invention relates to a large-scale enterprise-oriented method for automatic summarization of electronic official documents of enterprises, in particular to a method for automatic summarization of electronic official documents of enterprises. Background technique [0002] With the deepening of informatization construction, more and more processes in enterprises are running online, and a large amount of business operation information exists in the form of electronic documents. Enterprise documents usually have a limited number of edits, but because they carry specific business information, they are usually read in large quantities. The number of readers and the number of times far exceed the number of edits. Therefore, if we can study the abstract extraction technology of official document electronic documents, extract the key content from a large amount of historical official document information, and present it to users in the form of abstract, it ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/345G06F40/205G06F40/211G06F40/216G06F40/284
Inventor 蔡宇翔付婷蔡力军苏运东肖琦敏王雪晶陈锐宋立华张垚
Owner STATE GRID FUJIAN ELECTRIC POWER CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products