Unlock instant, AI-driven research and patent intelligence for your innovation.

Text abstract generation method and device, computer equipment and storage medium

A summary and text technology, applied in the field of natural language processing, can solve the problem of low accuracy

Pending Publication Date: 2022-08-09
华润数字科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the embodiments of the present application is to propose a method, device, computer equipment, and storage medium for generating text abstracts that integrate entity information, so as to solve the problem of low accuracy in traditional text abstract generation methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text abstract generation method and device, computer equipment and storage medium
  • Text abstract generation method and device, computer equipment and storage medium
  • Text abstract generation method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] continue to refer to figure 2 , which shows a flow chart of the implementation of the method for generating a text abstract by merging entity information provided in Embodiment 1 of the present application. For the convenience of description, only the part related to the present application is shown.

[0046] The above-mentioned method for generating text summaries fused with entity information includes the following steps:

[0047] Step S201: Acquire raw text data to be processed.

[0048] Step S202: Perform entity extraction on the original text data to obtain entity text data and entity type data.

[0049] Step S203: Perform a first fusion operation on entity text data and entity type data to obtain entity fusion data.

[0050] In the embodiments of this application, refer to image 3 , showing a schematic structural diagram of the Encoder module provided by the embodiment of the present application, and it is assumed that the original text that needs to extract ...

Embodiment 2

[0099] further reference Figure 7 , as a response to the above figure 2 For the implementation of the method shown, the present application provides an embodiment of a text abstract generating device that integrates entity information, and the device embodiment is the same as figure 2 Corresponding to the method embodiments shown, the apparatus can be specifically applied to various electronic devices.

[0100] like Figure 7 As shown, the apparatus 200 for generating text abstracts fused with entity information in this embodiment includes: a data acquisition module 210 , an entity extraction module 220 , a first fusion module 230 , a vector conversion module 240 , a digest encoding module 250 and a digest decoding module 260 . in:

[0101] a data acquisition module 210, configured to acquire raw text data to be processed;

[0102] The entity extraction module 220 is used to perform entity extraction operation on the original text data to obtain entity text data and ent...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention belongs to the technical field of natural language processing in artificial intelligence, and relates to a text abstract generation method and device fusing entity information, computer equipment and a storage medium. According to the method, an entity mapping layer (hereinafter referred to as Entity Embedding) and an entity type mapping layer (hereinafter referred to as TypeEmbedding) are added on the basis of an original network, entity information of an input text is mapped to two vectors with the same dimension, and the amount of information accepted by a model is increased; and meanwhile, a word-entity cross attention layer is added between the multi-head attention layer and the feedforward neural network layer of each sub-module, so that the expression ability of the model to entities is enhanced, and important information is accurately extracted by a decoder.

Description

technical field [0001] The present application relates to the technical field of natural language processing, and in particular, to a method, apparatus, computer equipment and storage medium for generating text summaries that integrate entity information. Background technique [0002] With the advent of the era of big data, the explosive growth of text data on the Internet has resulted in people having to spend a lot of time browsing and understanding the corresponding text, and inevitably missing some important information. Therefore, how to quickly and efficiently obtain important information from a large amount of text becomes more and more important. Automatic summarization technology is an effective way to alleviate this problem. [0003] There is an existing method for generating text summaries, that is, an encoder-decoder model using deep learning. Specifically, the encoder is responsible for vector encoding the original text to extract semantic information; the dec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06N3/04
CPCG06F40/295G06N3/047G06N3/045
Inventor 陈焕坤王伟黄勇其张黔
Owner 华润数字科技有限公司