Domain-oriented Chinese text topic sentence generation method

A topic sentence and topic technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem that topic extraction methods cannot obtain topic content description, and achieve the effect of strong application applicability and good generation effect.

Active Publication Date: 2018-11-27
DONGHUA UNIV
View PDF5 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is: the existing topic extraction method canno

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Domain-oriented Chinese text topic sentence generation method
  • Domain-oriented Chinese text topic sentence generation method
  • Domain-oriented Chinese text topic sentence generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the present invention more comprehensible, preferred embodiments are described in detail below with accompanying drawings.

[0035] To extract the topic statement of the text, it is not only necessary to extract the keywords in the text, but also to organize these keywords into short sentences in the correct sentence pattern. For example, for a sentence in the field of urban community management: "There is white garbage on the lawn of the happy community." The generated topic sentence should be: "There is white garbage on the lawn."

[0036] In order to accomplish this goal, a field-oriented Chinese text topic sentence generation method provided by the present invention divides the entire topic statement generation process into 3 steps: (1) Establish domain knowledge graph (2) Semantic information extraction (3) Sentence statement Classify and generate topics. figure 1 A flowchart implemented for this process.

[0037] Step 1: Create domain knowledge ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a domain-oriented Chinese text topic sentence generation method. The method is characterized by comprising the following steps of establishing a corresponding domain knowledge map for a domain-oriented text data set, using a deep neural network model for extracting semantic information from texts, classifying the texts according to topic sentence patterns, and finally generating topic sentences of the texts. A data set conceptual model and content narrative mode characteristics are obtained by creating the domain knowledge map, and a deep learning model is used for conducting labeling and classifying training on text data, so that the topic sentences of the texts are generated, and knowledge-based query and statistics are achieved. The method has high application applicability and a good topic sentence generation effect on the limited domain data set.

Description

technical field [0001] The invention relates to a method for extracting topics from Chinese texts, in particular to a method for summarizing description features of domain texts based on domain data sets and generating topic sentences for texts. Background technique [0002] In recent years, with the development of artificial intelligence technology, computers have achieved many valuable results in natural language understanding. Topic extraction is an important branch in the field of text mining, which plays a very important role in search engines, text classification, and information statistics. How to refine and accurately extract the subject information from the text is the key to understanding the content of language expression, and has always been a research hotspot in this field. [0003] Due to the diversity and complexity of Chinese semantics and sentence structures, it is difficult to directly extract topics from texts. In order to obtain the main information of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F40/258
Inventor 宋晖刘栩彤戴龙其叶长晖岳万琛
Owner DONGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products