Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Domain-Oriented Chinese Text Topic Sentence Generation Method

A topic sentence and topic technology, which is applied in the field of topic extraction from Chinese texts, can solve the problem that the topic extraction method cannot get the topic content description, etc., and achieve the effect of strong application applicability and good generation effect

Active Publication Date: 2021-08-27
DONGHUA UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is: the existing topic extraction method cannot obtain a complete description of the topic content, and the text is mainly described through the topic keywords

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Domain-Oriented Chinese Text Topic Sentence Generation Method
  • Domain-Oriented Chinese Text Topic Sentence Generation Method
  • Domain-Oriented Chinese Text Topic Sentence Generation Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the present invention more comprehensible, preferred embodiments are described in detail below with accompanying drawings.

[0035] To extract the topic statement of the text, it is not only necessary to extract the keywords in the text, but also to organize these keywords into short sentences in the correct sentence pattern. For example, for a sentence in the field of urban community management: "There is white garbage on the lawn of the happy community." The generated topic phrase should be: "There is white garbage on the lawn."

[0036] In order to accomplish this goal, a field-oriented Chinese text topic sentence generation method provided by the present invention divides the entire topic statement generation process into 3 steps: (1) Establish domain knowledge graph (2) Semantic information extraction (3) Sentence statement Classify and generate topics. figure 1 A flowchart implemented for this process.

[0037] Step 1: Create domain knowledge gr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a field-oriented Chinese text topic sentence generation method, which is characterized in that it includes the following steps: facing the field text data set, establishing a corresponding field knowledge map, applying a deep neural network model to extract semantic information from the text, according to The topic sentence pattern classifies the text and finally generates the topic sentence of the text. The invention obtains the conceptual model of the data set and the characteristics of the content narrative mode by creating a domain knowledge map, and uses the deep learning model to label and classify the text data, and then generate the topic sentence of the text to realize knowledge-based query and statistics. This method has strong application applicability, and has a good effect of generating topic sentences for limited domain data sets.

Description

technical field [0001] The invention relates to a method for extracting topics from Chinese texts, in particular to a method for summarizing description features of domain texts based on domain data sets and generating topic sentences for texts. Background technique [0002] In recent years, with the development of artificial intelligence technology, computers have achieved many valuable results in natural language understanding. Topic extraction is an important branch in the field of text mining, which plays a very important role in search engines, text classification, and information statistics. How to refine and accurately extract the subject information from the text is the key to understanding the content of language expression, and has always been a research hotspot in this field. [0003] Due to the diversity and complexity of Chinese semantics and sentence structures, it is difficult to directly extract topics from texts. In order to obtain the main information of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/258
CPCG06F40/258
Inventor 宋晖刘栩彤戴龙其叶长晖岳万琛
Owner DONGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products