Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for automatically extracting subject of argumentative article

A subject and article technology, applied in natural language data processing, special data processing applications, instruments, etc., can solve problems such as not covering all rules, not being able to update in real time, and poor flexibility.

Inactive Publication Date: 2017-07-07
贺惠新
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] There are many existing processing methods to automatically extract the main body of the discussion from the discussion article, including the rule-based method. This method has achieved certain results, but due to the diversity of natural language sentence patterns, this method cannot cover Discuss all the rules in subject extraction, and cannot be updated in real time, poor flexibility

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically extracting subject of argumentative article
  • Method for automatically extracting subject of argumentative article
  • Method for automatically extracting subject of argumentative article

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0131] Below with the accompanying drawings figure 1 with figure 2 This embodiment will be described.

[0132] The method designed in the present invention is made up of training model and application model two stages, and it comprises the following steps:

[0133] training phase

[0134] Training step 1: Obtain dependent resources in the model training phase: Obtain a set of NS sentences S={S(i) } is the training corpus, and each sentence is recorded as S(i), where 1≤i≤NS, and NS≥10000 is required; obtain the artificially summarized important vocabulary dictionary Di;

[0135] Training step 2: Generate a common word dictionary Dc based on the training sentences; the specific implementation steps are:

[0136] Training step 21: In the sentence of each training sentence, the start and end order of all the strings of the subject in this sentence are divided, and multiple substrings are formed, and each substring is removed. The character string corresponding to the subject ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a method for automatically extracting a subject of an argumentative article, and belongs to the technical application field of natural language processing. The method disclosed by the present invention comprises: based on the sequence annotation strategy of the random field of the statistical condition, by analyzing semantic features and position characteristics of the subject in the title of the argumentative article and combining with performance of the trained corpus, establishing a commonly used word dictionary and an important word dictionary; using information such as dictionaries and words, locations and the like to carry out sequence feature annotation on the title of the argumentative article; and using the annotated corpus to train and generate the model, so that unknown data can be predicted, the relatively high accuracy can be ensured, and the applicability of the algorithm in different scenarios can be improved. According to the method disclosed by the present invention, automatic extraction of the subject in the argumentative article by the computer can be effectively realized, the main display object of the article can be displayed in an intuitive form, related information of the object can be quickly mastered by the reader in a facilitated manner, related content retrieval and comparison can be facilitated, and automatically extracted phrases can be provided for the computer to carry out various follow-up analysis.

Description

technical field [0001] The invention relates to a method for automatically extracting a discussion subject of a discussion article, belonging to the field of computer technology application of natural language processing. Background technique [0002] An expositional article is a written form in which the author expresses various research and analysis processes and conclusions through writing for a certain discourse subject. Among them, the discussion subject is the core object of the discussion article, including attribute examples such as objective things, theories, events, processes, relationships, etc., which can efficiently and clearly locate the focus of the corresponding article. The extraction and display of the main body of the discussion can present the main display target of the article in an intuitive form, which helps readers quickly grasp the relevant information of this object, and facilitate the retrieval and comparison of related content. [0003] However, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/211G06F40/35G06F18/214
Inventor 贺惠新
Owner 贺惠新
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products