
Chinese implicit discourse relation identification method

A discourse relation identification technique in the field of natural language processing that addresses the lack of lexical cues in implicit discourse relations.

Active Publication Date: 2016-09-21
INST OF AUTOMATION CHINESE ACAD OF SCI
3 Cites, 29 Cited by

AI Technical Summary

Problems solved by technology

Discourse fragments that hold an implicit discourse relation contain no conjunctions, so there is no obvious lexical information to help identify the relation between them.



Embodiment Construction

[0039] To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to specific embodiments and the accompanying drawings.

[0040] The basic idea of the present invention is to make appropriate use of the lexical, syntactic, and punctuation information of discourse units, and on that basis to propose a Chinese implicit discourse relation identification method. Figure 1 is a flow chart of the Chinese implicit discourse relation identification method according to an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:

[0041] Step 1: perform automatic word segmentation on the argument pair of a Chinese implicit discourse relation to obtain the automatic word segmentation result;

[0042] In this step, automatic word segmentation (Tokenize, Segmentation) is p...
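The excerpt above does not say which segmenter the embodiment uses, so the following is only a minimal sketch of Step 1 under an assumption: it applies greedy forward maximum matching against a toy dictionary to both arguments of a pair. The names `forward_max_match`, `segment_argument_pair`, and `TOY_VOCAB` are hypothetical and do not come from the patent.

```python
def forward_max_match(text, vocab, max_len=4):
    """Greedy left-to-right segmentation that prefers the longest dictionary match.

    Assumption: a dictionary-based segmenter stands in for whatever
    segmentation tool the patent actually uses.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first; a single character always matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def segment_argument_pair(arg1, arg2, vocab):
    """Segment both arguments of an implicit discourse relation."""
    return forward_max_match(arg1, vocab), forward_max_match(arg2, vocab)


# Hypothetical toy dictionary, for illustration only.
TOY_VOCAB = {"天气", "很", "好", "我们", "去", "公园"}

if __name__ == "__main__":
    print(segment_argument_pair("天气很好", "我们去公园", TOY_VOCAB))
```

In practice a statistical or neural segmenter would likely replace the dictionary lookup; the point here is only that each argument becomes a token sequence before features are learned.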


Abstract

The invention discloses a Chinese implicit discourse relation identification method. The method comprises the following steps: step 1, performing automatic word segmentation on the argument pair of a Chinese implicit discourse relation to obtain an automatic word segmentation result; step 2, learning feature representations of the arguments based on the obtained word segmentation result; step 3, modelling the Chinese implicit discourse relation between the arguments with a maximum-margin-based neural network model, using the obtained feature representations; and step 4, using the resulting neural network model to identify Chinese implicit discourse relations. The method identifies Chinese implicit discourse relations more accurately. In experiments on a Chinese discourse treebank, it achieves higher accuracy on Chinese implicit discourse relation identification than the existing English implicit discourse relation identification method.
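The abstract names a maximum-margin objective but this excerpt does not give its exact form. As a hedged illustration, the sketch below uses a common multi-class hinge loss over per-relation scores; the functions `max_margin_loss` and `predict`, the `margin` default, and the example scores are all assumptions, and the network that would produce the scores is omitted.

```python
def max_margin_loss(scores, gold, margin=1.0):
    """Multi-class hinge loss for one argument pair.

    scores: one model score per candidate discourse relation.
    gold:   index of the correct relation.
    Every wrong relation whose score is not at least `margin` below the
    gold score contributes a penalty. This formulation is an assumption,
    not the objective stated in the patent.
    """
    gold_score = scores[gold]
    return sum(
        max(0.0, margin - gold_score + score)
        for label, score in enumerate(scores)
        if label != gold
    )


def predict(scores):
    """Step 4 in miniature: pick the relation with the highest score."""
    return max(range(len(scores)), key=lambda k: scores[k])


if __name__ == "__main__":
    example_scores = [2.0, 0.5, 1.5]  # hypothetical scores for three relations
    print(max_margin_loss(example_scores, gold=0))
    print(predict(example_scores))
```

Training would minimize this loss over labelled argument pairs, which pushes the gold relation's score above every competing relation by at least the margin.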

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, and in particular to a method for identifying Chinese implicit discourse relations.

Background

[0002] As research at the character, word, phrase, and sentence levels has deepened and matured, more and more researchers have shifted their focus to the text level. A text, sometimes called a discourse, usually refers to a whole unit of language consisting of a series of consecutive clauses, sentences, or sentence groups; it is meaningful, conveys a complete message, and is cohesive and semantically coherent. Discourse has received attention mainly for the following reasons: (1) Unlike syntactic analysis, which uses words as the most basic unit of analysis, discourse analysis uses basic discourse units as its basic unit, and the segmentation of basic discourse units is usually similar to that of human beings. The ...


Application Information

IPC(8): G06F17/27, G06F17/28, G06N3/02
CPC: G06F40/211, G06F40/284, G06F40/30, G06F40/58, G06N3/02
Inventors: 张家俊, 李浩然, 宗成庆
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI