Dependency parsing field self-adaption method based on web search

A technology of dependency syntax and network search, applied in network data retrieval, other database retrieval, natural language data processing and other directions, can solve the problem of insufficient self-learning ability of new features, and achieve the effect of improving the performance of dependency syntax analysis.

Active Publication Date: 2014-03-19
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These new features can be learned through collaborative training (co-training) and self-training (self-trai

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dependency parsing field self-adaption method based on web search
  • Dependency parsing field self-adaption method based on web search
  • Dependency parsing field self-adaption method based on web search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] The domain adaptive method of dependency syntax using web search can be divided into two modules: a candidate dependency syntax tree reordering model based on word semantic relationship and a candidate dependency syntax tree reordering model based on network search for rationality evaluation of candidate dependency syntax trees.

[0014] The overall framework of the candidate dependency syntactic tree reordering model based on the semantic relationship of words is basically the same as that of the traditional reordering model. The difference is that the traditional reordering model uses a supervised learning mechanism, and the candidate dependency syntactic tree reordering model based on the semantic relationship of words needs to use tree bank resources as the basis for evaluating each candidate dependency syntax tree during the learning process. . However, the present invention is oriented to domain-adaptive reordering tasks, without treebank resources as the learning...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A dependency parsing field self-adaption method based on web search comprises the following steps: establishing a reorder model according to K-Best candidate dependency parsing trees, and generating a group of candidate dependency parsing trees by a reference model, wherein each candidate dependency parsing tree corresponds to a probability to define initial sequences of dependency parsing results, and on the basis of the initial sequences, the reorder model tries to use the syntax characteristics newly increased in a target field to improve the initial syntax tree sequence; evaluating the dependent relationship between words of the candidate dependency parsing trees based on the web search: at first, splitting a dependency parsing tree into the collection of a group of word relations, wherein the intensity of each word semantic relation in the collection is calculated on the basis of the web search respectively, and then the evaluation level is determined according to the weight of the word relations. Experiment results show that a maltparser trained on a penn Chinese treebank shows favorable performance in biomedicine data.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a domain adaptive method using the dependency syntax of network search. Background technique [0002] Syntactic analysis is a basic research problem in natural language processing, and it plays an important supporting role in applications such as machine translation, question answering systems, and information extraction. Dependency syntax has attracted much attention for its concise form, easy labeling, and easy application. In recent years, the research on dependency parsing has made great progress. At present, the best dependency parser (sometimes, also called "dependency parsing model") has an accuracy rate of more than 90% for English dependency parsing, and the accuracy rate for Chinese dependency parsing has also reached More than 80%. However, this is only achievable performance in restricted domains such as news. However, in upper-level applicatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/95G06F40/211
Inventor 周光有赵军
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products