Multi-word expression extraction method and device
A technology for obtaining devices and vocabulary, which is applied in special data processing applications, instruments, and electronic digital data processing, etc., can solve the problems of low accuracy of multi-word expression database and inability to obtain multi-word expressions at one time, so as to improve utilization rate and improve The effect of accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0031] A multi-word expression extraction method, the method comprises the steps of the following sequence: (1) the document base adopts preprocessing such as word segmentation and part-of-speech tagging to form a source language document; (2) calculates the mutual information of adjacent words in the multi-document, and Further calculate the jump information before and after the mutual information sequence; (3) The mutual information sequence and the jump information sequence form a two-dimensional mutual information set; (4) The two-dimensional mutual information set uses a classifier to express inliers and outliers for multiple words , to filter continuous internal point links to construct multi-word expressions. Such as figure 1 shown.
[0032] Combine the following figure 1 The present invention is further described.
[0033] In the step (1), Chinese word segmentation, part-of-speech tagging, named entity recognition, and part-of-speech selection are performed on all t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


