Constraint conditional random field-based Vietnamese noun chunk identification method
A technology of constraints and recognition methods, applied in natural language translation, semantic tool creation, natural language data processing, etc., to achieve good recognition results and improve the effect of lexical analysis
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] Embodiment 1: as Figure 1-2 Shown, based on the Vietnamese noun block recognition method of constraint random field, the concrete steps of described method are as follows:
[0031] Step1. Building a corpus of noun chunks: First, crawl text corpora from Vietnamese websites, perform word segmentation, part-of-speech tagging, and manually mark noun phrases, and then manually proofread, mark, and deduplicate to form a corpus of Vietnamese noun chunks; Vietnamese nouns Part of the corpus in the chunk corpus is used to construct constraints, as training corpus and test corpus;
[0032] Step2, build constraints: from the Vietnamese noun chunk corpus, select the part-of-speech characteristics of the noun chunks according to the Vietnamese grammatical characteristics, and construct constraints in combination with the characteristics;
[0033] Step3. Construct a Vietnamese noun chunk recognition model based on constrained random fields: first, use conditional random fields to t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


