Text multi-feature ambiguity resolution method and system
An ambiguity resolution, multi-feature technology, applied in the field of text multi-feature ambiguity resolution methods and systems, can solve problems such as ambiguity, and achieve accurate text classification and recognition.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] In this embodiment 1, a text multi-feature ambiguity resolution system is provided, the system includes:
[0039] A dissolving module, the dissolving module is configured to: use the trained dissolving model to identify and extract the combined ambiguity field in the text to be disintegrated, and perform the extraction on the extracted text according to the context relevance and part-of-speech features of the words in the text Segmentation to obtain the text after ambiguity resolution; wherein, the trained resolution model is obtained by training a training set, and the training set includes a feature vector composed of text weight features, context-related features and part-of-speech features of the text where the ambiguous field is located .
[0040] In this embodiment 1, the above-mentioned system is used to implement the text multi-feature ambiguity resolution method, which includes: inputting the text to be digested into the trained resolution model, and identifyin...
Embodiment 2
[0049] In this embodiment 2, combined with the characteristics of concise, fuzzy and unstructured language of TCM medical record texts, a method that combines multiple features is proposed to improve the existing ambiguity resolution algorithm and design a method suitable for TCM medical records. The multi-feature ambiguity resolution model of case text is applied to TCM medical case text for text disambiguation recognition, which further improves the accuracy of word segmentation of TCM medical case text.
[0050] In this embodiment 2, the multi-feature ambiguity resolution method for the TCM proposal text includes: inputting the TCM medical case text to be resolved into the trained resolution model, identifying and extracting the combined ambiguity field in the text, according to The contextual relevance and part-of-speech features of the words in the text are used to segment the extracted text to obtain the disambiguated text; wherein, the trained resolution model is obtaine...
Embodiment 3
[0148] Present embodiment 3 is aimed at the multi-feature ambiguity resolution of Chinese medical record text, provides a kind of multi-feature ambiguity resolution method of Chinese medical record text, and builds multi-feature ambiguity resolution model, is divided into four stages: (1) for combination type The problem of ambiguity resolution is to build a combined ambiguous thesaurus to identify and extract ambiguous fields. (2) Select an appropriate context window; (3) Generate a feature vector by extracting the weight feature, context word feature and part-of-speech feature of the text. (4) Input the feature vector into the nonlinear SVM, and train the "combine" classifier and "separate" classifier through the training set text to realize the resolution of combined ambiguity in the text of TCM medical records and improve the accuracy of word segmentation in TCM medical case texts Rate.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com