Chinese language lexical analysis method based on linear model
A technology of lexical analysis and Chinese, applied in the field of statistical natural language processing, can solve the problem of single word segmentation model, achieve strong generalization ability and improve accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0042] In the following, the present invention will be further described by taking an analysis method using a perceptron classifier, a word sequence language model, a part-of-speech tag sequence language model, and a co-occurrence score model of a word-part-of-speech pair set as an example.
[0043] Each model in the present invention is trained in a corpus, and the corpus is a collection of sentences that have undergone word segmentation and part-of-speech tagging. Word segmentation and part-of-speech tagging are done manually by human experts. On this corpus, the machine learning model can learn the knowledge of word segmentation and part-of-speech tagging. This learned knowledge comes in handy when faced with new labeled sentences waiting to be segmented.
[0044] Firstly, the perceptron classifier model and the upper linear model based on the linear interpolation model (ie, the linear lexical analysis model) in the present invention are respectively introduced.
[0045] ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com