An Automatic Text Label Extraction Method Combining Topic Model and Semantic Analysis
A technology of semantic analysis and topic model, applied in semantic analysis, natural language data processing, instruments, etc., can solve the problems of unrealistic labeling of training sets, time-consuming and laborious labeling, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0042] The present invention will be further described below in conjunction with accompanying drawing:
[0043] Such as figure 2 Shown: A text label automatic extraction method combining topic model and semantic analysis, including the following steps:
[0044] The first step: preprocessing;
[0045] Step 2: LDA modeling and context analysis;
[0046] The third step: label extraction.
[0047] The preprocessing method of the first step is: if there are low-frequency words, stop words and tag information, the preprocessing includes removing low-frequency words, removing stop words and removing tag information; the low-frequency words are only in one or two texts appear, the stop words are auxiliary words that carry almost no information, words that reflect the grammatical structure of the sentence, all function words, and punctuation marks; the markup information is web page text or other markup language text information; other markup language text information including htm...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com