Semantic frame-based power grid defect text mining method
A semantic framework and text mining technology, applied in semantic analysis, digital data processing, natural language data processing, etc., can solve problems such as heavy workload, time-consuming and labor-intensive, and difficulty in verifying the correctness of classification and statistical work. Easy application and high statistical accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0022] The specific implementation steps of the present invention are further described below in conjunction with examples:
[0023] Step 1: Word segmentation. Defective text is segmented based on Hidden Markov Model (HMM, Hidden Markov Model).
[0024] Step 2: word frequency feature extraction. Perform word frequency statistics on word segmentation results, sort words from high frequency to low frequency, and remove stop words such as symbols, names of people, and places.
[0025] Step 3: Co-occurrence feature extraction. The four slots Pb, Ps, A, and C rarely appear together. Most of the semantic frames in defect texts have slots missing. The non-core slots Pb and C are often missing, and the core slots Ps and A always exist (extremely Except in some special cases).
[0026] Step 4: Lexeme Feature Extraction. The position sequence of the four grooves has strong regularity, and the most typical arrangement sequence is Pb-Ps-A-C, Pb-Ps-C-A.
[0027] Step 5: Build ontolog...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com