Analysis Method of Railway Accident Causes Based on Word Expansion LDA
A technology of accident causes and analysis methods, applied in semantic analysis, instruments, data processing applications, etc., can solve problems such as the decline of expert judgment ability and the impact of subjective accident analysis results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0046] like figure 1 As shown, Embodiment 1 of the present invention provides a method for analyzing the cause of a railway accident, the method comprising the following steps:
[0047] Step S110: use TF-IDF to represent the railway accident text, construct a document vector space model, and generate document vectors;
[0048] Step S120: using the TextRank method to calculate the importance of words in the railway accident text;
[0049] Step S130: according to word importance and semantic similarity, weight the words that meet the semantic threshold, and train and generate word expansion LDA model;
[0050] Step S140: using the word expansion LDA model to extract the features of the railway accident text, and extracting the cause of the railway accident theme and feature items;
[0051] Step S150: use the SVM accident classification model to classify the text of the railway accident report, and determine the data set of the cause of the railway accident;
[0052] Step S160...
Embodiment 2
[0064] like figure 2 As shown, Embodiment 2 of the present invention provides a method for constructing a word-extended LDA topic model based on word importance and semantic similarity, which method includes the following process steps:
[0065] Step 1.1, use the TextRank method to calculate the importance of words in the document
[0066] Specifically, the given accident text is divided into complete sentences, each sentence is segmented, and stop words are removed, each sentence is represented as a set of phrases, a word graph is constructed, and then any co-occurrence relationship is used to construct any The edge between two words, only when two words co-occur in a fixed-length window, there is an edge between them, initialize the importance of all words, and calculate the importance of each word through multiple iterations, by setting The maximum number of iterations is to control the calculation, and the final iteration result is defined as the importance of words, and...
Embodiment 3
[0076] like image 3 As shown, the third embodiment of the present invention provides a text classification method based on two-level accident causes using the SVM accident classification model. The method includes the following steps:
[0077] Step 2.1, constructing the improved HFACS-RAs model.
[0078] The word expansion LDA topic model that the training of embodiment two generates carries out subject feature extraction to accident text, and each subject selects the top eight subject words of frequency ranking as the accident cause characteristic item, constitutes the accident cause characteristic space; From the implication of subject word It can identify the human factors and organizational classifications in current accidents, and based on the content extracted from the accident text features, an improved HFACS-RAs model is designed on the basis of the HFACS-RAs model, such as Figure 4 As shown, the "preconditions for unsafe behaviors" are further divided into "persona...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com