Method for mining data in the construction regulation field based on association rule mining technology

An association rule mining technique applied to data mining in the field of construction regulations. It addresses problems such as association analysis of outlier data and achieves dimensionality reduction.

Inactive Publication Date: 2010-02-24
XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY
Cites: 2; Cited by: 59

AI Technical Summary

Problems solved by technology

[0042] In view of the defects or deficiencies in the above-mentioned prior art, the purpose of the present invention is to provide a method for mining data in the field of construction regulations based on association rule mining technology. In the process of mining data in the field of construction regulations, the candidate feature words in each construction regulation text are arranged in descending order of t...


Examples

Embodiment 1

[0153] Step 1: Because construction regulation data from different provinces and cities have much in common, a cluster sampling strategy is used to draw a sample of 250 construction regulation texts issued in Shaanxi Province since 1949;

[0154] Step 2: For each construction regulation text, remove the high-frequency words among the above-mentioned text feature words; the remaining words serve as candidate feature words;

[0155] Step 3: For each text, count the candidate feature words, sort them by frequency in descending order, and truncate the list once the cumulative frequency reaches 85%;

[0156] Step 4: Merge the feature words of all texts into a single feature word set containing 362 feature words;

[0157] Step 5: Index each document by Boolean assignment to form a text vector space model;

[0158] Step 6: Transpose the text vector space model to obtain the feature vector space model;

[0159] Step 7: Sequentially extract featur...
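Steps 2 to 6 above can be illustrated with a minimal Python sketch. It is not the patent's implementation. The function names, the `stop_words` set standing in for the removed high-frequency words, and the choice to mark a feature as present whenever it occurs anywhere in a document are all assumptions made for illustration. Step 7 is truncated in this excerpt and is not covered.

```python
from collections import Counter


def select_feature_words(tokens, stop_words, cutoff=0.85):
    """Keep the most frequent candidate words of one text until their
    cumulative share of all candidate occurrences reaches `cutoff`."""
    # Step 2 (assumed form): drop the removed high-frequency words.
    counts = Counter(t for t in tokens if t not in stop_words)
    total = sum(counts.values())
    selected, running = [], 0
    # Step 3: descending frequency; ties broken alphabetically for determinism.
    for word, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        if total == 0 or running / total >= cutoff:
            break
        selected.append(word)
        running += n
    return selected


def build_boolean_vsm(docs, stop_words, cutoff=0.85):
    """Steps 4 and 5: merge per-text features, then index each document
    with 1 (feature occurs in the document) or 0 (it does not)."""
    per_doc = [select_feature_words(d, stop_words, cutoff) for d in docs]
    features = sorted(set().union(*per_doc))
    doc_sets = [set(d) for d in docs]
    matrix = [[1 if f in s else 0 for f in features] for s in doc_sets]
    return features, matrix


def transpose(matrix):
    """Step 6: rows become feature words, columns become documents."""
    return [list(row) for row in zip(*matrix)]
```

After transposition each row describes one feature word across all documents, which is the orientation needed for comparing feature words with one another.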


Abstract

The invention discloses a method for mining data in the construction regulation field based on association rule mining technology. The method comprises four steps: (1) a construction regulation text vector space model is generated; (2) a construction regulation data vector space model is generated; (3) the construction regulation data vector space model is transposed to generate a construction regulation data feature vector space model, that is, a frequent feature set; and (4) the association degree of the construction regulation data is calculated and association rules are output. The method can mine data in the construction regulation field, gives users a higher recall ratio when querying data, recommends associated query content, and solves the technical problem that existing association analysis technologies cannot perform association analysis on outlier data.
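The abstract does not give the formula for the association degree, so the following Python sketch substitutes the standard support and confidence measures from association rule mining. It only illustrates how rules between feature words could be read off a transposed Boolean matrix. The function name, the thresholds, and the restriction to rules between pairs of words are assumptions, not the patent's method.

```python
from itertools import permutations


def association_rules(feature_rows, names, min_support=0.5, min_confidence=0.6):
    """Return (antecedent, consequent, support, confidence) for every ordered
    pair of feature words that meets both thresholds.

    feature_rows[i][d] is 1 if feature word names[i] occurs in document d.
    """
    n_docs = len(feature_rows[0]) if feature_rows else 0
    rules = []
    for i, j in permutations(range(len(names)), 2):
        # Number of documents containing both words.
        both = sum(a & b for a, b in zip(feature_rows[i], feature_rows[j]))
        count_i = sum(feature_rows[i])
        if n_docs == 0 or count_i == 0:
            continue
        support = both / n_docs        # share of documents with both words
        confidence = both / count_i    # share of names[i] documents that also have names[j]
        if support >= min_support and confidence >= min_confidence:
            rules.append((names[i], names[j], support, confidence))
    return rules
```

A rule such as ("permit", "site", 0.5, 1.0) would read: half of the documents contain both words, and every document containing "permit" also contains "site", so "site" could be offered as a related query term.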

Description

Technical field

[0001] The invention relates to a method for mining text feature data in the field of natural language processing, belonging to subclass G06F17/27 of the International Patent Classification (IPC), and in particular to a method for mining data in the field of construction regulations based on association rule mining technology.

Background technique

[0002] Construction regulation data is unstructured data, and mining technology for construction regulation data belongs to the research category of text mining. Text mining refers to the process of using data mining technology to discover novel, potentially useful and ultimately understandable knowledge (including concepts, patterns, rules, laws, constraints and visualizations) from large collections of unstructured, heterogeneous text. Text data has richer and more complex connotations than structured numerical data. The main task of text mining re...


Application Information

IPC(8): G06F17/30
Inventor: 苏变萍, 金维兴, 董丽丽, 侯筱婷
Owner: XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY