Hadoop-based genetic method for mining massive Web data
The invention concerns data mining of massive data sets, specifically a Hadoop-based genetic method for mining massive Web data. It addresses problems such as the loose coupling of data contexts, and aims to overcome existing disadvantages, improve mining efficiency, and achieve high execution efficiency.
Detailed Description of the Embodiments
[0026] The invention is described in more detail below.
[0027] The method proceeds as follows:
[0028] Step 1: Data segmentation. The Web data is segmented according to its characteristics; for example, Web log files are segmented by user and access date. The segments are transmitted to different sub-nodes, and the user-defined support S is obtained at the same time.
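The segmentation in Step 1 might be sketched as follows. This is a minimal illustration only, assuming each log record is a `(user, date, url)` tuple and that segments are dealt out to sub-nodes round-robin; the record layout, the function names `segment_log` and `assign_to_nodes`, and the toy records are hypothetical and do not come from the patent text.

```python
from collections import defaultdict


def segment_log(records):
    """Group (user, date, url) records into one segment per (user, date)."""
    segments = defaultdict(list)
    for user, date, url in records:
        segments[(user, date)].append(url)
    return dict(segments)


def assign_to_nodes(segments, num_nodes):
    """Distribute segments over sub-nodes round-robin (an assumed policy)."""
    nodes = [dict() for _ in range(num_nodes)]
    for i, key in enumerate(sorted(segments)):
        nodes[i % num_nodes][key] = segments[key]
    return nodes


# Toy log records, invented for illustration only.
records = [
    ("u1", "2024-01-01", "/home"),
    ("u1", "2024-01-01", "/news"),
    ("u2", "2024-01-01", "/home"),
    ("u1", "2024-01-02", "/home"),
]
segments = segment_log(records)
nodes = assign_to_nodes(segments, 2)
```

On a real cluster the distribution would be handled by the Hadoop input splitter and partitioner rather than by hand; the round-robin step here only stands in for that.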
[0029] Step 2: Population initialization. Each sub-node uses Map and Reduce operations on the Hadoop platform to convert its data set into 1-itemsets of preferred sub-paths that satisfy the user-defined support S; these serve as the initial population of the genetic algorithm.
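One way to picture Step 2 is the sketch below, which emulates the Map and Reduce phases in plain Python rather than on Hadoop. It assumes that a session is a list of visited pages, that the support S is a fraction of sessions, and that a 1-itemset is represented as a one-element tuple; these representations and the names `map_phase`, `reduce_phase`, and `initial_population` are assumptions made for illustration.

```python
from collections import Counter
from itertools import chain


def map_phase(session):
    """Emit (page, 1) once for each distinct page in a session."""
    return [(page, 1) for page in set(session)]


def reduce_phase(pairs):
    """Sum the emitted counts per page."""
    counts = Counter()
    for page, one in pairs:
        counts[page] += one
    return counts


def initial_population(sessions, min_support):
    """Keep single-page paths whose session share meets min_support."""
    pairs = chain.from_iterable(map_phase(s) for s in sessions)
    counts = reduce_phase(pairs)
    threshold = min_support * len(sessions)
    return sorted((page,) for page, c in counts.items() if c >= threshold)


# Toy sessions, invented for illustration only.
sessions = [
    ["/home", "/news"],
    ["/home", "/shop"],
    ["/home", "/news", "/shop"],
    ["/about"],
]
population = initial_population(sessions, 0.5)
```

With a support of 0.5 over these four sessions, only pages seen in at least two sessions survive into the initial population.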
[0030] Step 3: Fitness calculation. The frequency with which an access path occurs measures whether it is a user's preferred access path. The fitness function is therefore defined as follows:
[0031]
[0032] Here, S' is the access frequency of the path. I...
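The quantity S' could be computed along the following lines. This sketch covers only the access frequency, not the fitness function itself, which is not reproduced in the text above. It assumes that S' is the fraction of sessions containing the path as a contiguous run of pages; that definition and the names `contains_path` and `access_frequency` are assumptions.

```python
def contains_path(session, path):
    """Return True if path occurs in session as a contiguous sub-sequence."""
    n = len(path)
    target = tuple(path)
    return any(
        tuple(session[i:i + n]) == target
        for i in range(len(session) - n + 1)
    )


def access_frequency(path, sessions):
    """Fraction of sessions that contain the path (assumed meaning of S')."""
    if not sessions:
        return 0.0
    hits = sum(1 for s in sessions if contains_path(s, path))
    return hits / len(sessions)


# Toy sessions, invented for illustration only.
sessions = [
    ["/home", "/news", "/shop"],
    ["/home", "/shop"],
    ["/home", "/news"],
    ["/about"],
]
freq = access_frequency(("/home", "/news"), sessions)
```

A non-contiguous definition (pages visited in order, with gaps allowed) would be an equally plausible reading and would give different values for longer paths.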