Massive web log data query and analysis method
A technology of data query and analysis method, which is applied in the direction of network data retrieval, network data indexing, electronic digital data processing, etc. It can solve the problems of inaccurate data analysis results and large retrieval time delay, achieve accurate results, realize data mining, Achieving Scalability and Efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0024] The technical solutions provided by the present invention will be described in detail below in conjunction with specific examples. It should be understood that the following specific embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention.
[0025] Customers will leave traces of their visits in the process of browsing the website, and these traces will be saved in the form of web log files. For these data, this example uses the ETL language in Hive, optimized Hive SQL query, MapReduce with combiner function, and genetic algorithm based on data segmentation technology to accurately provide log data query and analysis results. Such as figure 1 As shown, the specific steps of this method are as follows:
[0026] Step 10, use ETL in Hive to analyze the data of each data source. The ETL process includes four steps of data extraction, cleaning, transformation and loading. In the extraction stage, the so...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com