MapReduce-based FTP distributed collection method
A collection method and distributed technology, applied in multi-channel programming devices, digital transmission systems, electrical components, etc., can solve the problems of troublesome maintenance and slow single-thread collection, and achieve the effect of improving speed and simplifying maintenance work.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0019] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
[0020] figure 1 It is a flow chart of FTP distributed acquisition based on MapReduce in the present invention.
[0021] See figure 1 , the FTP distributed collection method based on MapReduce provided by the present invention, comprises the steps:
[0022] S1) pre-configure a plurality of FTP server information and log file paths, and store the configuration information in the HDFS of Hadoop as the data input of MapReduce;
[0023] S2) the input directory and the number of Reduce tasks of MapReduce are set;
[0024] S3) use MapReduce to distribute different log records to different HDFS cluster nodes for processing;
[0025] S4) After each HDFS cluster node reads the FTP server information, use the account password to connect to the FTP server, expand the pre-configured log file path, and write the file into HDFS through the IO stream, so as to real...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com