The invention provides a large-scale mixed heterogeneous storage
system-oriented node fault
prediction system and method. A
time sequence-based association
rule mining algorithm is adopted to construct a node fault
prediction system architecture, and a main process of node fault prediction includes: collecting state data and log information of each storage node; carrying out data preprocessing, and generating sequence
modes on the basis of a sliding window; using the sequence
modes and fault sequences, which are extracted in a fault identification process, together as input of an association rule
algorithm, and outputting output results as typical fault sequences; carrying out matching on the typical fault sequences and sequence
modes generated in real time; and if a matching result meetsan established rule, issuing early warning to notify a
system administrator, and giving feedback to a prediction result by the administrator according to a subjective interest degree. According to thesystem and method, real-time online fault prediction is carried out for nodes of a large-scale mixed heterogeneous storage
system, and accuracy and recall which are better than those of existing fault
prediction algorithms and better
scalability can be obtained.