The invention provides a large-scale mixed heterogeneous storage 
system-oriented node fault 
prediction system and method. A 
time sequence-based association 
rule mining algorithm is adopted to construct a node fault 
prediction system architecture, and a main process of node fault prediction includes: collecting state data and log information of each storage node; carrying out data preprocessing, and generating sequence 
modes on the basis of a sliding window; using the sequence 
modes and fault sequences, which are extracted in a fault identification process, together as input of an association rule 
algorithm, and outputting output results as typical fault sequences; carrying out matching on the typical fault sequences and sequence 
modes generated in real time; and if a matching result meetsan established rule, issuing early warning to notify a 
system administrator, and giving feedback to a prediction result by the administrator according to a subjective interest degree. According to thesystem and method, real-time online fault prediction is carried out for nodes of a large-scale mixed heterogeneous storage 
system, and accuracy and recall which are better than those of existing fault 
prediction algorithms and better 
scalability can be obtained.