The invention relates to an extracting statistical method and
system aiming at semi-structured
big data, and belongs to the field of
big data extracting statistics. The extracting statistical method and
system solve the problem that the process of extraction and statistics of the semi-structured
big data is tedious and easy to cause
data redundancy. According to the extracting statistical method and
system, by providing a
client to let a
user input operational statements of the extraction and statistics aiming at the semi-structured big data, the operation statements are synchronized to a
parsing conversion module which parses the operation statements and converts
parsing results into configuration rules; the
client calls an application engine module to generate a job task according to the configuration rules, and submit the job task to a underlying framework; the underlying framework splits the job task into multiple subtasks, distributes the subtasks to a cluster for execution, and returns the resulting data obtained after the execution back to the
client to show to the user. The extracting statistical method and system are used for improving the
maintainability and the automatic
visualization level of the extraction and statistics aiming at the semi-structured large data and reducing the
data redundancy, and the extracting statistical method and system are simple and reliable.