XBRL file-based data mining method

A technology of data mining and instance files, applied in the computer field, can solve problems such as increasing the cost of investors and analysts, and achieve the effect of rapid mining and increasing computing speed

Inactive Publication Date: 2016-09-07
YUNNAN UNIVERSITY OF FINANCE AND ECONOMICS
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, disclosing a large amount of information about listed companies will greatly increase the cost of searching and processing information for investors and analysts.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • XBRL file-based data mining method
  • XBRL file-based data mining method
  • XBRL file-based data mining method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be described in further detail below through specific embodiments and in conjunction with the accompanying drawings.

[0024] The invention provides a data mining method based on XBRL instance files.

[0025] The method comprises the steps of:

[0026] S101. Obtain an XBRL instance file, and store the XBRL instance file using the Hadoop platform HDFS file system.

[0027] During specific implementation, all the XBRL instance files are optionally obtained from the Internet, and all the XBRL instance files are stored. Since the number of XBRL instance files is large, when a single server stores them, the load on the server is very large. Therefore, the present invention utilizes the Hadoop platform for operation, that is, utilizes the HDFS distributed database for storage. This method can completely save all the information in all XBRL instance files, and at the same time solve the problem of massive data storage, making preparations for the n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an XBRL instance file-based data mining method. The method includes the following steps: acquiring an XBRL instance file, and storing the XBRL instance file through an Hadoop platform HDFS file system; performing fragmentation treatment on the XBRL instance file stored in the Hadoop platform, analyzing each fragments of the XBRL instance file and generating a corresponding Boolean matrix through a MapReduce technique; performing segmentation treatment on the Boolean matrix, counting the numbers of different elements corresponding to all the segments, in the Boolean matrix through an iterative algorithm, acquiring a frequent item of the XBRL instance file according to the numbers, and acquiring data of the XBRL instance file, corresponding to the frequent data. The Hadoop platform can achieve storage massive XBRL instance files, the XBRL instance file can be analyzed through a Map / Reduce function of the Hadoop platform, and the corresponding Boolean matrix can be generated; and then segmentation treatment is performed on the Boolean matrix through Map / Reduce data, and in this manner, the calculated amount of data mining can be decreased, and the calculation speed can be increased.

Description

technical field [0001] The invention belongs to the technical field of computers, and in particular relates to a data mining method based on XBRL files. Background technique [0002] XBRL (eXtensible Business Reporting Language, eXtensible Business Reporting Language), is an XML-based markup language for the definition and exchange of business and financial information. XBRL is conducive to the compilation, analysis and exchange of business information, providing low-cost, high-efficiency services and reliable and accurate business information for everyone who provides and uses financial data.” At present, XBRL has more and more widespread use around the world Applications, such as US Securities Regulatory Commission (SEC), Canadian Securities Regulatory Commission (CSA), Toronto Stock Exchange, Korea Stock Exchange, Tokyo Stock Exchange, Shanghai Stock Exchange, Shenzhen Stock Exchange and other multinational securities regulatory agencies and stock exchanges They are all ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/182G06F16/2465
Inventor 冯涛
Owner YUNNAN UNIVERSITY OF FINANCE AND ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products