Data and code version management system and method

A version management and data management technology, applied in the field of data analysis, that addresses low efficiency and confusion in version management and reduces operating pressure.

Active Publication Date: 2016-09-21
INST FOR INTERDISCIPLINARY INFORMATION CORE TECH XIAN CO LTD

AI Technical Summary

Problems solved by technology

[0008] In view of the shortcomings of the prior art described above, the purpose of the present invention is to provide a data and code version management system and method that solve the low efficiency and confusion of data and code version management in the prior art.


Examples


Embodiment 1

[0046] See Figure 1, which is a schematic structural diagram of the data version management system of the present invention. As shown in the figure, the first aspect of the present application provides a data version management system. The system can be deployed on a single server, a server cluster, servers based on a cloud computing architecture, or distributed servers. A server cluster is a collection of servers used for data version management; it can use multiple computers to perform calculations in parallel and so increase calculation speed. A server based on a cloud computing architecture pools the storage of each server through virtualization technology, so that the servers hosting the modules of the data version management system share computing resources. A distributed server distributes the data and programs of the data version management system across multiple servers for coordinated operat...

Embodiment 2

[0079] See Figure 2, which is a flow chart of the data version management method of the present invention. As shown in the figure, the second aspect of the present application provides a data version management method. The method is executed mainly by a data version management system, which can be deployed on a single server, a server cluster, a server based on a cloud computing architecture, or a distributed server. A server cluster is a collection of servers used for data version management; it can use multiple computers to perform calculations in parallel and so increase calculation speed. A server based on a cloud computing architecture pools the storage of each server through virtualization technology, so that the servers hosting the modules of the data version management system share computing resources. The distributed server distributes the data a...

Embodiment 3

[0114] See Figure 3, which is a schematic structural diagram of the code version management system of the present invention. As shown in the figure, the third aspect of the present application provides a code version management system, which can be deployed on a single server, a server cluster, servers based on a cloud computing architecture, or distributed servers. A server cluster is a collection of servers used for data version management; it can use multiple computers to perform calculations in parallel and so increase calculation speed. A server based on a cloud computing architecture pools the storage of each server through virtualization technology, so that the servers hosting the modules of the code version management system share computing resources. A distributed server distributes the data and programs of the code version management system across multiple servers for coordinated operation.

[0115] Each module in...


Abstract

The invention provides a data and code version management system and method. The system comprises a data management module, a code management module, an execution engine module, and a system core module. The data management module stores at least one data set. The code management module stores at least one piece of execution code; it receives and stores code pushed by a user, or sends a code processing request based on the code pushed by the user. The execution engine module is configured with at least one execution back-end engine; on receiving an execution instruction, it calls the back-end engine and runs one piece of execution code to operate on the at least one data set in the data management module. The system core module, after receiving a data processing request submitted by the user, processes the data set in the data management module, establishes a data workflow for the data set, and records the resulting data version information and code version information. The system and method address problems such as inefficient and disordered version management of data and code.
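The four modules named in the abstract can be pictured with a small Python sketch. This is only an illustration under stated assumptions, not the patented implementation: the class names mirror the module names, but every method name (`store`, `push`, `run`, `process`), the hash-based version identifiers, the `python_backend` function, and the convention that pushed code defines a `transform(rows)` function are hypothetical choices made for this example.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


def _digest(payload: str) -> str:
    """Short content hash used as a version id (an assumption of this sketch)."""
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()[:12]


@dataclass
class DataManagementModule:
    """Stores data sets; each stored state is keyed by a version id."""
    versions: Dict[str, List[int]] = field(default_factory=dict)

    def store(self, rows: List[int]) -> str:
        version = _digest(repr(rows))
        self.versions[version] = list(rows)
        return version


@dataclass
class CodeManagementModule:
    """Receives and stores code pushed by a user."""
    versions: Dict[str, str] = field(default_factory=dict)

    def push(self, source: str) -> str:
        version = _digest(source)
        self.versions[version] = source
        return version


def python_backend(source: str, rows: List[int]) -> List[int]:
    """Toy back-end engine: the pushed code must define transform(rows)."""
    namespace: dict = {}
    exec(source, namespace)  # illustration only; never exec untrusted code
    return namespace["transform"](rows)


@dataclass
class ExecutionEngineModule:
    """Holds the configured back-end engines and runs code on one of them."""
    backends: Dict[str, Callable[[str, List[int]], List[int]]]

    def run(self, backend: str, source: str, rows: List[int]) -> List[int]:
        return self.backends[backend](source, rows)


@dataclass
class SystemCoreModule:
    """Handles a data processing request and records the resulting workflow."""
    data: DataManagementModule
    code: CodeManagementModule
    engine: ExecutionEngineModule
    # Each entry: (input data version, code version, output data version).
    workflow: List[Tuple[str, str, str]] = field(default_factory=list)

    def process(self, data_version: str, code_version: str,
                backend: str = "python") -> str:
        rows = self.data.versions[data_version]
        source = self.code.versions[code_version]
        result = self.engine.run(backend, source, rows)
        new_version = self.data.store(result)
        self.workflow.append((data_version, code_version, new_version))
        return new_version


# Usage: clean a data set and record which code produced which data version.
core = SystemCoreModule(
    DataManagementModule(),
    CodeManagementModule(),
    ExecutionEngineModule({"python": python_backend}),
)
v0 = core.data.store([3, -1, 7])
c0 = core.code.push("def transform(rows):\n    return [r for r in rows if r >= 0]\n")
v1 = core.process(v0, c0)
```

In this sketch the workflow is a flat list of (input data version, code version, output data version) triples, so the lineage of any data version can be traced back by following the triples. The source does not specify how the data workflow is represented.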

Description

Technical field

[0001] The invention relates to the field of data analysis, in particular to a data and code version management system and method.

Background

[0002] In recent years, people have collected large amounts of data, and data scientist has become a sought-after position at major companies. However, there are currently not enough tools to help data scientists analyze data streams. As data science tasks grow more complex, many data analysts have begun to adapt code versioning tools such as Git. However, Git cannot fully handle data science tasks.

[0003] First, data science is data-centric. A data set can undergo several operations such as cleaning, labeling, and preprocessing, each of which generates a new version of the data set. Data scientists need to keep track of these versions and modify the data over time. A common but not recommended method is to save multiple copies, and name these copies ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/215, G06F16/219
Inventor: 徐葳, 徐方舟, 张炀
Owner: INST FOR INTERDISCIPLINARY INFORMATION CORE TECH XIAN CO LTD