A data source division method, device, equipment and storage medium

The invention relates to data source and database technology in the field of data processing. It addresses the problems of manual data source division: higher operation and maintenance costs, lower division accuracy, and the inability to update data sources in time. Its stated effects are improved division accuracy and a reduced division workload.

Active Publication Date: 2020-09-29
JINGDONG TECH HLDG CO LTD

AI Technical Summary

Problems solved by technology

[0005] Existing data source division is performed manually, based on the experience of developers, which increases operation and maintenance costs and reduces division accuracy. Moreover, when a business changes, the data sources under its business directory cannot be updated in time, so the division cannot keep up with frequent changes across different businesses.

Examples

Embodiment 1

[0031] Figure 1 is a flow chart of a data source division method provided by Embodiment 1 of the present invention. This embodiment applies to dividing data sources into their corresponding business directories. The method can be executed by a data source division device, which can be implemented in software and/or hardware and integrated into equipment with data processing functions, such as a desktop or notebook computer. As shown in Figure 1, the method includes the following steps:

[0032] S110. Obtain database information corresponding to multiple data sources.

[0033] Here, a data source refers to information that describes a database, so that the corresponding database can be obtained from the data source. Data sources and databases correspond one to one. The database information corresponding to the data source may refer to the database table infor...

Embodiment 2

[0056] Figure 2 is a flow chart of a data source division method provided by Embodiment 2 of the present invention. Building on the above embodiments, this embodiment describes in detail how the similarity between two data sources is determined, and the clustering process when the preset clustering algorithm is the clique percolation algorithm. Terms that are the same as or correspond to those in the above embodiments are not explained again here.
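The clique percolation idea named above can be illustrated with a minimal sketch. It assumes the data sources have already been turned into a graph whose edges join pairs of sufficiently similar data sources; how the patent builds that graph is not visible in this excerpt, so the edge list, the function name `clique_percolation`, and the choice `k=3` are all hypothetical.

```python
from itertools import combinations


def clique_percolation(nodes, edges, k=3):
    """Group nodes into communities by percolating k-cliques.

    Two k-cliques belong to the same community when they share k-1 nodes.
    Brute-force enumeration: suitable for small illustrative graphs only.
    """
    adjacency = {n: set() for n in nodes}
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)

    # Every k-subset whose members are all pairwise connected is a k-clique.
    cliques = [
        frozenset(c)
        for c in combinations(sorted(nodes), k)
        if all(v in adjacency[u] for u, v in combinations(c, 2))
    ]

    # Union-find over clique indices.
    parent = list(range(len(cliques)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(cliques)), 2):
        if len(cliques[i] & cliques[j]) >= k - 1:
            parent[find(i)] = find(j)

    # A community is the union of all cliques in one connected group.
    groups = {}
    for i, clique in enumerate(cliques):
        groups.setdefault(find(i), set()).update(clique)
    return [frozenset(g) for g in groups.values()]
```

Nodes that belong to no k-clique are left out of every community, and a node may appear in more than one community; both are properties of clique percolation in general, not something this excerpt states about the patented method. For real workloads, the `k_clique_communities` function in NetworkX offers a maintained implementation of the same technique.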

[0057] Referring to Figure 2, the data source division method provided in this embodiment includes the following steps:

[0058] S210. Obtain database information corresponding to multiple data sources.

[0059] S220. Perform word segmentation processing on the database information corresponding to each data source, and determine a first feature set corresponding to each data source according to the word segmentation result.

[0060] S230. Count the third occurrence probability corresponding to eac...
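Step S220 can be sketched as follows. The excerpt does not say which word segmenter is used, so this sketch swaps in a simple regex split on punctuation, underscores, and camelCase boundaries, which only suits English-like table and column names. The function names `segment` and `first_feature_set` are hypothetical.

```python
import re


def segment(text: str) -> list[str]:
    """Split one piece of database information into lowercase word tokens."""
    # Insert a space at each camelCase boundary, e.g. "orderDetail" -> "order Detail".
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", text)
    # Split on any run of non-alphanumeric characters and drop empty pieces.
    return [tok.lower() for tok in re.split(r"[^A-Za-z0-9]+", spaced) if tok]


def first_feature_set(database_info: list[str]) -> set[str]:
    """Collect the distinct words found across all database information strings."""
    words: set[str] = set()
    for item in database_info:
        words.update(segment(item))
    return words
```

For example, `first_feature_set(["user_order", "orderItem"])` yields the words `user`, `order`, and `item`. A production system handling Chinese table comments would need a dedicated segmentation library instead of this regex.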

Embodiment 3

[0109] Figure 4 is a schematic structural diagram of a data source division device provided by Embodiment 3 of the present invention. This embodiment applies to dividing data sources into their corresponding business directories. The device includes: a database information acquisition module 310, a first feature set determination module 320, a data source set determination module 330 and a data source set division module 340.

[0110] The database information acquisition module 310 is used to acquire database information corresponding to multiple data sources; the first feature set determination module 320 is used to perform word segmentation on the database information corresponding to each data source and determine the first feature set corresponding to each data source; the data source set determination module 330 is used to determine the similarity between two data sources according to each first feature wo...
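The four-module structure can be pictured as a simple pipeline. This is a structural sketch only: the class name, field names, and type signatures are hypothetical, and each module is represented as a plain callable supplied by the caller rather than as the patent's actual implementation.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class DataSourceDivisionDevice:
    # Module 310: data source names -> database information per data source.
    acquire_info: Callable[[list[str]], dict[str, list[str]]]
    # Module 320: database information -> first feature set per data source.
    determine_features: Callable[[dict[str, list[str]]], dict[str, set[str]]]
    # Module 330: feature sets -> clustered data source sets.
    determine_sets: Callable[[dict[str, set[str]]], list[set[str]]]
    # Module 340: data source sets -> directory assignment, or None if not divided.
    divide_sets: Callable[[list[set[str]]], Optional[dict[str, set[str]]]]

    def run(self, sources: list[str]) -> Optional[dict[str, set[str]]]:
        """Pass the output of each module to the next, in module order."""
        info = self.acquire_info(sources)
        features = self.determine_features(info)
        source_sets = self.determine_sets(features)
        return self.divide_sets(source_sets)
```

Keeping each module behind a callable makes it possible to swap the clustering step or the segmentation step without touching the others.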

Abstract

The embodiment of the invention discloses a data source division method, device, equipment and storage medium. The method comprises: acquiring database information corresponding to multiple data sources; performing word segmentation on the database information corresponding to each data source, and determining a first feature set corresponding to each data source according to the word segmentation result; determining the similarity between every two data sources according to each first feature word in each first feature set, clustering the data sources according to the similarities and a preset clustering algorithm, and determining the data source sets; and, when the number of data source sets equals the number of preset business directories, dividing each data source set into the corresponding preset business directory. This technical scheme enables automatic and reasonable division of data sources and improves division accuracy.
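The similarity and final division steps of the abstract can be sketched as below. Two things are assumptions, not taken from the source: Jaccard overlap stands in for the similarity measure, whose actual definition is not visible in this excerpt, and the hypothetical callback `choose_directory` stands in for the unspecified rule that matches a data source set to a directory.

```python
from itertools import combinations


def jaccard(a: set[str], b: set[str]) -> float:
    """Share of feature words common to both sets (assumed similarity measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def pairwise_similarity(features: dict[str, set[str]]) -> dict[tuple[str, str], float]:
    """Similarity for every unordered pair of data sources, keyed in sorted order."""
    return {
        (s, t): jaccard(features[s], features[t])
        for s, t in combinations(sorted(features), 2)
    }


def divide(source_sets, directories, choose_directory):
    """Assign each data source set to a directory only when the counts match.

    Returns None when the number of sets differs from the number of directories.
    choose_directory(source_set, directories) is a hypothetical matching rule.
    """
    if len(source_sets) != len(directories):
        return None
    assignment = {}
    for source_set in source_sets:
        directory = choose_directory(source_set, directories)
        if directory in assignment:
            raise ValueError(f"directory {directory!r} chosen twice")
        assignment[directory] = set(source_set)
    return assignment
```

The count check mirrors the condition stated in the abstract; what happens when the counts differ is not described in this excerpt, so the sketch simply returns `None`.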

Description

Technical field

[0001] Embodiments of the present invention relate to data processing technologies, and in particular to a data source division method, device, equipment and storage medium.

Background

[0002] At present, more and more industries generate large amounts of data every day. For example, the data generated daily by the e-commerce industry can include different types of data on different topics such as business, system, traffic, and users.

[0003] Developers often need to investigate data sources for project development. Investigating a large volume of data greatly increases developers' workload and reduces the efficiency of data usage. In view of this, in the prior art, database developers maintain the division manually: each data source is manually assigned to an existing business directory when its creation is requested.

[0004] However, in the course of realizing the present invention, the invent...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/35, G06F16/31
CPC: G06F16/31, G06F16/35
Inventor: 宋宇航, 云兴海
Owner: JINGDONG TECH HLDG CO LTD