Data integration support systems, methods, and programs
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Applications
- Current Assignee / Owner
- HITACHI SOFTWARE ENG
- Filing Date
- 2024-12-13
- Publication Date
- 2026-06-25
Smart Images

Figure 2026104213000001_ABST
Abstract
Claims
1. A data integration support system that assists in the process of integrating data from a set of source data files into a target database, A schema information generation device generates source schema information, which is schema information that defines the source database into which the aforementioned data files are compiled, and A suggestion device that, based on the relationships between tables in the source schema information where tables refer to other tables, and the relationships between tables in the destination schema information defining the destination database, presents the user with suggestion information indicating possible table options in the source database for each table in the destination database, and prompts the user to select one of the options. A data integration device that determines which tables in the source database to be integrated into each of the tables in the destination database, based on the selection of options by the user, A data integration support system that has the following features.
2. The suggestion device presents, for each table at each level in the external reference tree of the source schema information, which is a tree representation of the relationships between tables in the source schema information, as options, the tables at the corresponding levels in the source external reference tree, which is a tree representation of the relationships between tables in the source schema information. If the user determines that the table in the source database to be integrated into the table in the destination database is not among the options, then the tables located one level higher in the source external reference tree are presented as the next options. The data integration support system according to claim 1.
3. The system further includes a database creation device that creates the source database based on the source schema information, The data integration device creates the target database by applying the tables of the source database selected by the user to the tables of the target database. The data integration support system according to claim 1.
4. The data integration device updates the integration status file, which shows the correspondence between the table in the source database and the table in the destination database, each time the suggestion device receives a selection from the user. The suggestion device determines, based on the updated integration status file, the table in the source database to be used as an option for the table in the destination database. The data integration support system according to claim 1.
5. The schema information generation device is For each column in each file included in the aforementioned data file group, the column that does not contain a NULL value and does not contain duplicate values will be used as a candidate key for a single attribute. The data integration support system according to claim 1.
6. The schema information generation device is Among all combination patterns of two or more columns that are not candidate keys for the single attribute, the pattern in which all columns are not simultaneously NULL values and which does not contain duplicate values is designated as a candidate key for the multiple attribute. The data integration support system according to claim 5.
7. The schema information generation device determines whether there is a foreign key, and what that foreign key is, based on the matching of the values of the candidate key and the foreign key, which references each candidate key in each table of the source schema information. The data integration support system according to claim 6.
8. The schema information generation device generates the source schema information by performing second normalization and third normalization on the data files based on the candidate keys and the values of each attribute in the data files. The data integration support system according to claim 6.
9. The schema information generator, as part of the second normalization process, identifies all combinations of attributes included in the candidate key of the multiple attributes that are not the candidate key itself as sub-patterns, and if there are attributes functionally dependent on the sub-pattern, it extracts the portion of the attribute column from the table containing that attribute as a column into another table, using the sub-pattern as the primary key. The data integration support system according to claim 8.
10. The schema information generator considers any attribute not included in any of the candidate keys of the multiple attributes as a non-key attribute, all combination patterns of the non-key attributes as transitive dependent patterns, and if there are any non-key attributes functionally dependent on the transitive dependent patterns, From a table containing the non-key attribute as a column, extract the portion containing the non-key attribute into a separate table using the transitional dependency pattern as the primary key. The data integration support system according to claim 9.
11. A data integration support method that assists in the process of integrating data from a set of source data files into a destination database, Computers The aforementioned data files are compiled into a database, and schema information is generated that defines the source database, which is the source schema information. Based on the relationships between tables in the source schema information where tables refer to other tables, and the relationships between tables in the destination schema information defining the destination database, the user is presented with suggestion information indicating possible table options in the source database for each table in the destination database, and the user is prompted to select one of these options. Based on the user's selection of options, the tables in the source database to be integrated into each of the tables in the destination database are determined. A data integration support method for carrying out this task.
12. A data integration support program that assists in the process of integrating data from a set of source data files into a target database, On the computer, The aforementioned data files are compiled into a database, and schema information is generated that defines the source database, which is the source schema information. Based on the relationships between tables in the source schema information where tables refer to other tables, and the relationships between tables in the destination schema information defining the destination database, the user is presented with suggestion information indicating possible table options in the source database for each table in the destination database, and the user is prompted to select one of these options. Based on the user's selection of options, the tables in the source database to be integrated into each of the tables in the destination database are determined. A data integration support program that enables the execution of this task.