Data utilization support system and data utilization support method
The data utilization support system addresses the challenge of structuring non-standardized law and regulation data by estimating and generating structured data, enhancing work efficiency in organizations.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Patents
- Current Assignee / Owner
- HITACHI REAL ESTATE PARTNERS CO LTD
- Filing Date
- 2022-07-27
- Publication Date
- 2026-06-22
AI Technical Summary
The challenge of efficiently structuring non-standardized data formats of laws and regulations provided as open data, particularly in the notation of 'articles', 'paragraphs', and 'items', hinders their automated import into databases, complicating work efficiency in organizations.
A data utilization support system that uses an information processing device to provide information on article number formats, defines descriptive patterns, and estimates a description pattern by comparing article data with predefined formats, generating structured data based on these patterns.
Enables efficient structuring of open data, facilitating tasks such as identifying and verifying laws and regulations, improving work efficiency by standardizing the data format.
Smart Images

Figure 0007877110000001 
Figure 0007877110000002 
Figure 0007877110000003
Abstract
Description
Technical Field
[0001] The present invention relates to a data utilization support system and a data utilization support method.
Background Art
[0002] When conducting business or projects in organizations such as companies and government agencies, work such as identifying and verifying relevant laws and regulations is carried out. However, since such work is complicated and time-consuming, various mechanisms for reducing the workload have been proposed conventionally.
[0003] For example, Patent Document 1 describes a revised notice search system configured for the purpose of easily grasping ordinance revisions in municipalities. The revised notice search system connects to an external database such as an ordinance database in which information on laws and regulations and regulations is accumulated via the Internet line to acquire ordinance data, and compares the last acquired ordinance data with the acquired ordinance data to detect inconsistent parts and create ordinance revision data, and notifies users that there has been an ordinance revision based on the ordinance revision data.
Prior Art Documents
Patent Documents
[0004]
Patent Document 1
Summary of the Invention
Problems to be Solved by the Invention
[0005] With the current trend of recent DX (Digital Transformation) strategies and digital structural reforms, etc., countries and local public bodies, etc. are providing open data such as laws and regulations as part of digital administrative services. By utilizing such data, in organizations such as companies and government agencies, it is expected to improve the efficiency of work such as identifying and verifying laws and regulations carried out when conducting business and projects.
[0006] However, in order to effectively utilize this data by importing it into a database, it is necessary to structure the provided data in an appropriate form. In particular, the description format of the provided laws and regulations is not always standardized, with differences in the notation of "articles," "paragraphs," and "items," as well as the system for assigning article numbers, which presents a challenge in automatically importing the data into a database.
[0007] This invention is based on the above background and aims to provide a data utilization support system and a data utilization support method that can efficiently structure data of laws and regulations provided as open data, etc. [Means for solving the problem]
[0008] One aspect of the present invention for achieving the above objective is a data utilization support system, which is configured using an information processing device having a processor and a storage device, and which provides information indicating the description format of article numbers, which are numbers assigned hierarchically to articles contained in article data, which are electronic data describing laws and regulations, as one of the pieces of information. For each level of the aforementioned clause number, a combination of notation format and indentation was defined. Multiple descriptive patterns are stored, the article data to be structured is obtained, and the obtained article data is compared with the descriptive patterns, thereby determining the article data The aforementioned The system estimates a description pattern and generates and outputs structured data, which is data that structures the article data based on the estimated description pattern.
[0009] Further issues disclosed in this application, and methods for solving them, will be made clear in the section on embodiments for carrying out the invention and in the drawings. [Effects of the Invention]
[0010] According to the present invention, data of laws and regulations provided as open data can be efficiently structured. [Brief explanation of the drawing]
[0011] [Figure 1]This diagram shows the general configuration of the data utilization support system. [Figure 2] This is a block diagram illustrating the main functions of the data utilization support device. [Figure 3] This is a system flow diagram explaining the main operations of the data utilization support device. [Figure 4] This is an example of descriptive pattern definition information. [Figure 5A] This is a flowchart explaining the descriptive pattern estimation process. [Figure 5B] This is a flowchart explaining the description pattern matching process. [Figure 6A] This is an example of a screen displaying estimation results (when the description pattern estimation is successful). [Figure 6B] This is an example of a screen displaying estimation results (when estimation of the descriptive pattern fails). [Figure 7A] This is an example of structured data. [Figure 7B] This is an example of structured data that includes tabular data. [Figure 8] This is an example of an information processing device used to realize a data utilization support device or a data utilization device. [Figure 9] This is an example of description pattern definition information for the second embodiment. [Figure 10] This is a block diagram illustrating the main functions of the data utilization support device according to the second embodiment. [Figure 11] This is a system flow diagram illustrating the main operation of the data utilization support device according to the second embodiment. [Figure 12] This is an example of the estimated result display screen for the second embodiment. [Modes for carrying out the invention]
[0012] Hereinafter, embodiments of the present invention will be described with appropriate reference to the drawings. The following description and drawings are examples for explaining the present invention, and for the sake of clarity of explanation, omissions and simplifications are made as appropriate. The present invention can be implemented in various other forms. Unless otherwise limited, each component may be in a single or plural number.
[0013] In the following description, the same or similar components may be denoted by the same reference numerals, and redundant descriptions may be omitted. Also, in the following description, the letter "S" attached before the reference numeral means a processing step.
[0014] In the following description, various information may be described using expressions such as "information" and "data", but the various information may be expressed in other data structures. Also, when explaining identification information, expressions such as "identifier" and "ID" are used, but these can be replaced with each other.
[0015] [First Embodiment] FIG. 1 shows a schematic configuration of an information processing system (hereinafter referred to as "data utilization support system 1") described as a first embodiment. As shown in the figure, the data utilization support system 1 includes a data utilization support device 100 and one or more data utilization devices 3. All of these devices are configured using information processing devices (computers). The data utilization support device 100 and the data utilization device 3 may be realized by a common information processing device.
[0016] The data utilization support device 100 is communicably connected via the Internet 5 to a server device (hereinafter referred to as "provision server 2") that provides electronic data of laws and regulations (hereinafter referred to as "article data") operated by a country, a local public body, etc. as open data. to be.
[0017] The data utilization device 3 is connected to the data utilization support device 100 via a communication network 6. The communication network 6 may be, for example, a LAN (Local Area Network), a WAN (Wide Area Network), the Internet, a public communication network, or a dedicated line.
[0018] The data utilization support device 100 acquires the article data from the providing server 2 via the Internet 5, generates structured data (hereinafter referred to as "structured data") from the acquired article data, and manages it in a database. The data utilization support device 100 may also acquire the article data by other means, such as via a recording medium. The data utilization support device 100 provides the structured data to the data utilization device 3 in response to a request from the data utilization device 3. The data utilization device 3 acquires the structured data provided by the data utilization support device 100 and utilizes the acquired structured data in its operations.
[0019] Furthermore, the users of Data Utilization Support System 1 and Data Utilization Device 3 are not necessarily limited, but for example, they may be departments or personnel within organizations such as companies or government agencies that perform tasks such as identifying and confirming relevant laws and regulations in the course of carrying out their work or projects.
[0020] Figure 2 is a block diagram illustrating the main functions of the data utilization support device 100. Figure 3 is a system flow diagram illustrating the main operations of the data utilization support device 100.
[0021] As shown in Figure 2, the data utilization support device 100 includes a storage unit 110, an information acquisition unit 120, a text extraction unit 125, a description pattern estimation unit 130, a clause data structuring unit 135, and a structured data provision unit 140.
[0022] Of the above functions, the storage unit 110 stores the article data 111, article data (text) 112, description pattern definition information 113, description pattern estimation result 114, and structured data 115.
[0023] As shown in Figure 3, the information acquisition unit 120 acquires the article data from the provision server 2 via the Internet 5 and manages the acquired article data as article data 111. The article data is available in text format, as well as HTML (Hypertext Markup Language) and RT formats. It is provided as data in various formats (data formats, file formats), such as F (Rich Text Format) and PDF (Portable Document Format).
[0024] The text extraction unit 125 generates text data 112, which is text data extracted from text data 111 provided in various formats. For example, if the text data 111 is in text format, the text extraction unit 125 uses that data as is for text data 112. If the data format of the text data 111 is not text format, the text extraction unit 125 generates text data 112 by extracting text data from the text data 111 using, for example, known text data extraction software.
[0025] The description pattern estimation unit 130 estimates the description pattern (type of description format of articles and ordinances) of the article data (text) 112 by comparing it with the description pattern definition information 113, and manages the estimated result as the description pattern estimation result 114.
[0026] The Article Data Structuring Unit 135 generates structured data from Article Data (text) 112 based on the description pattern estimation result 114, and sets the generated data as structured data 115. To manage.
[0027] The structured data provision unit 140 transmits the requested structured data 115 of the requested article data to the data utilization device 3 in response to a request for provision of article data sent from the data utilization device 3.
[0028] <Estimation of descriptive patterns> Next, we will explain the method for estimating the description pattern by the description pattern estimation unit 130.
[0029] Figure 4 shows an example of description pattern definition information 113. Description pattern definition information 113 manages information defining description patterns. Description patterns are defined by combinations of description formats at each level of the article number. The example description pattern definition information 113 includes information defining the description format for each description pattern. As shown in the figure, the example description pattern definition information 113 has the following items: pattern ID 311, level 312, notation format 313, number character type 314, and sub-numbering capability 315.
[0030] Of the items listed above, pattern ID311 stores the pattern ID (e.g., "Pattern 1", "Pattern 2"), which is the identifier of the description pattern.
[0031] Level 312 stores information representing the hierarchy of article numbers used in laws and regulations (e.g., "Article," "Paragraph," "Item," "Subdivision 1," "Subdivision 2," etc.).
[0032] The notation format 313 stores information indicating the method of notation for article numbers at each level (hereinafter referred to as "notation format"). The symbol "{}" in the example notation format indicates that the part of the symbol will contain the article number (article number, paragraph number, subparagraph number, etc.). For example, "Article {}" stored in the article level of the example description pattern definition information 113 indicates that the article numbers will be written as "Article 1", "Article 2", and so on.
[0033] The number character type 314 stores information indicating the type of numbers and characters used in the description of the article number (e.g., "Chinese numerals", "digits", "katakana", etc.).
[0034] The sub-number availability 315 stores information indicating whether or not a sub-number may be attached to the article number, for example, "2" in "Article 29-2" (the value is "True" if a sub-number may be attached, and "False" if a sub-number may not be attached). Note that there are variations in the way the sub-number is written. Since there is a distinction, the description pattern definition information 113 may also include an item to distinguish the method of notating the sub-number.
[0035] The description pattern estimation unit 130 estimates the description pattern of the article data (text) 112 by comparing (matching) the article data (text) 112 with the description pattern definition information 113.
[0036] Figure 5A is a flowchart illustrating the process by which the description pattern estimation unit 130 estimates the description pattern of the clause data (text) 112 (hereinafter referred to as "description pattern estimation process S500"). The description pattern estimation process S500 will be explained below in conjunction with the same figure.
[0037] First, the description pattern estimation unit 130 reads the article data (text) 112 and the description pattern definition information 113 that are to be used for description pattern estimation (S511).
[0038] Next, the description pattern estimation unit 130 reads the article data (text) 112 and writes A process is performed to estimate the description pattern of the article data (text) 112 by comparing it with the description pattern definition information 113 (hereinafter referred to as "description pattern matching process S512").
[0039] Figure 5B is a flowchart illustrating the description pattern matching process S512. The description pattern matching process S512 will be explained below in conjunction with this figure.
[0040] The processing steps S550s to S550e shown in the figure are loop processes that are repeatedly performed by sequentially selecting the description patterns from the description pattern definition information 113.
[0041] First, the description pattern estimation unit 130 initializes the judgment flag (stores "True" in the judgment flag) (S551).
[0042] The subsequent processing steps S560s to S560e are loop processes that repeatedly select each line of the clause data (text) 112 sequentially from the beginning.
[0043] In S561, the description pattern estimation unit 130 determines whether the selected line matches one of the hierarchy levels (article, paragraph, sub-item, sub-sub
[0044] In S562, the description pattern estimation unit 130 determines whether the matched hierarchy follows the hierarchy order of the selected description pattern (whether the matched hierarchy appears in the hierarchy order of the selected description pattern). If the matched hierarchy follows the hierarchy order of the selected description pattern (S562:YES), the process proceeds to S560e (in this case, the judgment flag retains its current value and the processing of the selected row is terminated); otherwise, the process proceeds to S563.
[0045] In S563, the description pattern estimation unit 130 stores "False" in the judgment flag and processes Next, proceed to the S560e.
[0046] When the loop processing S560s to S560e for the selected description pattern is completed, the description pattern estimation unit 130 refers to the judgment flag and, if "True" is stored, stores the pattern ID of the selected description pattern in the description pattern estimation result 114. However, if the clause data (text) 112 does not match any of the description patterns in the description pattern definition information 113 (i.e., the judgment flag is "False"), the description pattern estimation unit 130 The description pattern estimation result 114 will store "No matching description pattern found".
[0047] Once the loop processing from S550s to S550e is completed, the process proceeds to S513 in Figure 5A.
[0048] In step S513 of Figure 5A, the description pattern estimation unit 130 outputs the content (pattern ID) of the description pattern estimation result 114 and presents it to the user.
[0049] Figures 6A and 6B are examples of screens displayed by the description pattern estimation unit 130 in S513 (hereinafter referred to as the "estimation result presentation screen 600"). Figure 6A is an example of the estimation result presentation screen 600 generated by the description pattern estimation unit 130 when the description pattern estimation is successful, and Figure 6B is an example of the estimation result presentation screen 600 generated by the description pattern estimation unit 130 when the description pattern estimation fails.
[0050] As shown in Figures 6A and 6B, the estimated result display screen 600 includes a section for displaying the clause data 611, a section for displaying the message 612, a section for displaying / editing the pattern ID 613, and a table for displaying the description pattern. It has a display / edit field 614 and a confirmation / register button 620.
[0051] Of these, the content (text data) of Article 111 is displayed in the Article data display field 611.
[0052] Message display area 612 displays messages regarding the results of the description pattern estimation (such as whether the description pattern estimation was successful or failed).
[0053] The pattern ID display / edit field 613 displays the pattern ID of the description pattern estimated by the description pattern estimation unit 130.
[0054] The description pattern display / edit field 614 displays the content of the description pattern estimated by the description pattern estimation unit 130. In both cases, whether the estimation is successful (Figure 6A) or unsuccessful (Figure 6B), the user can edit the content of the description pattern display / edit field 614. For example, the user can register or modify a description pattern by directly entering the description pattern into the description pattern display / edit field 614, or by operating the pull-down menu provided in the description pattern display / edit field 614.
[0055] Furthermore, if the system fails to estimate the description pattern, the user can set and register a new description pattern. In this case, the user sets a new pattern ID in the pattern ID display / edit field 613, sets the content of the description pattern in the description pattern display / edit field 614, and operates the confirmation / register button 620 (Figure 6B).
[0056] Furthermore, since the description formats for article numbers at each level are often similar, for example, a description pattern can be generated by using the description format stored as the description pattern for article numbers at one level as the description format for article numbers at another level. When estimating the description pattern, the generated description pattern can also be compared with the article data (text) 112 (or the above description pattern can also be used to estimate the description pattern of the article data). Doing so reduces the effort required to define new description patterns.
[0057] <Example of structured data> Figure 7A shows an example of structured data 115 generated by the Article Data Structuring Unit 135, which structures Article Data (text) 112 based on the description patterns estimated by the Description Pattern Estimation Unit 130. As shown in the figure, the example structured data 115 has a structure that allows an information processing device (computer) to identify Articles (text data) by specifying "Article," "Paragraph," "Item," etc. Therefore, by using the structured data 115, it is possible to easily construct, for example, an efficient search system for Articles or a system that provides information indicating the reference relationships between Articles.
[0058] Furthermore, in addition to text data, articles and regulations may also contain tabular data in the form of "appendices," etc. In such cases, for example, when the text extraction unit 125 extracts text data from the article data 111, it may determine whether or not tabular data exists (for example, by determining whether or not there is border information), and if tabular data exists, it may manage that tabular data in the structured data 1115 as information separate from the text data (as information in a separate item).
[0059] Figure 7B shows an example of structured data 115 when tabular data is managed as information distinct from text data. In the figure, the data indicated by reference numeral 1151 is tabular data.
[0060] As described above, according to the data utilization support system 1 of the first embodiment, the data utilization support device 100 estimates the description pattern of the article data, which is data of laws and regulations provided as open data, by comparing it with the description pattern, and structures the article data based on the estimated description pattern. Therefore, users can efficiently structure the article data provided as open data, and efficiently perform tasks such as identifying and confirming laws and regulations related to their work and projects.
[0061] <Example of an information processing device> Figure 8 shows an example of an information processing device used in the configuration of the data utilization support system 1 (data utilization support device 100, data utilization device 3) described above.
[0062] The illustrated information processing device 10 comprises a processor 11, main memory 12, auxiliary storage device 13 (external storage device), input device 14, output device 15, and communication device 16. These are connected via a bus, communication cable, etc. Examples of information processing devices 10 include personal computers, server devices, smartphones, tablets, office computers, and general-purpose computers (mainframes).
[0063] The information processing device 10 may be implemented, in whole or in part, using virtual information processing resources provided using virtualization technology, process space isolation technology, etc., such as a virtual server provided by a cloud system. Furthermore, all or part of the functions provided by the information processing device 10 may be implemented, for example, by services provided by the cloud system via an API (Application Programming Interface), etc. All or part of the functions provided by the information processing device 10 may be, for example, SaaS (Software as a Service), PaaS (Platform as a Service), IaaS (Infrastructure as a Service). This may also be implemented using methods such as (as a Service).
[0064] Processor 11 includes, for example, a CPU (Central Processing Unit), an MPU (Micro Processing Unit), a GPU (Graphics Processing Unit), and an FPGA (Field Programmable Graphite Array). It is composed of components such as a Gate Array, an ASIC (Application Specific Integrated Circuit), and an AI (Artificial Intelligence) chip.
[0065] The main memory 12 is a device used by the processor 11 when executing programs, and is, for example, ROM (Read Only Memory), RAM (Random Access Memory), or non-volatile memory (NVRAM (Non-Volatile RAM)). The various functions realized in each configuration of the data utilization support system 1 are realized by the respective processor 11 reading programs and data stored in the auxiliary memory 13 into the main memory 12 and executing them.
[0066] The auxiliary storage device 13 is a device for storing programs and data, such as an SSD (Solid State Drive), a hard disk drive, or an optical storage device (CD (Compact Disc)). The auxiliary storage device 13 can be composed of a DVD (Digital Versatile Disc), a storage system, an IC card, a reader / writer for non-temporary recording media such as SD cards and optical recording media, and a non-temporary storage area of a cloud server. The auxiliary storage device 13 can read programs and data from non-temporary recording media or other information processing devices equipped with non-temporary storage devices via a recording media reader or a communication device 16. Programs and data stored in the auxiliary storage device 13 are read into the main memory 12 as needed.
[0067] The input device 14 is an interface that accepts input of information from an external source, such as a keyboard, mouse, touch panel, card reader, pen input tablet, or voice input. These are power devices, etc.
[0068] The output device 15 is an interface for outputting various information such as processing progress and processing results to the outside. The output device 15 may be, for example, a display device that visualizes the above-mentioned information (LCD monitor, LCD (Liquid Crystal Display), graphics card, etc.), a device that converts the above-mentioned information into sound (speaker, etc.), or a device that converts the above-mentioned information into text (printer, etc.). For example, the information processing device 10 may be configured to input and output information to and from other devices via the communication device 16.
[0069] The input device 14 and the output device 15 constitute a user interface that enables interactive processing with the user (receiving information, providing information, etc.).
[0070] The communication device 16 is a device that enables communication with other devices. The communication device 16 is a wired or wireless communication interface that enables communication with other devices via the communication network 6, and is, for example, a NIC (Network Interface Card), a wireless communication module, a USB module, etc.
[0071] The information processing device 10 may have, for example, an operating system, a file system, a DBMS (Database Management System) (relational database, NoSQL, etc.), a KVS (Key-Value Store), etc. installed on it.
[0072] [Second Embodiment] In the first embodiment of the data utilization support system 1, the description pattern of article data was estimated based on the character description format (notation format 313, number character type 314, sub-number capability 315) of information representing the hierarchy of article numbers used in laws and regulations ("article," "paragraph," "item," etc.). In the second embodiment of the data utilization support system 1, the description pattern of article data is estimated by further considering the layout of each hierarchy of article numbers as one of the description formats. Specifically, it focuses on the fact that many laws and regulations are written with different indentation (indent: character spacing) for each "article," "paragraph," and "item," and estimates the description pattern based on the differences in indentation for each hierarchy of article numbers. Note that the basic configuration of the second embodiment of the data utilization support system 1 is the same as that of the first embodiment of the data utilization support system 1, so the following explanation will focus on the differences from the first embodiment.
[0073] Figure 9 shows an example of the description pattern definition information 113 of the second embodiment. As shown in the figure, the example description pattern definition information 113 has an additional item for indentation 316, in addition to the items of the description pattern definition information 113 of the first embodiment. In the example description pattern definition information 113, the indentation is expressed in "mm" (millimeters), but it may also be expressed in terms of the number of spaces (blanks) or tabs.
[0074] Figure 10 is a block diagram illustrating the main functions of the data utilization support device 100 according to the second embodiment. Figure 11 is a system flow diagram illustrating the main operations of the data utilization support device 100 according to the second embodiment.
[0075] As shown in Figure 10, the data utilization support device 100 of the second embodiment has a layout information extraction unit 160 in addition to the functions of the data utilization support device 100 of the first embodiment. Furthermore, the storage unit 110 of the data utilization support device 100 of the second embodiment stores layout information 116, which is information regarding the layout (indentation) of the clause data 111, in addition to the information (data) stored by the storage unit 110 of the data utilization support device 100 of the first embodiment.
[0076] As shown in Figure 11, the layout information extraction unit 160 extracts information indicating the layout (indentation) from the article data 111 and inputs it to the description pattern estimation unit 130. The description pattern estimation unit 130 also estimates the description pattern of the article data (text) 112 by comparing the layout information 116 together with the description pattern definition information 113, and manages the estimated result as the description pattern estimation result 114.
[0077] Figure 12 shows an example of the estimated result display screen 600 shown by the data utilization support device 100 of the second embodiment.
[0078] Thus, the data utilization support system 1 of the second embodiment further considers the layout of each level of the article number as one of the description formats to estimate the description pattern of the article data, and can estimate the description pattern of the article data 111 with higher accuracy.
[0079] Although embodiments have been described above, the present invention is not limited to the embodiments described above, and various modifications are included, and it is not necessarily limited to having all the configurations described. Furthermore, it is possible to replace parts of the configuration of one embodiment with the configuration of another embodiment, or to add configurations from other embodiments to the configuration of one embodiment. In addition, it is possible to add, delete, or replace parts of the configuration of each embodiment with other configurations.
[0080] For example, the present invention can also be applied to cases where the electronic data of documents other than laws and regulations, consisting of a collection of documents assigned hierarchical numbers, is to be managed as structured data. [Explanation of symbols]
[0081] 1 Data utilization support system, 2 Provisioning server, 3 Data utilization device, 5 Internet, 6 Communication network, 100 Data utilization support device, 110 Storage unit, 111 Article data, 112 Article data (text), 113 Description pattern definition information, 114 Description pattern estimation result, 115 Structured data, 120 Information acquisition unit, 125 Text extraction unit, 130 Description pattern estimation unit, 135 Article data structuring unit, 140 Structured data provision unit, S500 Description pattern estimation process, S512 Description pattern matching process
Claims
1. It is configured using an information processing device having a processor and a memory device, As one piece of information indicating the description format of article numbers, which are numbers assigned hierarchically to articles contained in article data, which are electronic data describing laws and regulations, multiple description patterns are stored for each level of the said article number, each defining a combination of notation format and indentation. Obtain the clause data to be structured, By comparing the acquired article data with the description pattern, the description pattern of the article data is estimated. Based on the estimated description pattern, structured data, which is structured data of the aforementioned clause data, is generated and output. Data utilization support system.
2. A data utilization support system according to claim 1, The description pattern is defined by a combination of the hierarchy of the article number, the notation format, the type of characters used to represent the article number, whether or not sub-numbers can be used in the article number, and the indentation. Data utilization support system.
3. A data utilization support system according to claim 1, The results of estimating the description pattern for the aforementioned clause data are presented to the user via the user interface. Data utilization support system.
4. A data utilization support system according to claim 1, A user interface for editing the aforementioned description pattern is provided to the user. Based on the description pattern received via the user interface, the structured data is generated. Data utilization support system.
5. A data utilization support system according to claim 1, The description format stored as the description pattern for the article number at one level is used to generate the description pattern for the article number at another level. By comparing the aforementioned article data with the description pattern that generated it, the description pattern of the aforementioned article data is estimated. Data utilization support system.
6. A data utilization support system according to claim 1, The aforementioned article numbers are assigned according to the hierarchy of articles, paragraphs, and subparagraphs. Data utilization support system.
7. An information processing device having a processor and a memory device, One of the pieces of information indicating the description format of article numbers, which are numbers assigned hierarchically to articles contained in article data, which are electronic data describing laws and regulations, is to store multiple description patterns that define combinations of notation format and indentation for each level of the said article number. Steps to obtain the clause data to be structured, The steps include: comparing the acquired article data with the description pattern to estimate the description pattern of the article data, and A step of generating and outputting structured data, which is structured data of the clause data based on the estimated description pattern, A data utilization support method that performs this task.
8. A data utilization support method according to claim 7, The description pattern is defined by a combination of the hierarchy of the article number, the notation format, the type of characters used to represent the article number, whether or not sub-numbers can be used in the article number, and the indentation. Data utilization support methods.
9. A data utilization support method according to claim 7, The information processing device provides to the user, via a user interface, the result of estimating the description pattern for the clause data. A data utilization support method that further implements the above.
10. A data utilization support method according to claim 7, The aforementioned information processing device A step of providing a user interface to the user for editing the aforementioned description pattern, And, A step of generating the structured data based on the description pattern received via the user interface, A data utilization support method that further implements the above.