Unlock instant, AI-driven research and patent intelligence for your innovation.

Metadata-based scientific data characterization driven by knowledge database at scale

A technology of scientific data and database, which is applied in the field of computer program products, can solve problems such as not taking into account changing rules, unable to expand data, etc.

Pending Publication Date: 2021-02-02
IBM CORP
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Also, manual inspection does not scale for large amounts of data distributed across several data files
[0004] However, even though conventional techniques can leverage metadata extracted from raw data files to represent them in the data management lifecycle, conventional techniques do not allow for users to iteratively and interactively change rules based on metadata to apply to multiple technology for data files, and learning systems that collect user interactions to improve rule definitions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metadata-based scientific data characterization driven by knowledge database at scale
  • Metadata-based scientific data characterization driven by knowledge database at scale
  • Metadata-based scientific data characterization driven by knowledge database at scale

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] will now refer to Figure 1-9 Describing the present invention, like reference numerals refer to like parts throughout. It is emphasized that, according to common practice, the various features of the drawings are not necessarily drawn to scale. On the contrary, the dimensions of the various features are arbitrarily expanded or reduced for clarity.

[0018] by introduction figure 1 According to the depicted example, an embodiment of the metadata-based scientific data characterization method 100 of the present invention may include various steps for a hybrid rule-based and learning-based technique that enables a user to interactively and iteratively guide the characterization of multiple rules-based processing of large raw data files to (i) determine the quality of the files and (ii) exploit the small metadata associated with the large files to find data relationships. Rules can be recommended at each new iteration of the process by a machine learning based recommende...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A metadata-based scientific data characterization method, system, and computer program product include requesting a user input for a task to specify a rule for the task to determine a quality and a relationship of a data file in a data file database based on metadata associated with the data file, processing a user feedback of results using the rule run on the data file database and tracking the user feedback on the results in order to learn from the user feedback, and based on the learning, creating a modified rule to determine a quality and a relationship of a second data.

Description

Background technique [0001] The present invention relates generally to metadata-based scientific data characterization methods, and more particularly but not limited to a method for interactively and iteratively systematizing the way a user characterizes some documents using associated metadata and generally Its systems, methods and computer program products are applied at scale. [0002] Large domain-specific scientific data files have associated metadata that is critical to the scientific experiment process. However, these files are often heterogeneous. Some files have more or better associated metadata than other files, and some files are related to other files. [0003] Traditionally, when users (e.g., scientists, such as geophysicists in geology, agronomists in agriculture, etc.) acquire large data files, they first need to characterize the files by determining the quality level of the files and finding The relationship between them (for example, a file is derived from...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/16G06N5/02
CPCG06F16/164G06N5/025G06N20/00G06F16/2448G06F16/24564G06F16/24573G06F16/144G06N3/044
Inventor R·F·S·苏扎R·M·D·G·E·席尔瓦R·D·S·费雷拉E·A·V·布拉齐尔V·T·D·席尔瓦
Owner IBM CORP
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More