Data association method and system for open data set
A technology of open data and data association, which is applied in the field of data association methods and systems of open datasets, can solve the problems that the value of open data cannot be fully exploited, open datasets are difficult for data users to understand and utilize, and lack of semantic association of dataset data description etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0071] like figure 1 As shown, this embodiment provides a data association method for an open dataset, including the following steps:
[0072] S1. Perform data preprocessing on open datasets, and convert datasets in different file formats into json file formats;
[0073] S2. Analyze the open data set after the preprocessing is completed, and obtain the characteristic data of the open data set. The characteristic data of the open data set is specifically a description of metadata of the data set and a description of metadata of the data.
[0074] S3. Use machine learning technology to analyze the metadata description of the dataset to obtain the theme of the open dataset;
[0075] More specifically, step S3 includes the following steps:
[0076] S31. Using a tokenizer to segment the metadata description of the dataset to obtain a word segmentation result;
[0077] S32. Calculate the tf-idf feature vector described by the metadata of the data set according to the word segment...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com