Data association method and system for open data set
A technology of open data and data association, which is applied in the field of data association methods and systems of open datasets, can solve the problems that the value of open data cannot be fully exploited, open datasets are difficult for data users to understand and utilize, and lack of semantic association of dataset data description etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0071] like figure 1 As shown, this embodiment provides a data association method for an open dataset, including the following steps:
[0072] S1. Perform data preprocessing on open datasets, and convert datasets in different file formats into json file formats;
[0073] S2. Analyze the open data set after the preprocessing is completed, and obtain the characteristic data of the open data set. The characteristic data of the open data set is specifically a description of metadata of the data set and a description of metadata of the data.
[0074] S3. Use machine learning technology to analyze the metadata description of the dataset to obtain the theme of the open dataset;
[0075] More specifically, step S3 includes the following steps:
[0076] S31. Using a tokenizer to segment the metadata description of the dataset to obtain a word segmentation result;
[0077] S32. Calculate the tf-idf feature vector described by the metadata of the data set according to the word segment...
PUM

Abstract
Description
Claims
Application Information

- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com