
Categorizing a sensitive data field in a dataset

A technology for categorizing sensitive data fields in data sets, applied in the field of classification systems, which addresses the problem that data providers cannot directly share sensitive data.

Pending Publication Date: 2021-12-21
KONINKLIJKE PHILIPS NV

AI Technical Summary

Problems solved by technology

[0003] Given such sensitive data, data providers are often unable to share it directly.



Embodiment Construction

[0039] While the invention is susceptible of embodiment in many different forms, one or more specific embodiments are shown in the drawings and will be described in detail herein, with the understanding that the present disclosure is to be considered as exemplary of the principles of the invention and is not intended to limit the invention to the specific embodiments shown and described.

[0040] In the following, for ease of understanding, elements of the embodiments are described in operation. However, it will be apparent that the respective elements are arranged to perform the functions described as being performed by them.

[0041] Furthermore, the invention is not limited to these embodiments, and the invention lies in every novel feature or combination of features described herein or recited in mutually different dependent claims.

[0042] Various embodiments relate to classifying sensitive data fields in data sets. Such a data set may consist of on...


Abstract

Some embodiments are directed to a categorization system for categorizing a sensitive data field in a dataset, e.g., a disease classification according to the ICD classification. A client device is to obtain categories for one or more records of the dataset. The client device determines categorization data for the categorization. The categorization data comprises homomorphic encryptions of possible values of the sensitive data field and encodings of the categories associated to the respective possible values, thus keeping the categorization secret. A data provider device stores the dataset and determines homomorphic encryptions indicating differences between the value of the sensitive data field for a record and respective possible values. A categorization device determines which of those encryptions indicates a match and provides a category encoding associated with a matching possible value to the client device. The client device associates the encoded category to the record.
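The message flow in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the scheme claimed in the application: it assumes an additively homomorphic Paillier-style cryptosystem, assumes the categorization device holds the private key, assumes each difference is blinded by a random multiplier, and models a category encoding as a random token that only the client can resolve. All function names, the integer encoding of codes, and the toy primes are hypothetical, and the parameters are far too weak for real use.

```python
import math
import secrets

# Toy Paillier parameters (assumption: two Mersenne primes, illustration only, NOT secure).
P, Q = 2**61 - 1, 2**89 - 1
N = P * Q                  # public key
N2 = N * N
PHI = (P - 1) * (Q - 1)    # private key (assumed to sit on the categorization device)
MU = pow(PHI, -1, N)       # modular inverse, Python 3.8+

def random_unit():
    """Random element of [1, N-1] that is coprime to N."""
    while True:
        r = secrets.randbelow(N - 1) + 1
        if math.gcd(r, N) == 1:
            return r

def encrypt(m):
    """Paillier encryption with generator g = N + 1."""
    return ((1 + (m % N) * N) * pow(random_unit(), N, N2)) % N2

def decrypt(c):
    return ((pow(c, PHI, N2) - 1) // N) * MU % N

def encode_value(code):
    """Hypothetical integer encoding of a short ASCII code such as an ICD code."""
    return int.from_bytes(code.encode("ascii"), "big")

def client_prepare(categorization):
    """Client: encrypt each possible value and attach an opaque category token."""
    data, token_to_category = [], {}
    for value, category in categorization.items():
        token = secrets.token_hex(8)          # fresh token per possible value
        token_to_category[token] = category
        data.append((encrypt(encode_value(value)), token))
    return data, token_to_category

def data_provider_step(categorization_data, record_code):
    """Data provider: encrypted, blinded differences to every possible value."""
    minus_v = encrypt(-encode_value(record_code))
    out = []
    for enc_possible, token in categorization_data:
        diff = (enc_possible * minus_v) % N2               # Enc(possible - record)
        out.append((pow(diff, random_unit(), N2), token))  # Enc(k * (possible - record))
    return out

def categorization_device_step(pairs):
    """Categorization device: a difference that decrypts to 0 indicates a match."""
    for enc_diff, token in pairs:
        if decrypt(enc_diff) == 0:
            return token
    return None

# Usage: the client learns the category of a record, not its exact code.
data, lookup = client_prepare(
    {"E10.9": "diabetes", "E11.9": "diabetes", "I10": "hypertension"}
)
token = categorization_device_step(data_provider_step(data, "E11.9"))
category = lookup.get(token)  # "diabetes"
```

In this sketch the random multiplier means the categorization device sees only zero or a random-looking nonzero value for each difference, and the per-value tokens keep the category mapping on the client. Whether the application distributes keys and encodings this way is not stated in the abstract.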

Description

Technical field

[0001] The present invention relates to a classification system, a client device, a data provider device and a classification device. The invention also relates to methods and computer-readable storage media corresponding to the respective devices.

Background

[0002] In medical research, researchers often use multiple datasets, for example for training and validation of machine learning algorithms and models, or for medical hypothesis testing. Having access to more and better quality data generally leads to higher quality results. Therefore, researchers often request data from other institutions for analysis. Gaining such access can be challenging, however, as the data requested is often privacy-sensitive and includes, for example, detailed disease classifications according to the International Classification of Diseases (ICD) or location information such as postal codes. The exchange of such privacy-sensitive information is often restricted by v...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): H04L9/00
CPC: H04L9/008, H04L2209/88, H04L2209/42, G16H70/60, G06F21/602, G06F21/6227
Inventor: P. P. van Liesdonk, D. Pletea, R. P. Koster
Owner: KONINKLIJKE PHILIPS NV