Unlock instant, AI-driven research and patent intelligence for your innovation.

Method, device, medium and electronic equipment for measuring similarity between sets

A similarity and collection technology, applied in the computer field, can solve problems such as the impact of similarity accuracy, high computational complexity, and low confidence, and achieve the effects of reducing implementation costs, improving flexibility, and improving ease of use

Active Publication Date: 2021-06-04
KE COM (BEIJING) TECHNOLOGY CO LTD
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The samples obtained based on scenarios such as low-frequency interaction often have the characteristics of small values ​​of some dimensions or even all dimensions. The confidence of the probability distribution formed by such samples is often low, which will affect the accuracy of the similarity between the two sets. sex has an impact
In addition, when the dimensions of the samples in the set are high, the calculation amount of method 1 is often high
Furthermore, the first method cannot realize the control of the accuracy of the similarity between the two sets
[0006] The implementation complexity of the second method (such as the amount of calculation, etc.) is often strongly related to the number of samples in the collection. In some low-precision application scenarios, the second method is often not suitable for
Furthermore, the second method also cannot realize the control of the accuracy of the similarity between the two sets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, medium and electronic equipment for measuring similarity between sets
  • Method, device, medium and electronic equipment for measuring similarity between sets
  • Method, device, medium and electronic equipment for measuring similarity between sets

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Example embodiments according to the present disclosure will be described in detail below with reference to the accompanying drawings. Apparently, the described embodiments are only some of the embodiments of the present disclosure, rather than all the embodiments of the present disclosure, and it should be understood that the present disclosure is not limited by the exemplary embodiments described here.

[0040] It should be noted that relative arrangements of components and steps, numerical expressions and numerical values ​​set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise.

[0041] Those skilled in the art can understand that terms such as "first" and "second" in the embodiments of the present disclosure are only used to distinguish different steps, devices or modules, etc. necessary logical sequence.

[0042] It should also be understood that in the embodiments of the present disclosure, "plurality...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method, device, medium and electronic equipment for measuring similarity between sets are disclosed. The method includes: obtaining the sample vectors of each sample in the first set and the second set; using all the samples in the first set and the second set as samples in the cluster to be processed; according to the one-dimensional vector of the sample vector Take the value to determine the partition value for the cluster division of the cluster to be processed this time; use the partition value to perform cluster division processing on the samples in the cluster to be processed, and obtain at least one new cluster; Set to determine the current partition state; if the current partition state does not meet the preset partition stop state, then update the cluster to be processed according to the new cluster, and execute the cluster partition processing again; if it meets the preset partition stop state, then according to the currently executed cluster partition The number of times of processing determines the similarity between the first set and the second set. The disclosure can realize controllable accuracy of the similarity between sets.

Description

technical field [0001] The present disclosure relates to computer technology, and in particular to a method for measuring similarity between sets, a device for measuring similarity between sets, a storage medium and electronic equipment. Background technique [0002] The existing methods for measuring the similarity between sets mainly include the following two methods: [0003] Method 1. Probability-based measurement method. This method needs to determine the probability distribution of the samples in the two sets first, and then determine the similarity between the two sets based on the probability distribution of the samples in the two sets. [0004] The second method is the space-based measurement method. This method needs to first map the samples in the two sets to the space respectively, and then determine the typical samples in the two sets based on the space, and use the typical samples as calculation items, so as to use the obtained calculation items to perform di...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/22
Inventor 李嘉晨郭凯
Owner KE COM (BEIJING) TECHNOLOGY CO LTD