Supercharge Your Innovation With Domain-Expert AI Agents!

A method and system for similarity measurement

A similarity measurement and similarity technology, applied in the field of information processing, can solve the problems of difficulty in similarity calculation, judgment, and high computational complexity, and achieve a good effect of calculating similarity

Active Publication Date: 2017-11-07
百度移信网络技术(北京)有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] First of all, the similarity measurement method based on attribute vectors needs to know the attribute vectors of the two objects to be compared, that is, the attribute vectors need to be known. In the case of unknown attribute vectors, the similarity cannot be judged.
[0009] Second, there is the problem of low accuracy
For example, in the case of using the cosine similarity measurement method, if the attribute vectors are not independent, that is, not orthogonal, the calculated similarity is inaccurate
For example, when the attribute vector of object Aa is {x1, y1, z1}, and the attribute vector of object Bb is {x2, y2, z2}, in the case of correlation between the above attributes, that is, in the case of non-orthogonality, its calculation The accuracy of the similarity is low, and there is a lot of information loss
[0010] In addition, there is a problem of high computational complexity
The similarity measurement method based on the association relationship and the similarity measurement method based on statistics need to find the relationship between the comparison objects, and the process is relatively complicated, which makes the similarity calculation difficult.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for similarity measurement
  • A method and system for similarity measurement
  • A method and system for similarity measurement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] First take the continuous case as an example. For the continuous case, the given weights are all 1. Examples of book recommendations in online bookstores, refer to figure 1 The similarity measurement method is described. First, as shown in step S1, the server collects all user information and all book information in the online bookstore, as well as all historical data of users clicking and reading books. Set the collection of all books in the online bookstore as the collection M (m1, m2, ...), and the collection of all users as the collection N (n1, n2, ...), assuming that the elements in the collection M and the collection N have The attribute value of satisfies the uniform distribution from positive infinity to negative infinity. Below we introduce how to obtain the similarity between users based on the historical data of users' operations on books without knowing any attribute information of books or users.

[0053] Now assume that the book that user n1 wants to ...

Embodiment 2

[0073] Take the calculation of the similarity between users and users, or between items and items in order to recommend items to users in online shopping as an example, refer to figure 2 Make the following instructions. First, if figure 2 As shown in step S11 of , the server collects information according to the user's login and registration, the items sold on the website, and the user's operation on the item, that is, the collected information includes the user, the item, and the interaction between the user and the item , to get data about users, items, and user operations on items. The server analyzes the above information, one is the user collection User, the other is the item collection Item, and the user's operation records on the items. Here, each user's operation on the item is independent of each other, and each operation expresses the same meaning, which expresses the user's interest in the item. Table 1 shows the interaction between the existing user set User a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method and system for measuring similarity. The method for measuring the similarity comprises the following steps of a data acquiring step, to be specific acquiring elements item-a in a set a, elements item-b in a set b and the times sim(item-a, item-b) for the non-different similarity operation of elements item-a in the set a for the elements item-b in the set b; a similarity calculation step, to be specific, executing the calculation of a similarity value sim'(item-bi, item-bj) of internal elements item-bi and elements item-bj of the set b based on the formula in the specification, wherein i, j, m and n indicate mark numbers in the set, and k is a normalization factor.

Description

technical field [0001] The invention relates to the field of information processing, in particular to a method and system for similarity measurement in the field of information processing. Background technique [0002] Currently, similarity measurement is involved in many fields, such as the Internet industry, and similarity analysis is performed based on various existing similarity measurement methods. [0003] For example, in the field of personalized recommendation, etc., the server collects and stores a large amount of data on users and their operation objects, and needs to recommend relevant operation objects that users may be interested in based on the operations performed by users. The similarity between the recommended operation object and the operation object operated by the user is used to recommend the operation object with high similarity to the user. Here, in terms of similarity measurement methods, there are generally the following types. [0004] A similarit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/2462
Inventor 朱宝
Owner 百度移信网络技术(北京)有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More