A construction method of a universal string similarity measurement framework
A similarity measurement and construction method technology, applied in the field of data mining, can solve problems such as difficult expansion, limitations, and complex metrics
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0027] A kind of construction method of general character string similarity measurement framework of the present invention, concrete process is:
[0028] (1) First set X={x 0 ,x 1 ,x 2 ,...} and Y={y 0 ,y 1 ,y 2 ,...} are two groups of strings to be compared, element x in X and Y i and y j sequence of characters with composed of with Respectively x i and y j The p-th and q-th characters in , m and n are x i and y j of length; string similarity measures are often used to find x i and y j The best mapping pair or evaluates a particular x i with each y in Y j similarity between.
[0029] (2) Secondly, the matched or similar set M={(x i ,y j ); x i =y j ,x i ∈X,y j ∈Y} and non-matching set N={(x i ,y j ); x i ≠y j ,x i ∈X,y j A set of character strings X×Y={(x i ,y j ); x i ∈X,y j ∈Y}.
[0030] (3) Then based on matching or similar set M={(x i ,y j ); x i =y j ,x i ∈X,y j ∈Y} and non-matching set N={(x i ,y j ); x i ≠y j ,x i ∈X,y ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


