Method and system for determining word similarity
A similarity and similarity calculation technology, which is applied in natural language data processing, instruments, electrical digital data processing, etc., can solve the problem of the decrease in the accuracy of similarity calculation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] Such as figure 1 As shown, this embodiment proposes a method for determining word similarity.
[0054]In this embodiment, the method is specifically described by taking two short sentences "separate wall-mounted air conditioner" and "household air conditioner" that need to calculate word similarity as examples.
[0055] The method includes:
[0056] S100. Preprocessing the short sentence to be processed.
[0057] Specifically include: removing English and numbers in short sentences, and removing Chinese stop words in short sentences.
[0058] Remove English and numbers in short sentences, and remove Chinese stop words, such as: "Yes", "I", etc., to reduce the amount of calculation in subsequent operations. There are no parts to be cleared in the two example short sentences of this embodiment.
[0059] S200. Calculate the first similarity of the word granularity of the two short sentences.
[0060] Specifically include:
[0061] Obtain the number of the same Chines...
Embodiment 2
[0095] Corresponding to Embodiment 1 above, this embodiment proposes a system for determining word similarity, which includes:
[0096] The preprocessing module is used to preprocess short sentences to be processed;
[0097] The first similarity calculation module is used to calculate the first similarity of the word granularity of two short sentences;
[0098] The second similarity calculation module is used to calculate the second similarity of the phrase granularity of two short sentences;
[0099] The length comparison value calculation module is used to compare the length of two short sentences, and calculates the length comparison value of the two short sentences;
[0100] The final similarity acquisition module is used to perform weighted calculation of the first similarity, second similarity and length comparison value of the two short sentences, and obtain the final similarity of the two short sentences after performing data normalization.
[0101] The functions per...
Embodiment 3
[0103] Corresponding to the above embodiments, this embodiment proposes a computer storage medium, which contains one or more program instructions, and the one or more program instructions are used to be executed by a system for determining word similarity such as The method of Example 1.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


