Word vector generation method and device supporting polarity differentiation and polysense
A word vector and polarity technology, applied in the field of word vector generation method and device that supports polarity distinction and polysemy, can solve the problem of easy matching errors of word vectors, achieve the effect of improving the impact of matching results and solving matching errors
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0076] see figure 1 , is a flow diagram of a word vector generation method that supports polarity discrimination and polysemy. In this embodiment, the method for generating a word vector according to a resource file includes the following steps:
[0077] S101: Obtain a word vector model and a resource file in the current business scenario, where the resource file includes multiple sememes corresponding to the semantics in the current business scenario.
[0078] In this embodiment, after determining which business field the text information to be processed belongs to, it is first necessary to obtain the word vector model in the current business scenario, generally by calling the established word vector model in the server or database. The word vector model here refers to a set composed of a large number of word vectors, that is, through the training corpus and the association relationship between the words appearing in the business documents in the current business scenario, t...
Embodiment 2
[0117] The difference between this embodiment and Embodiment 1 is that, if image 3 As shown, in the step of determining the operation weight according to the semantic information and the set target word calculation value, including:
[0118] S201: Count the semantic information, including all sememes corresponding to the semantics with the largest number of sememes and the number of occurrences of each sememe;
[0119] S202: Determine the total value of weight calculation according to the total number of occurrences of all the sememes in the semantics containing the largest number of sememes and the sum of the calculated value of the target word;
[0120] S203: Calculate the ratio of the number of occurrences of each sememe to the total value, and determine the operation weight of each sememe and the operation weight of the target word.
[0121] In this embodiment, determining the calculation weight needs to select the semantic information that contains the largest number of...
Embodiment 3
[0138] see Figure 5 , in this embodiment, the method for generating word vectors includes the following steps:
[0139] S301: Obtain a word vector model and a resource file in the current business scenario, and acquire a sentence text containing the target word, and the resource file includes multiple sememes corresponding to the semantics in the current business scenario;
[0140] S302: Determine the original word vector corresponding to the target word according to the word vector model; extract the semantic information corresponding to the target word in the resource file, the semantic information includes a plurality of sememes under semantics and each the number of occurrences of the sememe;
[0141] S303: Determine a set of adjacent words of the target word in the sentence text, where the set of adjacent words is a set of multiple words adjacent to the target word in the sentence text;
[0142] S304: According to the adjacent word set and the semantic information, det...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


