Word relationship mining method and device
A relationship mining and relationship technology, applied in the Internet and computer fields, can solve the problems of multi-error relationships and low correct rate, etc., to achieve the effect of improving correlation, improving correct rate, and improving user experience
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0062] In order to improve the correctness of the mined word relationship and improve user experience, the embodiment of the present invention provides a word relationship mining method, see figure 1 , the content of the method is as follows:
[0063] 101: Obtain the candidate relationship between two entries, the frequency of the candidate relationship, and the word frequency of the entry;
[0064] 102: Obtain the statistical value of mutual information and the statistical value of log likelihood ratio according to the candidate relationship, frequency and word frequency;
[0065] 103: Obtain a normalized value of credibility according to the statistical value of the mutual information and the statistical value of the log likelihood ratio;
[0066] 104: Sorting according to the normalized value of the credibility, and outputting candidate relationships that meet the preset threshold as word relationships.
[0067] Among them, obtaining the candidate relationship between two...
Embodiment 2
[0090] In order to improve the correct rate of mined word relations and improve user experience, the embodiment of the present invention provides a word relation mining method, see figure 2 , the content of the method is as follows:
[0091] 201: The computer prepares the original corpus data;
[0092] Wherein, in this embodiment, the corpus is composed of question-and-answer documents.
[0093] 202: Obtain the title and the first best answer from the original corpus data prepared in step 201;
[0094] Among them, taking each question and answer document in the original corpus data as a unit, since the title and answer in each question and answer document are marked by a specific delimiter, you can enter a specific delimiter to obtain the title and answer, During processing, identify each question and answer document one by one until all question and answer documents are identified. In the embodiment of the present invention, the title delimiter refers to the title startin...
Embodiment 3
[0142] In order to improve the correct rate of mined word relations and improve user experience, the embodiment of the present invention provides a word relation mining method, see image 3 , the specific method is as follows:
[0143] 301: The computer prepares the original corpus data;
[0144] Wherein, in this embodiment, its corpus is composed of common documents.
[0145] 302: Take each sentence as a unit, perform word segmentation processing, and obtain a set of lemmas;
[0146] Among them, in order to have correlation between the excavated words, the word segmentation processing is usually performed on each sentence in each document in the corpus, and the word segmentation processing is performed to obtain a set of entries composed of the sentence, for example The sentence is "What's interesting in Beijing, please help, thank you?", through the word segmentation processing of the word segmentation system, it is obtained from the entries of "Beijing, there, what, fun, ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com