Method and device for excavating synonymous attribute words
A technology of attribute words and dictionaries, applied in the field of mining synonymous attribute words, can solve the problems of low recall rate, low efficiency, and human resource consumption
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] figure 1 The flow chart of the method provided by Embodiment 1 of the present invention, such as figure 1 As shown, the method includes the following steps:
[0076] Step 101: Obtain query set.
[0077] The query set within a certain period of time can be obtained from the search log as the corpus for extracting synonymous attribute words.
[0078] Step 102: Determine the click vector of each query in the query set, wherein the click vector of the query is composed of the clicked url corresponding to the query and the click weight of each url.
[0079] query i url in the click vector j The click weight w ij can use query i at url j The proportion of clicks on , which can be specifically expressed as the following formula:
[0080] w ij = click ij / Σ k = 1 n click ...
Embodiment 2
[0127] figure 2 The device structure diagram provided for the second embodiment of the present invention, such as figure 2 As shown, the device may include: a data acquisition unit 201 , a structured analysis unit 202 , a data extraction unit 203 , a candidate word extraction unit 204 and a synonym extraction unit 205 .
[0128] The data acquisition unit 201 acquires a query set, specifically, a query set within a certain period of time may be acquired from a search log as a prediction for extracting synonymous attribute words.
[0129] The structured parsing unit 202 performs structured parsing on each query in the query set based on the existing entity word dictionary and attribute word dictionary, and extracts a standard query. The query that does not extract the standard query is used as a non-standard query, and the standard query is composed of entity words Combination with attribute words.
[0130] Specifically, when the structured parsing unit 202 performs structur...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com