A method and device for mining attribute name repetition
A technology of attribute and phrase pairs, which is applied in natural language data processing, special data processing applications, network data retrieval, etc., and can solve problems such as differences and obvious attribute names
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] figure 1 The flow chart of the method for mining attribute name retelling provided by Embodiment 1 of the present invention, such as figure 1 As shown, the method may include the following steps:
[0057] Step 101: Obtain at least one resource among Q-Q, Q-T and T-T from a search log (query log) as a candidate sentence pair.
[0058] The purpose of this step is to obtain the sentence pair resources used for subsequent mining from the query log. The query log records the data of the user's query session (session) and the click on the webpage title (title). The specific query log can be the query of a specified period of time log, such as query log for one day.
[0059] The aforementioned Q-Q refers to a query-query pair, which refers to two queries searched by a user in one session, and the meanings of these two queries may be the same.
[0060] The above Q-T refers to the query-clicked title pair, which refers to the query and the corresponding clicked title. Usually...
Embodiment 2
[0091] figure 2 The device structure diagram of the retelling of mining attribute names provided by Embodiment 2 of the present invention, such as figure 2 As shown, the device includes: a candidate sentence pair acquisition unit 201 , a first phrase pair extraction unit 202 , a second phrase pair extraction unit 203 and a noise filtering unit 204 .
[0092] The candidate sentence pair acquisition unit 201 obtains at least one resource in Q-Q, Q-T and T-T from the query log as a candidate sentence pair, where Q-Q is a sentence pair composed of two queries searched by a user in a session, and Q-T is a query and a corresponding The sentence pair formed by the clicked title, T-T is the sentence pair formed by two clicked titles corresponding to the same query.
[0093] The first phrase pair extracting unit 202 extracts phrase pairs with the same context from each candidate sentence pair as candidate paraphrase phrase pairs. Specifically, phrase pairs can be extracted as candi...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


