Method and device for evaluating relevance of retrieval texts, server and storage medium

An evaluation device and correlation technology, applied in the Internet field, can solve the problems of inability to deal with the diversity of semantic expression, poor ability and timeliness, and inability to take into account both generalization processing ability and accuracy, and achieve both accuracy and generalization recognition. capabilities, improving comprehensiveness and matching, evaluating the effects of large text coverage

Active Publication Date: 2018-04-13
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF12 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the evaluation method of manual random evaluation involves excessive manpower investment and limited sample coverage. It only has a high accuracy rate for a single case, and its ability and timeliness to measure the overall status quo are poor, and it is impossible to intervene in batches of the system.
The evaluation method of directly comparing text similarity cannot cope with the diversity of semantic expressions, and the recognition granularity is relatively coarse. This kind of judgment is also covered in general retrieval systems, it is difficult to find in-depth problems, and the overall recognition accuracy is low
The automatic verification under the formulation of clear indicators is limited by the complexity of the strategy logic, and it cannot take into account the generalization processing ability and accuracy at the same time. The iteration cost of this type of decision model is high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for evaluating relevance of retrieval texts, server and storage medium
  • Method and device for evaluating relevance of retrieval texts, server and storage medium
  • Method and device for evaluating relevance of retrieval texts, server and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] figure 1 The flow chart of the method for evaluating the relevance of the retrieved text provided by Embodiment 1 of the present invention, this embodiment is applicable to the situation of evaluating the relevance of the retrieved text, and the method can be executed by an evaluation device for the relevance of the retrieved text, such as Can be configured in the server. Such as figure 1 As shown, the method specifically includes:

[0029] S110, perform text feature extraction on multiple sample pairs composed of query and retrieval texts, wherein the text features include original text features and structured text features.

[0030] Wherein, the query text refers to the text input by the user in the retrieval system, and the text is used to express all the required information retrieved by the user. The retrieval text is the retrieval result provided to the user by the retrieval system based on the query text.

[0031] To measure the quality of a retrieval system,...

Embodiment 2

[0042] In the location-based service (Location Based Service, LBS) field, the retrieval function is the first entrance of various LBS products. Users express all the required information for retrieval by querying text, and LBS products provide Point Of Interest (POI ) as a retrieval result, the correlation effect between the query text and POI determines the opportunity for the product to provide users with more in-depth services. The correlation between the query text and the POI can be evaluated through the method for evaluating the correlation of the retrieved text provided in the above embodiments. Specifically, Embodiment 2 provides an evaluation method for relevancy of retrieved texts applied in the field of LBS. Figure 2a It is a flow chart of the method for evaluating the correlation between retrieval and POI text provided by Embodiment 2 of the present invention. This embodiment is further optimized on the basis of the foregoing embodiments. Such as Figure 2a As s...

Embodiment 3

[0062] This third embodiment is further optimized on the basis of the above-mentioned embodiments, and further illustrates the method for extracting core description texts for queries with relational symbols and POI text samples. The flow chart is as follows Figure 3a and Figure 3b As shown, among them, Figure 3a It is a flow chart of the core description text method for extracting query and POI texts with parallel relationship symbols among multiple sample pairs in Embodiment 3 of the present invention, Figure 3b It is a flow chart of the method for extracting core description texts of query and POI texts with non-parallel relationship symbols in multiple sample pairs in Embodiment 3 of the present invention.

[0063] When extracting the core description texts of queries and POI texts with parallel relationship symbols in multiple sample pairs, such as Figure 3a As shown, the method of extracting the core description text of the query and POI text with parallel relatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a method and device for evaluating relevance of retrieval texts, a server and a storage medium. The method comprises the following steps: performing text features extraction on a plurality of sample pairs composed of query and retrieval texts, wherein the text features include original text features and structured text features; and taking the text features and correlation labels of the plurality of sample pairs as corpuses and training, and obtaining an evaluation model, wherein the evaluation mode is used for evaluating the relevance between the query and retrieval texts. According to the embodiment of the invention, when evaluating the relevance of the retrieval texts, the depth and automation of evaluation problems are taken into account, and the accuracy and generalization recognition capability of the decision logic are taken into account; besides, the comprehensiveness and the matching degree of retrieval recall can be improved through evaluation, and user experience can be improved; and meanwhile, the training and use of the evaluation model make the coverage of evaluated texts large and the cost of manual evaluation is reduced.

Description

technical field [0001] The embodiments of the present invention relate to Internet technologies, and in particular to an evaluation method, device, server and storage medium for retrieving text relevance. Background technique [0002] In the retrieval system, the user expresses all the required information through the query text, and the correlation between the retrieval results provided by the retrieval system and the query text determines the opportunity for the retrieval system to provide users with more in-depth services. The key factor to measure the quality of a retrieval system is whether it can accurately and efficiently evaluate the relevance of retrieved texts. [0003] In the prior art, methods for evaluating the relevance of retrieved texts include: manual sampling, sample selection by product evaluators, and manual comparison of multiple versions or products; direct comparison of text similarity through query text and recall text , calculate the length or propo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/334G06F40/289
Inventor 王健金鑫
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products