Search content sorting method and device, storage medium and electronic equipment

A technology for searching content and sorting methods, which is applied in the search field and can solve the problems of small application range of correlation models and high labeling costs

Active Publication Date: 2020-12-18
BEIJING SANKUAI ONLINE TECH CO LTD
View PDF9 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present disclosure is to provide a search content sorting method, device, storage medium and electronic equipment to solve the technical problem in the related art that the application range of the correlation model is small and the labeling cost is high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Search content sorting method and device, storage medium and electronic equipment
  • Search content sorting method and device, storage medium and electronic equipment
  • Search content sorting method and device, storage medium and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] Specific embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present disclosure, and are not intended to limit the present disclosure.

[0064] In related technologies for ranking search content, Query-Doc text relevance features can be determined through BM25 model and DSSM (Deep Structured Semantic Models, deep structure semantic model). However, the BM25 model can only handle the case where Query and Document have overlapping words (literal matching), and cannot handle the semantic correlation of words. Due to the semantic difference between Query and Document, there may be many situations where the semantics are similar but the text does not match. In addition, there are semantic differences between literal matching text, such as "machine learning" and "learning machine". Therefore, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a search content sorting method and device, a storage medium and electronic equipment, and the method comprises the steps: determining a correlation score between each searchcontent corresponding to a search word and the search word through a pre-trained semantic correlation model; sorting the plurality of search contents according to the correlation score, wherein the training process of the semantic correlation model comprises the following steps: pre-training a language model through a plurality of search term samples and a first search content sample determined according to historical operation behaviors of a user for a plurality of search contents corresponding to each search term sample; and finely adjusting the pre-trained language model through the plurality of search term samples and two second search content samples corresponding to each search term sample, wherein the second search content samples are attached with tags used for representing whetherthe search content samples are related to the search term samples or not. According to the method, the correlation score of the search content can be determined through the pre-trained and fine-tunedsemantic correlation model, the application range of the semantic correlation model is widened, and the annotation cost is reduced.

Description

technical field [0001] The present disclosure relates to the technical field of search, and in particular, to a search content sorting method, device, storage medium and electronic equipment. Background technique [0002] The search platform can recommend several search results (hereinafter referred to as Document) to the user according to the keywords entered by the user (hereinafter referred to as Query). The search results need to be displayed to the user in the itinerary search result list after sorting. Therefore, the accuracy of the search result sorting directly affects the performance of the platform. Among them, text semantic relevance is one of the core factors in ranking, which is directly linked to the search experience. The purpose of judging the semantic relevance of text is to calculate the degree of correlation between the search term and the search content, that is, to judge whether the search content meets the user's search needs. It is one of the functio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9538G06F16/9535
CPCG06F16/9535G06F16/9538
Inventor 杨扬王金刚步佳昊周翔李勇张富峥陈胜仙云森王仲远
Owner BEIJING SANKUAI ONLINE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products