Search text processing method and device, equipment, storage medium and program product

A processing method and text technology, applied in the computer field, can solve problems such as inability to better support real-time search scenarios, complex error correction process of deep learning models, and ineffectiveness of deep learning models, so as to improve the accuracy of error correction, Reduce overall time consumption and improve accuracy

Pending Publication Date: 2022-04-12
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In recent years, deep learning has developed rapidly. Some deep learning models based on neural networks have achieved remarkable results in the field of text processing. However, in commodity search scenarios, search terms are usually short texts, and the lack of contextual information makes deep learning models unable to perform in commodity search scenarios. play a role in
In addition, due to the complexity of the error correction process of the deep learning model, it cannot well support real-time search scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Search text processing method and device, equipment, storage medium and program product
  • Search text processing method and device, equipment, storage medium and program product
  • Search text processing method and device, equipment, storage medium and program product

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0073] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0074] In order to facilitate the understanding of the embodiments of the present application, some nouns in the embodiments of the present application are firstly explained below.

[0075] Text error correction: When the user enters the text, due to the user's input method or the user's own knowledge and habits, there are typos in the entered text. In the product search scenario, it is necessary to identify typos in the input text and correct the typos, so as to display the product information corresponding to the corrected text to the user.

[0076] Error Detection: Det...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a search text processing method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring a search text for searching commodities; the method comprises the following steps: performing error correction on commodity words extracted from a commodity corpus to obtain a commodity word bank, and performing word segmentation processing on a search text to obtain a word sequence; phrases formed by the independent words in the word sequence and the adjacent words of the independent words are used as potential wrongly-identified words in the search text; on the basis of the pinyin editing distance, candidate words used for correcting the potential wrongly-identified words are searched; using the commodity corpus after error correction to train the language model to obtain a commodity word language model, and determining statement smoothness of potential wrong words and candidate words; and when the statement smoothness of the potential wrong words and the target candidate words meets a replacement condition, replacing the potential wrong words in the search text with the target candidate words to obtain an error correction text. The method is suitable for a commodity search scene.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular to a search text processing method, device, computer equipment, storage medium and computer program product. Background technique [0002] With the development of mobile Internet technology, online shopping has become more convenient, and users often search for products on e-commerce platforms for purchase. Limited by the user's knowledge and habits, or errors caused by the pinyin input method and handwriting input method in the Chinese scene, it is easy for users to enter search terms that contain typos. Without error correction, they will not be able to search for what they want to buy Products of. In the product search scenario, about 2.5% of the search terms contain typos. [0003] In recent years, deep learning has developed rapidly. Some deep learning models based on neural networks have achieved remarkable results in the field of text processing. However, in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/31G06F16/33G06F16/35G06F16/36G06F40/232G06F40/289
Inventor 余自强
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products