Corpus processing and model training method and system

A corpus processing method and technology, applied in the field of computer systems, that addresses problems such as mistyped search terms and search terms with missing characters.

Status: Pending
Publication Date: 2020-03-17
BEIJING DIDI INFINITY TECH & DEV

AI Technical Summary

Problems solved by technology

In actual use, search terms are often entered incorrectly or with missing characters.

Method used

User sessions are mined to obtain the search terms a user entered and the result the user selected. Each input search term is paired with the selected result, and the resulting corpus pairs form a parallel corpus that can be used to train a search term error correction model.

Embodiment Construction

[0030] To illustrate the technical solutions of the embodiments of the present application more clearly, the accompanying drawings used in the description of the embodiments are briefly introduced below. The drawings described below are only some examples or embodiments of the present application, and those skilled in the art can also apply the present application to other similar scenarios. Unless otherwise apparent from context or otherwise indicated, like reference numerals in the figures represent like structures or operations.

[0031] As used in this application and the claims, the words "a", "an" and/or "the" do not refer specifically to the singular and may include the plural unless the context clearly indicates an exception. Generally speaking, the terms "comprising" and "including" only indicate that the clearly identified steps and elements are included, and these steps and elements do not constitute an exc...

Abstract

The invention discloses a corpus processing and model training method and system. The method comprises the following steps: mining user sessions; obtaining the search terms input by a user and the result the user selected; combining the input search terms with the selected result to form at least one corpus pair; and constructing a parallel corpus based on the at least one corpus pair. The resulting parallel corpus can then be used for model training. By mining user sessions and analyzing the self-correction behavior in the user's search process, the method obtains a parallel corpus and uses it as training samples to build a search term error correction model.
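The corpus construction steps in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the applicant's implementation: the `Session` structure, its field names, and the choice to pair every typed term in a session with the finally selected result are all assumptions made for the example, and the sample data is invented.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass
class Session:
    """One hypothetical user search session (field names are assumptions)."""
    queries: List[str]        # search terms typed by the user, in order
    selected: Optional[str]   # result the user finally chose, if any


def corpus_pairs(session: Session) -> List[Tuple[str, str]]:
    """Pair every typed search term with the result the user selected."""
    if not session.selected:
        return []  # no selection means no correction signal to learn from
    pairs = []
    for query in session.queries:
        query = query.strip()
        if query:
            pairs.append((query, session.selected))
    return pairs


def build_parallel_corpus(sessions: Iterable[Session]) -> List[Tuple[str, str]]:
    """Collect corpus pairs from all sessions, dropping exact duplicates."""
    seen = set()
    corpus = []
    for session in sessions:
        for pair in corpus_pairs(session):
            if pair not in seen:
                seen.add(pair)
                corpus.append(pair)
    return corpus


# Invented sample sessions, for illustration only.
sessions = [
    Session(queries=["beijng station", "beijing station"],
            selected="Beijing Railway Station"),
    Session(queries=["airprt"], selected="Capital Airport"),
    Session(queries=["museum"], selected=None),
]
parallel_corpus = build_parallel_corpus(sessions)
```

Each element of `parallel_corpus` is a (possibly erroneous input, intended target) pair. Sessions without a selection contribute nothing, since there is no evidence of what the user meant.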

Description

Technical field

[0001] The present invention relates to a computer system, in particular to a method and system for model training by corpus processing.

Background art

[0002] With the development and popularization of the Internet, more and more people are accustomed to obtaining knowledge, information and services through computing devices. Efficient and fast search has also become an indispensable part of people's lives. Entering search terms in the search box is the most common way to search. In the actual use process, there are often problems such as inputting wrong search terms and missing characters when entering search terms.

[0003] In order to solve the above problems, people have proposed a search term error correction method.

Summary of the invention

[0004] The present invention provides a method for processing corpus, which specifically includes obtaining search words and selected results input by users, combining the input search words and selec...
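The visible text does not say which model is trained on the parallel corpus. As a stand-in for that step, the sketch below trains a simple frequency-based lookup baseline: for each input term it remembers the target most often paired with it. The function names, the lowercase normalization, and the sample pairs are assumptions for illustration, and a real error correction model would need to generalize to unseen inputs.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Optional, Tuple


def train_lookup_model(corpus: List[Tuple[str, str]]) -> Dict[str, str]:
    """Map each input term to the target it was most often paired with."""
    counts = defaultdict(Counter)
    for source, target in corpus:
        counts[source.lower()][target] += 1
    # most_common(1) returns [(target, count)] for the top target
    return {source: c.most_common(1)[0][0] for source, c in counts.items()}


def correct(model: Dict[str, str], query: str) -> Optional[str]:
    """Return the learned correction, or None for an unseen input."""
    return model.get(query.lower())


# Invented parallel corpus of (input term, selected result) pairs.
corpus = [
    ("airprt", "Capital Airport"),
    ("airprt", "Capital Airport"),
    ("airprt", "Airport Road"),
    ("beijng", "Beijing"),
]
model = train_lookup_model(corpus)
suggestion = correct(model, "Airprt")
```

A lookup table only corrects inputs it has already seen, which is why the parallel corpus would more plausibly serve as training samples for a model that learns character-level or word-level corrections.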

Application Information

IPC(8): G06F16/9535, G06F40/232, G06F3/023
CPC: G06F3/0237
Inventors: 胡娟, 陈欢, 宋奇
Owner: BEIJING DIDI INFINITY TECH & DEV