Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Corpus collecting method and device and computer equipment

A computer program and corpus technology, applied in computing, special data processing applications, instruments, etc., can solve problems such as difficulty in effect optimization, low efficiency, high labor cost, etc. The effect of quality assurance

Pending Publication Date: 2018-06-12
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the above-mentioned corpus collection method is far away from the real scene, and it is easy to fall into personal thinking and language stereotypes, resulting in poor authenticity of the corpus, and the need to imagine the scene by yourself, which is inefficient; invest a lot of manpower to develop a complete dialogue understanding and interaction system, and the labor cost is high. , the development cycle is long, and it cannot meet the rapidly developing technology and product requirements. Each module needs to be developed and tuned separately, but the tuning of individual modules also needs corpus support; in the absence of corpus, it is very difficult to optimize the effect of each module , the effect is difficult to achieve the ideal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Corpus collecting method and device and computer equipment
  • Corpus collecting method and device and computer equipment
  • Corpus collecting method and device and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

[0024] figure 1 It is a flowchart of an embodiment of the corpus collection method of the present application, such as figure 1 As shown, the above-mentioned corpus collection method may include:

[0025] Step 101, receiving a query request input by a user.

[0026] Wherein, the above-mentioned query request may be corpus input through text, voice or picture, and this embodiment does not limit the form of the above-mentioned user input query request.

[0027] In this embodiment, the query request input by the above user b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a corpus collecting method and device and computer equipment. The corpus collecting method comprises the following steps: receiving a query request input by a user; showing templated response for the user, wherein the templated response comprises a feedback portion and a guiding portion, the feedback portion comprises result feedback to a queried text input by the user, andthe guiding portion comprises guidance of the user to a next input; and receiving corpuses input by the user. A real dialogue scenario can be shown for a corpus collecting person, and collection difficulty of task-guided multiple corpuses is reduced by balance between efficiency and validity of conversation.

Description

technical field [0001] The present application relates to the technical field of man-machine dialogue, in particular to a method, device and computer equipment for collecting corpus. Background technique [0002] Now, there are more and more scenarios for task-oriented dialogue understanding and interactive applications, but most of the scenarios only support a single round of dialogue. The technical implementation of multi-round dialogue understanding and interaction is much more difficult than single-round dialogue. The primary reason is that the acquisition of corpus for multi-round dialogue is much more difficult than single-round dialogue. The corpus of a single round of dialogue can be obtained through simple and direct enrichment by relevant personnel who are familiar with the business, but due to the interactive process of multiple rounds of dialogue, it cannot be enriched out of thin air, so it is more difficult to obtain, which directly leads to the loss of Techno...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/3329
Inventor 李和瀚周晓
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products