Conversation marking method and device, aggregation server and storage medium

A technology of aggregation server and marking method, which is applied in the fields of device, aggregation server and storage medium, and session marking method, which can solve the problems of inability to interact with result feedback, huge amount of interactive data, and high labor cost, so as to improve marking efficiency and have a wide range of applications , to achieve a simple and convenient effect

Inactive Publication Date: 2018-04-10
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing technology, users cannot give feedback on wrong interaction results after using the voice conversation system, so the back-end personnel of the system need to manually verify the user's interaction data on a regular basis, and repeat the verification and screening of the user's input voice one by one , and mark the wrong interaction data for the retraining and improvement of the voice conversation system
[0004] However, due to the wide range of applications of the voice conversation system and the large number of users, the amount of user interaction data is very large
Therefore, using the data processing method of manual and repetitive screening and labeling of massive data at the back end in the existing technology, the labor cost is high and the processing efficiency is low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Conversation marking method and device, aggregation server and storage medium
  • Conversation marking method and device, aggregation server and storage medium
  • Conversation marking method and device, aggregation server and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] figure 1 It is a flow chart of a session marking method provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of human-computer interaction data collection in a voice conversation system, and the method can be executed by a session marking device, which can It is realized by means of software and / or hardware. refer to figure 1 , the method specifically includes the following steps:

[0029] S110. Obtain first session information and second session information corresponding to the session to be marked according to a predetermined session identifier; wherein, the first session information includes: input voice information and text information formed after performing voice recognition on the input voice information ; The second session information includes: text information and output text information formed after performing speech recognition on the input speech information.

[0030] In a specific embodiment of the present in...

Embodiment 2

[0047] Based on the first embodiment above, this embodiment provides a preferred implementation of the session marking method, which can aggregate interaction data according to a unique session identifier, and mark interactive text information at different processing stages. figure 2 A flow chart of a session marking method provided by Embodiment 2 of the present invention, such as figure 2 As shown, the method includes the following specific steps:

[0048] S210. Acquire, in the voice server, first session information corresponding to the session to be marked according to the session identifier.

[0049] In a specific embodiment of the present invention, the session identifier refers to information with an identification function that uniquely corresponds to the input voice information and is generated when the voice server receives the original user conversation voice, that is, the input voice information. The first session information includes session information receive...

Embodiment 3

[0063] image 3 A schematic structural diagram of a session marking device provided by Embodiment 3 of the present invention. This embodiment is applicable to the collection of human-computer interaction data in a voice conversation system, and the device can implement the session marking method described in any embodiment of the present invention. . Specifically, the device includes:

[0064] An acquisition module 310, configured to acquire first session information and second session information corresponding to the session to be marked according to a predetermined session identifier; wherein, the first session information includes: input voice information and speech to the input voice information text information formed after recognition; the second session information includes: text information and output text information formed after speech recognition is performed on the input speech information;

[0065] An aggregation module 320, configured to aggregate the first ses...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a conversation marking method and device, an aggregation server and a storage medium. The conversation marking method includes: acquiring first conversation information and second conversation information corresponding to a conversation to be marked according to a predetermined conversation identifier, wherein the first conversation information includes input voice information and text information formed after voice recognition is performed on the input voice information, and the second conversation information includes text information formed after voice recognition isperformed on the input voice information, and output text information; aggregating the first conversation information and the second conversation information according to the conversation identifier;receiving a conversion marking command fed back by a user, wherein the conversation marking command includes a first conversation marking command or a second conversation marking command; and markingthe aggregated first conversion information and the aggregated second conversion information according to the first conversation marking command and the second conversation marking command respectively. The manpower input can be reduced while back end data of a voice conversation system is marked, and the data processing efficiency is improved.

Description

technical field [0001] The invention relates to the technical field of Internet applications, in particular to a session marking method, device, aggregation server and storage medium. Background technique [0002] With the rapid development of artificial intelligence, intelligent robots based on voice conversation systems are applied to various fields. Through natural language dialogue, functions such as audio-visual entertainment, information query, life services, and travel conditions can be realized. [0003] At present, due to the limitation of existing speech technology and semantic technology, the ability of the machine to recognize speech and understand semantics needs to be continuously improved. Therefore, the interaction data between the user and the machine is collected regularly, and the user interaction data is verified and marked. , is a necessary prerequisite for improving the voice conversation system. In the prior art, users cannot give feedback on wrong in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/21G10L15/26
CPCG10L15/26G06F40/117
Inventor 廖大春
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products