Real-time audio dialogue report generation method and device, electronic equipment and storage medium

A report generation technology applied in speech analysis, speech recognition, electrical digital data processing, and related fields. It addresses delayed dialogue reports, low efficiency and accuracy of dialogue report generation, and long report generation times, with the effects of improving text quality, improving generation efficiency and accuracy, and reducing the amount of text.

Pending Publication Date: 2021-09-21
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] However, because dialogue audio can be long, the transcribed text may run to thousands of rounds. If the dialogue report is produced only after the dialogue is over, then on the one hand the large amount of transcribed text lengthens report generation and delays the dialogue report; on the other hand, predicting topics, customer concerns, and customer wishes over a large amount of text in a short period puts heavy pressure on the server, which reduces the accuracy and efficiency of text prediction and therefore the efficiency and accuracy of dialogue report generation.



Examples


Embodiment 1

[0060] Figure 1 is a flowchart of a method for generating a real-time audio dialogue report provided by Embodiment 1 of the present invention.

[0061] In this embodiment, the method for generating a real-time audio dialogue report can be applied to an electronic device. For an electronic device that needs to generate real-time audio dialogue reports, the report generation function provided by the method of the present invention can be integrated directly on the device, or it can run on the device in the form of a software development kit (SDK).

[0062] As shown in Figure 1, the method for generating a real-time audio dialogue report includes the following steps. Depending on requirements, the order of the steps in the flowchart can be changed, and some steps can be omitted.

[0063] S11, in response to the audio dialogue request, query whether there are idle ASR resources...
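As a rough illustration of the idle-resource query in S11, the following Python sketch models ASR capacity as a fixed pool of slots. The names `AsrResourcePool`, `acquire`, `release`, and `handle_dialogue_request` are hypothetical and do not come from the patent, and the excerpt does not say what happens when no resource is idle, so the sketch only reports that case.

```python
import threading
from typing import Optional


class AsrResourcePool:
    """Hypothetical fixed-size pool of ASR worker slots (not from the patent)."""

    def __init__(self, size: int) -> None:
        self._lock = threading.Lock()
        self._idle = set(range(size))

    def acquire(self) -> Optional[int]:
        """Return an idle slot id, or None when every slot is busy."""
        with self._lock:
            return self._idle.pop() if self._idle else None

    def release(self, slot: int) -> None:
        """Mark a slot as idle again once its dialogue has finished."""
        with self._lock:
            self._idle.add(slot)


def handle_dialogue_request(pool: AsrResourcePool) -> str:
    """Query the pool when an audio dialogue request arrives."""
    slot = pool.acquire()
    if slot is None:
        # Assumption: the behaviour for this branch is not given in the excerpt.
        return "no idle ASR resource"
    return f"assigned ASR slot {slot}"
```

The lock keeps the check and the assignment atomic, so two concurrent requests cannot be handed the same slot.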

Embodiment 2

[0139] Figure 3 is a structural diagram of a real-time audio dialogue report generation device provided by Embodiment 2 of the present invention.

[0140] In some embodiments, the real-time audio dialogue report generation device 30 may include a plurality of functional modules composed of program code segments. The program code of each segment can be stored in the memory of the electronic device and executed by the at least one processor to perform the real-time audio dialogue report generation function (see the description of Figure 1 and Figure 2 for details).

[0141] In this embodiment, the real-time audio dialogue report generation device 30 can be divided into multiple functional modules according to the functions it performs. The functional modules may include: a query module 301, a control module 302, an identification module 303, a preprocessing module 304, a monitoring module 305 ...

Embodiment 3

[0213] Figure 4 is a schematic structural diagram of the electronic device provided by Embodiment 3 of the present invention. In a preferred embodiment of the present invention, the electronic device 4 includes a memory 41, at least one processor 42, at least one communication bus 43, and a transceiver 44.

[0214] Those skilled in the art should understand that the structure of the electronic device shown in Figure 4 does not limit the embodiments of the present invention. It can be a bus structure or a star structure, and the electronic device 4 can also include more or fewer hardware or software components than shown, or a different arrangement of components.

[0215] In some embodiments, the electronic device 4 is a device that can automatically perform numerical calculation and/or information processing according to preset or stored instructions, and its hardware includes but is not limited to microprocessors, ...


Abstract

The invention relates to the technical field of artificial intelligence, and provides a real-time audio dialogue report generation method and device, electronic equipment, and a storage medium. The method comprises the steps of: reporting the audio of the current sentence of an audio dialogue in real time, decoding it, and performing ASR recognition on the resulting target audio of the current sentence to obtain a first transcribed text of the current sentence; performing first preprocessing on the first transcribed text to obtain a second transcribed text of the current sentence; dynamically cutting the audio dialogue with the second transcribed text of the current sentence as the center, and determining a target transcribed text of the current sentence; inputting the target transcribed text into a pre-trained prediction model to obtain a prediction result for the current sentence; and, when the end of the audio dialogue is detected, aggregating the prediction results of all sentences to obtain a dialogue report of the audio dialogue. Because prediction runs on each dynamically cut sentence while the dialogue is in progress, only the aggregation of per-sentence results remains once the dialogue ends, which improves the efficiency and accuracy of dialogue report generation.
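The per-sentence flow in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: "first preprocessing" is reduced to whitespace cleanup, "dynamic cutting" is approximated by a fixed-radius window of neighbouring sentences around the current one, the pre-trained prediction model is replaced by a caller-supplied function, and aggregation is a simple count of predicted labels. All names (`preprocess`, `cut_around`, `DialogueReporter`, `toy_predict`) are hypothetical.

```python
from collections import Counter
from typing import Callable, Dict, List


def preprocess(text: str) -> str:
    """Assumed stand-in for 'first preprocessing': collapse whitespace."""
    return " ".join(text.split())


def cut_around(sentences: List[str], center: int, radius: int) -> str:
    """Assumed stand-in for 'dynamic cutting': a window centered on one sentence."""
    lo = max(0, center - radius)
    hi = min(len(sentences), center + radius + 1)
    return " ".join(sentences[lo:hi])


class DialogueReporter:
    """Predicts per sentence during the dialogue, aggregates at the end."""

    def __init__(self, predict: Callable[[str], str], radius: int = 1) -> None:
        self.predict = predict          # stand-in for the pre-trained model
        self.radius = radius            # hypothetical window size
        self.sentences: List[str] = []
        self.predictions: List[str] = []

    def on_sentence(self, first_transcript: str) -> str:
        """Handle one ASR transcript as soon as it arrives."""
        self.sentences.append(preprocess(first_transcript))
        target = cut_around(self.sentences, len(self.sentences) - 1, self.radius)
        label = self.predict(target)
        self.predictions.append(label)
        return label

    def on_dialogue_end(self) -> Dict[str, int]:
        """Aggregate the stored per-sentence predictions into a report."""
        return dict(Counter(self.predictions))


def toy_predict(text: str) -> str:
    """Toy keyword classifier used only to make the sketch runnable."""
    return "pricing" if "price" in text.lower() else "other"
```

With this structure the expensive prediction step is spread across the dialogue, so `on_dialogue_end` only has to combine results that already exist.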

Description

Technical field

[0001] The invention relates to the technical field of artificial intelligence, in particular to a method, device, electronic equipment, and storage medium for generating a real-time audio dialogue report.

Background

[0002] At present, when processing audio dialogues, a summary report for a long dialogue is generated only once the audio dialogue has ended.

[0003] However, because dialogue audio can be long, the transcribed text may run to thousands of rounds. If the dialogue report is produced only after the dialogue is over, then on the one hand the large amount of transcribed text lengthens report generation and delays the dialogue report; on the other hand, predicting topics, customer concerns, and customer wishes over a large amount of text in a short period puts heavy pressure on the server, which reduces the accuracy and efficiency of text prediction, resulting in the ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/26, G10L15/16, G06N3/02, G06K9/62, G06F40/30, G06F40/211
CPC: G10L15/26, G10L15/16, G06N3/02, G06F40/30, G06F40/211, G06F18/214
Inventors: 侯晓龙, 任俊松
Owner: PING AN TECH (SHENZHEN) CO LTD