Real-time audio dialogue report generation method and device, electronic equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of report generation and recording device, which is applied in speech analysis, speech recognition, electrical digital data processing, etc. It can solve the problems of dialogue report delay, low efficiency and accuracy of dialogue report generation, long dialogue report generation time, etc., to improve Effects of text quality, improvement of generation efficiency and accuracy, and reduction of text quantity

Pending Publication Date: 2021-09-21

PING AN TECH (SHENZHEN) CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] However, due to the long dialogue audio, the transcribed text can be thousands of rounds. If the dialogue report is analyzed after the dialogue is over, on the one hand, due to the large amount of transcribed text, the generation time of the dialogue report will be longer, resulting in a dialogue report. Delay; on the other hand, predicting topics, customer concerns, and customer wishes on a large amount of text in a short period of time will bring huge pressure to the server, which will affect the accuracy and efficiency of text prediction, resulting in the efficiency of dialogue report generation. and low accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0060] figure 1 It is a flowchart of a method for generating a real-time audio dialogue report provided by Embodiment 1 of the present invention.

[0061] In this embodiment, the method for generating a real-time audio dialog report can be applied to an electronic device, and for an electronic device that needs to generate a real-time audio dialog report, the real-time audio dialog provided by the method of the present invention can be directly integrated on the electronic device The function of report generation may run in the electronic device in the form of a software development kit (Software Development Kit, SDK).

[0062] like figure 1 As shown, the method for generating a real-time audio dialogue report specifically includes the following steps. According to different requirements, the order of the steps in the flow chart can be changed, and some of them can be omitted.

[0063] S11, in response to the audio dialogue request, query whether there are idle ASR resources...

Embodiment 2

[0139] image 3 It is a structural diagram of a real-time audio dialogue report generation device provided by Embodiment 2 of the present invention.

[0140] In some embodiments, the real-time audio dialogue report generating device 30 may include a plurality of functional modules composed of program code segments. The program codes of the various program segments in the real-time audio dialogue report generation device 30 can be stored in the memory of the electronic device, and executed by the at least one processor to execute (see for details figure 1 and figure 2 Description) A feature for real-time audio conversation report generation.

[0141] In this embodiment, the real-time audio dialogue report generation device 30 can be divided into multiple functional modules according to the functions it performs. The functional modules may include: a query module 301 , a control module 302 , an identification module 303 , a preprocessing module 304 , a monitoring module 305 ...

Embodiment 3

[0213] see Figure 4 As shown in , it is a schematic structural diagram of the electronic device provided by Embodiment 3 of the present invention. In a preferred embodiment of the present invention, the electronic device 4 includes a memory 41 , at least one processor 42 , at least one communication bus 43 and a transceiver 44 .

[0214] Those skilled in the art should understand that, Figure 4 The structure of the electronic device shown does not constitute a limitation of the embodiment of the present invention, it can be a bus structure or a star structure, and the electronic device 4 can also include more or less other hardware than shown in the figure Or software, or a different arrangement of components.

[0215] In some embodiments, the electronic device 4 is an electronic device that can automatically perform numerical calculation and / or information processing according to preset or stored instructions, and its hardware includes but not limited to microprocessors, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of artificial intelligence, and provides a real-time audio dialogue report generation method and device, electronic equipment and a storage medium. The method comprises the steps of: reporting an audio dialogue of a current sentence in real time, carrying out decoding, and carrying out ASR recognition on an obtained target audio of the current sentence to obtain a first transcribed text of the current sentence; performing first preprocessing on the first transcribed text of the current sentence to obtain a second transcribed text of the current sentence; dynamically cutting the audio dialogue by taking the second transcribed text of the current sentence as a center, and determining a target transcribed text of the current sentence; inputting the target transcribed text into a pre-trained prediction model to obtain a prediction result of the current sentence; and when detecting that the audio dialogue is ended, aggregating the prediction results of all sentences to obtain a dialogue report of the audio dialogue. By dynamically cutting the audio dialogue and aggregating the prediction results of all sentences after the dialogue is finished to obtain the dialogue report, the dialogue report generation efficiency and accuracy are improved.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence, in particular to a method, device, electronic equipment and storage medium for generating a real-time audio dialog report. Background technique [0002] At present, in the process of audio dialogue processing, for long dialogue audio, when the audio dialogue ends, a summary report is generated for the audio dialogue. [0003] However, due to the long dialogue audio, the transcribed text can be thousands of rounds. If the dialogue report is analyzed after the dialogue is over, on the one hand, due to the large amount of transcribed text, the generation time of the dialogue report will be longer, resulting in a dialogue report. Delay; on the other hand, predicting topics, customer concerns, and customer wishes on a large amount of text in a short period of time will bring huge pressure to the server, which will affect the accuracy and efficiency of text prediction, resulting in the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/26G10L15/16G06N3/02G06K9/62G06F40/30G06F40/211

CPCG10L15/26G10L15/16G06N3/02G06F40/30G06F40/211G06F18/214

Inventor 侯晓龙任俊松

Owner PING AN TECH (SHENZHEN) CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Real-time audio dialogue report generation method and device, electronic equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology