Telephone recording annotation method and system, storage medium and electronic equipment
A telephone and audio technology, applied in the field of audio signal processing, can solve the problems of inconvenient speech recognition and speech synthesis training, low efficiency of manual audio annotation, etc., to reduce the time of manual audio annotation and improve performance.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0049] This embodiment relates to a semi-automatic labeling method for customer service recordings, which belongs to the field of audio signal processing, and belongs to the stages of audio signal preprocessing and labeling processing. The method of endpoint detection in the field of speech signal processing is mainly used to find out the effective speech segment in the long speech, then cut and recognize the speech, and finally listen to and modify the misrecognized text subjectively.
[0050] The cut and tagged audio can be used not only for speech recognition to obtain the content of customer service recordings, but also for corpus training for speech synthesis. The voice after speech synthesis can make the intelligent customer service speak naturally like a human. The combination of the two can be used in some enterprise customer service centers, especially the intelligent customer service of the travel service center, which can reduce a lot of labor costs and greatly impro...
Embodiment 2
[0075] This embodiment provides a telephone recording labeling system, which executes the method described in Embodiment 1, such as Figure 5 shown, including:
[0076] Audio processing module 1, is used for obtaining the audio file of a telephone recording, and carries out the processing of channel separation and format conversion to said audio file;
[0077] It includes: a channel separation module 11, which is used to separate the audio file from the left channel and the right channel, and save the separated left channel audio data and right channel audio data;
[0078] And a format conversion module 12, configured to convert the sampling frequency, bit width and encoding format of the left channel audio data and the right channel audio data.
[0079] Cutting module 2, for cutting the processed audio file by VAD method;
[0080] It comprises: initialization module 21, is used for initializing the parameter of VAD, and described parameter comprises frame length;
[0081] ...
Embodiment 3
[0087] This embodiment provides a computer-readable storage medium, on which a computer program is stored. When the program is executed by a processor, the steps of the method for annotating a telephone recording provided in Embodiment 1 are realized.
[0088] Wherein, the readable storage medium may more specifically include but not limited to: portable disk, hard disk, random access memory, read-only memory, erasable programmable read-only memory, optical storage device, magnetic storage device or any of the above-mentioned the right combination.
[0089] In a possible implementation manner, the present invention can also be implemented in the form of a program product, which includes program code, and when the program product runs on the terminal device, the program code is used to make the terminal device execute The steps of the telephone recording labeling method in embodiment 1.
[0090] Wherein, the program code for executing the present invention can be written in an...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


