Unlock instant, AI-driven research and patent intelligence for your innovation.

Intelligent track division method, device and system for monaural call recording

A call recording, monophonic technology, applied in speech analysis, neural learning methods, devices with speech recognition, etc., can solve problems such as low recognition accuracy and inability to distinguish speaker roles

Pending Publication Date: 2021-11-23
SHANGHAI QIYUE INFORMATION TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention aims to solve the problem that the call recording of the existing Internet phone and voice conference is usually a monophonic call recording, and the input ASR system cannot distinguish the speaker's role, and the recognition accuracy is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intelligent track division method, device and system for monaural call recording
  • Intelligent track division method, device and system for monaural call recording
  • Intelligent track division method, device and system for monaural call recording

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Exemplary embodiments of the present invention will now be described more fully with reference to the accompanying drawings, and although the exemplary embodiments may be embodied in many specific forms, these should not be construed as limited to the embodiments set forth herein. On the contrary, these exemplary embodiments are provided in order to make the content of the present invention more complete and more convenient to fully convey the inventive concept to those skilled in the art.

[0042] On the premise of complying with the technical concept of the present invention, the structure, performance, effect or other features described in a specific embodiment can be combined in any suitable way into one or more other embodiments.

[0043] During the introduction of specific embodiments, detailed descriptions of structures, performances, effects or other features are intended to enable those skilled in the art to fully understand the embodiments. However, it does no...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an intelligent track division method, device and system for monaural call recording, which are used for separating multi-person voices in monaural call recording, and the method comprises the following steps: preprocessing audio data of the call recording to obtain preprocessed audio data; performing frame attribute detection on the preprocessed audio data, and judging frame attribute information of each frame in the audio data; determining a voice starting point in the audio data according to the frame attribute information of each frame in the audio data, and deleting the audio data before the voice starting point to obtain pure voice audio data; and inputting the pure voice audio data into a track division model to obtain track division information of the pure voice audio data. According to the technical scheme, voice starting point detection is carried out firstly, interference is eliminated, only a pure voice part is reserved, and then actual speaker roles are separated out for subsequent ASR correct recognition.

Description

technical field [0001] The invention relates to the field of computer information processing, in particular to an intelligent tracking method, device and system for monophonic call recording. Background technique [0002] When recording a call on a traditional telephone, the call recording is usually in two channels, and it is easy to distinguish the vocal roles corresponding to different channels when restoring the content. With the development of Internet technology, Internet telephony and voice conferencing have gradually become popular. In order to reduce the requirements for network speed and improve call quality in Internet telephony and audio conferencing, monaural channels are often used. If call recording is also performed in mono recorded in the form of soundtracks. [0003] If such a monophonic recording is directly input into a speech recognition (ASR) system, since there is only one audio channel, the speaker of each utterance cannot be restored from the recogn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/26G10L15/02G10L15/05G10L25/30G10L25/45G10L25/87G10L21/028G06N3/04G06N3/08G06K9/62
CPCG10L15/26G10L15/02G10L15/05G10L25/30G10L25/45G10L25/87G10L21/028G06N3/08H04M2250/74G06N3/045G06F18/2135
Inventor 孔醍郑渊中朱小波钟雨崎叶峰
Owner SHANGHAI QIYUE INFORMATION TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More