Audio data processing method, device and system

An audio data processing technology applied in the field of communication. It addresses problems such as unreliable recognition results, the strong influence of the collection channel on voiceprint features, and the difficulty of ensuring voiceprint recognition accuracy.

Pending Publication Date: 2022-04-12
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

Free conversation is common in video conferencing, so when audio data is cut into long segments, a single segment may contain the voices of several people, and the recognition result for that segment is unreliable.
[0004] The above solution presupposes that conference participants register their voiceprints in the voiceprint recognition system, but the channel used during voice collection has a great influence on voiceprint features. Voiceprints are generally pre-registered over a single channel, while the channels used at recognition time are diverse, so it is difficult to guarantee the accuracy of voiceprint recognition for sound collected over different channels.


Embodiment Construction

[0069] In order to make the purpose, technical solutions and advantages of the present application clearer, the embodiments of the present application are described below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of the present application, not all of them. Those skilled in the art know that, as new application scenarios emerge, the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.

[0070] The terms "first", "second" and the like in the specification and claims of the present application and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances, such that the embodiments described herein can be practiced in sequences other than those illustrated ...

Abstract

The embodiments of the invention provide an audio data processing method, device and system for classifying conference audio data by speaker identity. In the method, a conference record processing device obtains audio data of a first conference site, sound source orientation information corresponding to the audio data, and an identity recognition result. Additional domain information comprises the sound source orientation information corresponding to the audio data, and the identity recognition result indicates the correspondence between speaker identity information obtained by a portrait recognition method and the speaking time information of the speaker. The conference record processing device then performs voice segmentation on the audio data to obtain first segmented audio data. Finally, the conference record processing device determines the speaker corresponding to the first segmented audio data according to the voiceprint feature of the first segmented audio data and the identity recognition result.
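
The abstract does not say how the voiceprint feature and the identity recognition result are combined, so the following Python sketch is only one possible reading, not the patented method. It assumes that each segment carries a hypothetical voiceprint embedding and that the identity recognition result is a list of (speaker, start, end) records. A segment whose time span overlaps exactly one record takes that speaker directly; an ambiguous segment is compared by cosine similarity with the voiceprints of segments already labeled. All names here (`Segment`, `IdentityRecord`, `assign_speakers`) are illustrative.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Dict, List, Optional


@dataclass
class Segment:
    start: float             # segment start time, seconds
    end: float               # segment end time, seconds
    voiceprint: List[float]  # hypothetical voiceprint embedding


@dataclass
class IdentityRecord:
    speaker: str   # identity obtained by portrait recognition
    start: float   # speaking time start, seconds
    end: float     # speaking time end, seconds


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length of the time overlap between two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity of two vectors; 0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def assign_speakers(segments: List[Segment],
                    records: List[IdentityRecord]) -> List[Optional[str]]:
    # Candidate speakers per segment: everyone whose speaking time overlaps it.
    candidates = [
        sorted({r.speaker for r in records
                if overlap(s.start, s.end, r.start, r.end) > 0})
        for s in segments
    ]
    # Pass 1: a single candidate labels the segment directly.
    labels: List[Optional[str]] = [c[0] if len(c) == 1 else None
                                   for c in candidates]
    # Voiceprints of the segments labeled in pass 1 serve as references.
    references: Dict[str, List[List[float]]] = {}
    for seg, label in zip(segments, labels):
        if label is not None:
            references.setdefault(label, []).append(seg.voiceprint)
    # Pass 2: ambiguous segments go to the candidate with the closest voiceprint.
    for i, (seg, cands) in enumerate(zip(segments, candidates)):
        if labels[i] is not None:
            continue
        scored = [
            (max(cosine(seg.voiceprint, ref) for ref in references[c]), c)
            for c in cands if c in references
        ]
        if scored:
            labels[i] = max(scored, key=lambda t: t[0])[1]
    return labels


segments = [
    Segment(0.0, 5.0, [1.0, 0.0]),
    Segment(6.0, 10.0, [0.0, 1.0]),
    Segment(11.0, 14.0, [0.1, 0.9]),   # two people visible as speaking
]
records = [
    IdentityRecord("Alice", 0.0, 5.0),
    IdentityRecord("Bob", 6.0, 10.0),
    IdentityRecord("Alice", 11.0, 15.0),
    IdentityRecord("Bob", 11.0, 15.0),
]
labels = assign_speakers(segments, records)
```

Under these assumptions no pre-registered voiceprint is needed, because the reference voiceprints come from the same conference audio and therefore the same collection channel. A segment with no overlapping record, or with no labeled reference for any candidate, is left as `None` rather than guessed.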

Description

Technical field

[0001] The present application relates to the communication field, and in particular to an audio data processing method, device and system.

Background

[0002] With the rapid development of video conferencing technology, multi-point video conferences also need meeting minutes, just as minutes are produced manually in ordinary meetings. Existing products can automatically record the audio, video, data and other content of an entire conference while it is in progress. However, if the audio data is simply recorded, it cannot meet the usual need to sort meeting minutes by speaker when the key or specific content of the conference is reviewed.

[0003] During the video conference, if it can be determined that only one person is speaking in the entire audio file, the audio data of the entire file can be directly sent to the voiceprint recognition system for identif...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L17/22, G06V40/10, G06V40/16, G10L15/22, G10L15/24, G10L15/25, G10L15/26, H04N7/15
CPC: G06V40/10, G06V40/16, G10L15/24, G10L15/25, G10L15/26, G10L15/22, G10L17/22, H04N7/15, G10L17/00, G10L17/14
Inventor: 张鹏
Owner: HUAWEI TECH CO LTD