Speaker separation method and device, electronic equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A separation method and speaker technology, applied in speech analysis, character and pattern recognition, instruments, etc., can solve the problem of low speaker separation accuracy, achieve accurate speaker separation, and avoid the effect of voiceprint feature differences.

Pending Publication Date: 2022-03-15

IFLYTEK CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present invention provides a speaker separation method, device, electronic equipment and storage medium to solve the defect of low speaker separation accuracy in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, the described embodiments are part of the embodiments of the present invention , but not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0056] At present, the traditional speaker separation method mostly detects the endpoint of the audio file, retains the audio file corresponding to the speech segment, and then performs feature extraction on the audio file to obtain multiple voiceprint features, and clusters all voiceprint features. Then compare each type of voiceprint features with the voiceprint features in the voiceprint f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a speaker separation method and device, electronic equipment and a storage medium, and the method comprises the steps: carrying out the detection of a person in a video frame, and obtaining the position of a person in the video frame; performing sound source positioning on the audio segments corresponding to the video frames to obtain sound source positions; and based on the relative position relationship between the personnel position and the sound source position, carrying out speaker separation on the audio band. According to the speaker separation method and device, the electronic equipment and the storage medium provided by the invention, the influence of environmental noise and the voiceprint feature difference degree of different roles of speakers can be avoided, and then speaker separation can be accurately carried out on the audio band based on the relative position relationship between the speaker position and the sound source position.

Description

technical field [0001] The invention relates to the field of intelligent voice technology, in particular to a speaker separation method, device, electronic equipment and storage medium. Background technique [0002] Speaker separation refers to dividing the audio data belonging to each speaker in an audio file, merging the audio data of the same speaker into one category, and separating the audio data of different speakers. [0003] At present, speaker separation is mostly achieved by extracting the voiceprint features of each speaker in the audio file, and comparing the voiceprint features of each speaker with the voiceprint features in the voiceprint feature library. However, the audio file may contain multiple voiceprint features corresponding to different speakers, and the voiceprint features of some speakers are not stored in the voiceprint feature database, which leads to the problem of low speaker separation accuracy. Contents of the invention [0004] The inventio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/02G10L21/0216G10L21/0272G06V40/10

CPCG10L21/0272G10L17/02G10L21/0216G10L2021/02166

Inventor 刘文超殷保才李渊强程虎

Owner IFLYTEK CO LTD

Speaker separation method and device, electronic equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology