Model training method and device, voice separation method and device and electronic equipment

A model training and speech separation technology, applied to model training methods, speech separation methods, devices, and electronic equipment, that can address problems such as low speech separation accuracy.

Active Publication Date: 2021-05-18

SOUNDAI TECH CO LTD

Problems solved by technology

[0004] Embodiments of the present disclosure provide a model training method, a speech separation method, devices, and electronic equipment. They address a problem in the prior art: speech is segmented based on experience, so the resulting speech segments are likely to contain the speech of two or more speakers, which leads to low accuracy of speech separation.

Embodiment Construction

[0027] The technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present disclosure. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present disclosure.

[0028] In the embodiments of the present disclosure, electronic devices include but are not limited to mobile phones, tablet computers, notebook computers, palmtop computers, vehicle-mounted mobile terminals, wearable devices, and pedometers.

[0029] Referring to Figure 1, which is a schematic flowchart of a model training method provided by an embodiment of the present disclosure, the method includes the following steps:

[0030] Step 101...

Abstract

The invention provides a model training method and device, a voice separation method and device, and electronic equipment. The method comprises the following steps: the voice features of a sound signal are input into N pre-trained first neural network models to obtain N output results, where the N output results are the voice features of the speakers' voices corresponding to N pickup areas, separated from the sound signal, and N is an integer greater than 1; the voice features of the sound signal are input into a second neural network model, and the second neural network model is trained, with the loss function used for training determined based on the N output results. According to the embodiment of the invention, performing voice separation with the trained second neural network model can improve the accuracy of voice separation.
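The training flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the network architectures, the feature type, or the form of the loss. Here the N pre-trained first models are stood in for by fixed callables, the second model is a hypothetical set of linear maps (one per pickup area), and the loss is assumed to be a mean squared error against the N outputs. All function names (`teacher_targets`, `separation_loss`, `train_student`) are invented for illustration.

```python
import numpy as np


def teacher_targets(features, teachers):
    """Feed the same voice features to each of the N pre-trained first
    models and stack their N outputs: shape (N, frames, dims)."""
    return np.stack([teacher(features) for teacher in teachers], axis=0)


def separation_loss(student_out, targets):
    """Assumed loss: mean squared error between the second model's
    N outputs and the N outputs of the first models."""
    return float(np.mean((student_out - targets) ** 2))


def train_student(features, teachers, steps=200, lr=0.5, seed=0):
    """Train a toy second model (one linear map per pickup area) by
    gradient descent on a loss determined by the N first-model outputs."""
    targets = teacher_targets(features, teachers)
    n, _, dims = targets.shape
    rng = np.random.default_rng(seed)
    # Hypothetical student parameters: w[i] maps input features to the
    # estimated features of the speaker in pickup area i.
    w = rng.normal(scale=0.1, size=(n, features.shape[1], dims))
    losses = []
    for _ in range(steps):
        out = np.einsum("fd,nde->nfe", features, w)
        err = out - targets
        losses.append(separation_loss(out, targets))
        # Gradient of the mean squared error with respect to w.
        grad = 2.0 * np.einsum("fd,nfe->nde", features, err) / err.size
        w -= lr * grad
    return w, losses


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    feats = rng.normal(size=(50, 8))  # 50 frames, 8-dim features
    mats = [rng.normal(size=(8, 8)) for _ in range(3)]  # N = 3
    teachers = [lambda x, m=m: x @ m for m in mats]
    _, losses = train_student(feats, teachers)
    print(f"loss: {losses[0]:.4f} -> {losses[-1]:.4f}")
```

In this sketch, the N first models act only as target generators, so a single second model learns to produce all N per-area outputs. A real system would replace the linear maps with neural networks and use whatever loss the full description defines.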

Description

Technical field

[0001] The invention relates to the technical field of speech processing, in particular to a model training method, a speech separation method, a device and electronic equipment.

Background technique

[0002] Voice is the most natural, convenient and effective way for people to communicate with each other. The voice of interest can be obtained from a large number of voices through voice separation. In the process of speaker separation for speech, it is necessary to segment the speech, and then mark the speaker information for the segmented speech segments.

[0003] Currently, speech is segmented based on experience, and the segmented speech segments are likely to contain the speech of two or more speakers, resulting in a low accuracy rate of speech separation.

Contents of the invention

[0004] Embodiments of the present disclosure provide a model training method, speech separation method, device, and electronic equipment to solve the problem of segmentin...

Application Information

IPC(8): G10L21/028, G10L21/0208, G10L25/30, G06N3/08, G06N3/04

CPC: G10L21/028, G10L21/0208, G10L25/30, G06N3/08, G10L2021/02087, G06N3/045

Inventor: 陈孝良, 冯大航, 赵力, 常乐

Owner: SOUNDAI TECH CO LTD
