Voice segmentation model training method and device and electronic equipment

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of segmentation model and training method, which is applied in speech analysis, speech recognition, instruments, etc., and can solve the problems of complex speech signal features and low speech segmentation accuracy targets.

Active Publication Date: 2020-06-19

SOUNDAI TECH CO LTD

View PDF6 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, the characteristics of speech signals are relatively complex, so the accuracy target of speech segmentation is still relatively low, which has become an urgent problem to be solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0084] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0085] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders, and / or executed in parallel. Additionally, method embodiments may include additional steps and / or omit performing illustrated steps. The scope of the present disclosure is not limited in this respect. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the disclosure discloses a voice segmentation model training method and device, electronic equipment and a computer readable storage medium. The training method of the voice segmentation model comprises the following steps: acquiring a voice feature map of a sample voice file; acquiring annotation information of a target voice in the voice feature map; initializing model parameters of a voice segmentation model; inputting the voice feature map into the voice segmentation model to obtain prediction information of a target voice output by the voice segmentation model; calculating an error between the prediction information and the annotation information according to a target function; updating parameters of the voice segmentation model according to the error; and inputtingthe voice feature map into the voice segmentation model after parameter updating to iterate the parameter updating process until the error is less than a first threshold. According to the method, thespeech segmentation model is trained through the speech feature image, and a technical problem of inaccurate speech segmentation caused by complex speech signals in the prior art is solved.

Description

technical field [0001] The present disclosure relates to the field of speech segmentation, and in particular to a training method, device, electronic equipment and computer-readable storage medium of a speech segmentation model. Background technique [0002] As a means of human-computer interaction, speech recognition technology is of great significance in liberating human hands. With the emergence of various smart speakers, voice interaction has become the new value of the Internet portal. More and more smart devices have joined the trend of voice recognition and become a bridge for communication between people and devices. Voice segmentation technology is a branch of speech recognition technology, which is used to divide a piece of voice into different categories according to time periods, such as segmenting the voices of non-simultaneous speakers in a piece of voice, voice endpoint detection and wake-up word alignment etc., all belong to the category of speech segmentati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/06G10L15/04

CPCG10L15/063G10L15/04Y02T10/40

Inventor王超陈孝良冯大航

OwnerSOUNDAI TECH CO LTD

Voice segmentation model training method and device and electronic equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology