Human voice segmentation method and device

A technology of human voice and preset time, which is applied in speech analysis, speech recognition, instruments, etc. It can solve the problems of long time consumption and low efficiency, and achieve the effect of reducing workload, improving work efficiency and solving low efficiency

Active Publication Date: 2018-04-27
SPEAKIN TECH CO LTD
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This technology uses an algorithm called Probability Distribution Clusters (PDCluster) for performing clustered speech processing tasks without requiring any models or assumptions about each section being analyzed. By doing this process faster than current methods, it saves effort by reducing manual labor required while still achieving accurate results.

Problems solved by technology

This patented problem addressed by the present inventors relates to an efficient way to label sound changes between different groups within conversations (Voiceover) based on their spoken words alone rather than relying solely upon previous annotations made manually).

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human voice segmentation method and device
  • Human voice segmentation method and device
  • Human voice segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The embodiment of the present invention provides a human voice segmentation method and device, which solves the technical problems of low efficiency and long time consumption of the traditional human voice segmentation system.

[0040] In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the following The described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0041] see figure 1 , the present invention provides an embodiment of a human voice segmentation method...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention provides a human voice segmentation method and a human voice segmentation device. The human voice segmentation method comprises the steps of: extracting feature vectorsfrom audio data; performing voice activation monitoring on the audio data, and labeling muted segments and voice segments separately; extracting the voice segments according to labels, segmenting thevoice segments according to a predetermined time length, performing clustering operation on the feature vectors in the segmented voice segments by adopting a probability distribution clustering method, and outputting corresponding clustering labels; and arranging the voice segments corresponding to the different clustering labels according to a time sequence, and outputting the voice segments withdifferent clustering labels after arrangement and merging. The human voice segmentation method adopts the probability distribution clustering method for performing clustering operation, can perform rapid clustering on the feature vectors of voice without modeling the voice segments, adds the voice activation monitoring, only processes the voice segments, improves the working efficiency, and solves the technical problem of low efficiency and long time consumption of the traditional human voice segmentation system.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner SPEAKIN TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products