Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice switching method and device, computer equipment and storage medium

A voice segmentation and computer program technology, applied in voice analysis, voice recognition, instruments, etc., can solve the problems of low voice file segmentation efficiency and low accuracy, so as to improve segmentation efficiency, improve accuracy, and avoid damage Effect

Pending Publication Date: 2018-11-20
PING AN TECH (SHENZHEN) CO LTD
View PDF4 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a voice segmentation method, device, computer equipment, and storage medium to solve the current problems of low efficiency and low accuracy in voice file segmentation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice switching method and device, computer equipment and storage medium
  • Voice switching method and device, computer equipment and storage medium
  • Voice switching method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0033] see figure 1 , figure 1 The application environment of the speech segmentation method provided by the embodiment of the present invention is shown. The voice segmentation method is applied in a voice recognition system for training a voice recognition model. The voice recognition system includes a server and a client, wherein the server and the client are connected through a network, and the user performs voice input through the client. The cl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice switching method and device, computer equipment and a storage medium. The method comprises steps that a voice file is obtained for preprocessing to obtain audio data; the audio data is normalized, the audio data is framed, and frame energy of each voice frame is calculated; if the frame energy of the voice frame is less than the preset frame energy threshold, the voice frame is marked as a silence frame; if that the number of consecutive silence frames is greater than the preset silence frame number threshold is detected, the consecutive silence frames are marked as a silent segment; segmentation frames of the voice file are determined based on the silence segment, the voice file is segmented through utilizing the segmentation frames, and a target file is obtained. The method is advantaged in that the frame energy is utilized as a segmentation criterion for voice segmentation, no manual intervention is required, complexity is low, silence and pauses in statements can be accurately identified, and segmentation efficiency is effectively improved while the voice file is accurately segmented.

Description

technical field [0001] The invention relates to the technical field of voice processing, in particular to a voice segmentation method, device, computer equipment and storage medium. Background technique [0002] In the field of speech processing, segmenting speech files is a key issue, because longer speech files consume a lot of system resources during the speech recognition conversion process, and the recognition accuracy is not high. After the speech file is segmented, the calculation amount of the speech recognition can be reduced and the recognition accuracy of the speech recognition system can be improved. At the same time, the accuracy of speech segmentation will directly affect the result of speech recognition. If there is an error in speech segmentation, there may be a large deviation in the recognition of the speech signal, and even the recognition of the speech signal cannot be realized. [0003] However, at present, when audio files need to be segmented by sente...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/04G10L15/06
CPCG10L15/04G10L15/063
Inventor 黄锦伦
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products