Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Intelligent voice axis cutting method, information data processing terminal and computer program

A computer program and intelligent speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of inaccurate segmentation of speech segmentation technology, high requirements for manual segmentation talents, and large workload for manual segmentation. Achieve the effect of solving background noise problems, improving accuracy, and eliminating noise interference

Pending Publication Date: 2018-07-17
GLOBAL TONE COMM TECH
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] (1) Manual segmentation has higher requirements for talents
[0013] (2) Manual segmentation is inefficient and unable to meet market demand
[0014] (3) The existing voice segmentation technology cannot be accurately segmented when the signal-to-noise ratio is not high
[0016] With the high-performance requirements of the speech recognition system in recent years, it is necessary to intelligently recognize large-scale corpus to meet the video demand of geometric order growth, and the workload of manual segmentation is huge and the efficiency is low.
Using machine automatic segmentation technology can greatly improve the segmentation efficiency, but the traditional automatic segmentation technology cannot meet the existing needs in terms of accuracy. Therefore, it is very important to find a fast and efficient voice automatic segmentation method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intelligent voice axis cutting method, information data processing terminal and computer program
  • Intelligent voice axis cutting method, information data processing terminal and computer program
  • Intelligent voice axis cutting method, information data processing terminal and computer program

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0065] With the rapid development of artificial intelligence technology, the application of artificial intelligence technology in voice segmentation has become a hot spot.

[0066] like figure 1 As shown, the method for intelligent voice cutting axis provided by the embodiment of the present invention includes the following steps:

[0067] S101: Use a large amount of unlabeled data to initialize model parameters through an unsupervised learning algorithm, which is called pre-training (Pre-training);

[0068] S102: Using a small amount of labeled data, using a traditional neural network learning algorithm (such as the BP ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of computer software, and discloses an intelligent voice axis cutting method, an information data processing terminal and a computer program. The intelligent voice axis cutting method comprises the following steps: pre-training, namely, carrying out initialization on model parameters through a non-supervision learning algorithm by using abundant data not labelled; and model fine-tuning, namely, learning the model parameters by adopting the traditional neural network learning algorithm and using a small amount of labelled data. According to the technical scheme, effective voice segments are obtained through the windowing framing technology, continuous and stable voice signals are obtained, and the identification errors are reduced; voice signalsare effectively enhanced, the capacity of distinguishing the useless signals is enhanced, the noise interference is eliminated, the errors are reduced, and the voice recognition accuracy rate can be improved by 50%; the problem of background noises can be effectively solved, and the voice recognition accuracy rate is improved to 93%. For acoustic feature extraction, voice feature vector sequencescan be extracted according to voice features similar to that of human beings, background noises and channel distortion are eliminated, and the voice recognition accuracy rate is improved to 94.7%.

Description

technical field [0001] The invention belongs to the technical field of computer software, and in particular relates to an intelligent voice cutting method, an information data processing terminal and a computer program. Background technique [0002] At present, the existing technology commonly used in the industry is as follows : Language is the most convenient and fastest way for human beings to exchange information. With the rapid development of modern network technology, video traffic has gradually become the mainstream of the modern network world, and at the same time, the forms of video transmission tend to be diversified. Video is composed of images and voices. With the support of more and more advanced technologies today, voice recognition technology has become a research hotspot. Speech segmentation is the first pass that speech recognition technology must pass through. Speech segmentation refers to the use of computer programs to automatically segment the basic uni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/04G10L15/06G10L15/28G10L21/0208G10L25/24G10L25/30G10L25/93
CPCG10L15/02G10L15/04G10L15/063G10L15/28G10L21/0208G10L25/24G10L25/30G10L25/93G10L2015/0631
Inventor 孙宏亮程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products