Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method and device for locating voice content in batches

A voice content, batch technology, applied in voice analysis, digital memory information, instruments, etc., can solve the problems of audio positioning noise, slow audio processing, inaccurate positioning, etc., to improve applicability, improve convenience, The effect of improving the operation speed

Active Publication Date: 2020-12-22
成都启英泰伦科技有限公司
View PDF21 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. The corpus of different frequencies and different channels needs to be processed separately;
[0005] 2. The corpus processing needs to be configured with the same path template, and it cannot process audio with different formats stored in the directory;
[0006] 3. The audio processing speed is slow;
[0007] 4. Audio positioning is easily affected by background noise, resulting in inaccurate positioning;
[0008] 5. Lack of repeatability detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for locating voice content in batches
  • A method and device for locating voice content in batches

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Specific embodiments of the present invention will be further described in detail below.

[0036] The method for positioning speech content in batches of the present invention, comprises the steps:

[0037] S1. Play the prompt tone before the recording starts, start recording audio after the prompt tone playback is completed, record and save the audio file including the prompt tone, and record the path of the saved audio file in the path record file;

[0038] S2. traverse and read all the paths recorded in the path record file; when the path does not actually exist or the audio file cannot be found under the path, an error is reported and recorded in the generated error log;

[0039] When traversing the reading process, after the corpus audio file is found, the characteristics of the corpus audio file are read and processed into a monophonic audio file;

[0040] S3. The content of the first part of the audio file is detected and positioned for the first prompt tone, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for batch positioning of voice content. The method comprises the following steps: S1, playing a prompt tone before recording starts, and recording a path of stored audio files into a path recording file; S2, traversing and reading all paths recorded in the path recording file; S3, carrying out prompt tone detection and positioning on the content of the front-part time of audio files; S4, carrying out prompt tone detection on the audio file of which the prompt tone is detected and part of audio segments are deleted again, and if the audio file of the prompt toneis not detected again, storing the file until no new prompt tone is detected; and S5, after all the detected audio files are processed, ending. The invention further discloses a device for batch positioning of the voice content. The voice recording positions in all the audio files can be found out by accurately positioning the prompt tone positions, and the voice corpus file processing speed is increased.

Description

technical field [0001] The invention belongs to the technical field of voice intelligent recognition, relates to corpus recognition technology, in particular to a method and device for locating voice content in batches. Background technique [0002] In the existing field of artificial intelligence, speech recognition is becoming more and more mature, and most of the development of artificial intelligence is based on speech recognition and processing. However, due to the research and development of speech recognition, it needs to be based on a large amount of corpus. However, the normal recording process is not full of effective content, and there is a large amount of redundant information. The large amount of corpus and the complicated redundant information have become obstacles in the research and development of speech recognition. [0003] There are following disadvantages in the prior art to the method for corpus redundancy processing: [0004] 1. The corpus of differen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/03G10L25/51G11C7/16
CPCG10L25/03G10L25/51G11C7/16
Inventor 舒畅何云鹏许兵
Owner 成都启英泰伦科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products