Voice data automatic labeling method and system for voice recognition
A speech data technology, applied in speech recognition, speech analysis, instruments, etc., that addresses the high cost, low efficiency, and long turnaround of manually labeling speech data, so as to reduce labor, improve labeling quality, and shorten the labeling cycle.
Examples
Embodiment 1
[0033] The present invention provides an automatic speech data labeling system for speech recognition, comprising a silence detection module 10, a volume screening module 20, a length screening module 30, a speech recognition module 40, a recognition result judging module 50, and a manual proofreading module 60;
[0034] The silence detection module 10 uses a silence detection algorithm to split each voice recording into a plurality of speech segments;
[0035] The volume screening module 20 applies a volume threshold, keeping the speech segments that meet the requirement and removing those that do not;
[0036] The length screening module 30 applies a speech duration threshold, keeping the speech segments that meet the requirement and removing those that do not;
[0037] The speech recognition module 40 uses a speech recognition engine to convert the speech into its corresponding text; at a later stage it will add the newl...
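The splitting step of the silence detection module 10 can be illustrated with a minimal sketch. The source does not specify the silence detection algorithm, so the frame-energy approach below is an assumption, as are the function name `split_on_silence` and the parameters `frame_len`, `silence_threshold`, and `min_silence_frames`.

```python
from typing import List, Sequence, Tuple


def split_on_silence(
    samples: Sequence[float],
    frame_len: int = 160,             # hypothetical frame size in samples
    silence_threshold: float = 0.01,  # hypothetical RMS level treated as silence
    min_silence_frames: int = 3,      # hypothetical pause length that ends a segment
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of the non-silent segments.

    Illustrative energy-based detector, not the patent's algorithm.
    Trailing samples that do not fill a whole frame are ignored.
    """
    segments: List[Tuple[int, int]] = []
    seg_start = None   # sample index where the current segment began
    silent_run = 0     # consecutive silent frames seen inside a segment
    n_frames = len(samples) // frame_len

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        is_silent = energy < silence_threshold ** 2

        if not is_silent:
            if seg_start is None:
                seg_start = i * frame_len
            silent_run = 0
        elif seg_start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the segment at the first frame of the pause.
                seg_end = (i + 1 - silent_run) * frame_len
                segments.append((seg_start, seg_end))
                seg_start = None
                silent_run = 0

    if seg_start is not None:
        # The recording ended mid-segment; drop any trailing silent frames.
        segments.append((seg_start, (n_frames - silent_run) * frame_len))
    return segments
```

Requiring several consecutive silent frames before closing a segment keeps short pauses inside a word or phrase from producing many tiny fragments, which the later length screening would otherwise have to discard.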
Embodiment 2
[0050] The present invention provides an automatic speech data labeling system for speech recognition, comprising a silence detection module 10, a volume screening module 20, a length screening module 30, a speech recognition module 40, a recognition result judging module 50, and a manual proofreading module 60;
[0051] The silence detection module 10 uses a silence detection algorithm to split each voice recording into a plurality of speech segments;
[0052] The volume screening module 20 applies a volume threshold, keeping the speech segments that meet the requirement and removing those that do not;
[0053] The length screening module 30 applies a speech duration threshold, keeping the speech segments that meet the requirement and removing those that do not;
[0054] The speech recognition module 40 uses a speech recognition engine to convert the speech into its corresponding text; at a later stage it will add the newl...
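The two screening steps (modules 20 and 30) amount to threshold filters over the segments. The sketch below is one possible reading, not the patented implementation: the source gives no threshold values or volume measure, so the RMS level in dBFS, the default limits, and the names `Segment`, `rms_dbfs`, and `screen_segments` are all assumptions chosen for illustration.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    samples: List[float]  # amplitudes normalized to [-1.0, 1.0] (assumption)
    sample_rate: int      # samples per second


def rms_dbfs(samples: List[float]) -> float:
    """RMS level in dB relative to full scale; -inf for empty or all-zero input."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(x * x for x in samples) / len(samples))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")


def screen_segments(
    segments: List[Segment],
    min_dbfs: float = -40.0,  # hypothetical volume threshold
    min_seconds: float = 0.5,  # hypothetical lower duration limit
    max_seconds: float = 15.0,  # hypothetical upper duration limit
) -> List[Segment]:
    """Keep segments that pass the volume check, then the duration check."""
    kept = []
    for seg in segments:
        # Volume screening: drop segments that are too quiet.
        if rms_dbfs(seg.samples) < min_dbfs:
            continue
        # Length screening: drop segments that are too short or too long.
        duration = len(seg.samples) / seg.sample_rate
        if not (min_seconds <= duration <= max_seconds):
            continue
        kept.append(seg)
    return kept
```

Only the segments returned by `screen_segments` would be passed on to the speech recognition module 40; whether the duration threshold in the source is one-sided or two-sided is not stated, so the two-sided range here is a guess.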