Sentence boundary identification method in spoken language dialogue
A technology of boundary recognition and spoken language, applied in special data processing applications, instruments, electrical digital data processing, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0016] Various details involved in the technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings.
[0017] Preprocessing of spoken corpus
[0018] The acquired oral corpus cannot be directly used for training, but must undergo some preprocessing. Sentence boundary segmentation is to find the end point of the sentence in the continuous text, that is, to predict the occurrence position of those sentence-end punctuation, so as long as it is the end-sentence punctuation, there is no difference for segmentation. The main task of preprocessing is to replace the end-of-sentence punctuation in the corpus with a unified symbol. For the convenience of description, the replacement symbol in this article is represented by "SB"; for other punctuation other than the end-of-sentence punctuation, it must be deleted, because the phonetic It is impossible for the recognized text to contain such punctuation marks. For Chinese, t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com