Method and device for generating a word segmentation training set
A technology concerning training sets and word segmentation, applied in special data processing applications, instruments, electrical digital data processing, etc. It can solve problems such as high cost in money and time.
Embodiment Construction
[0019] Embodiments of the present invention are described in detail below, and examples of them are shown in the drawings, in which the same or similar reference numerals denote, throughout, the same or similar modules or modules having the same or similar functions. The embodiments described below with reference to the figures are exemplary; they serve only to explain the present invention and should not be construed as limiting it. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents falling within the spirit and scope of the appended claims.
[0020] Figure 1 is a schematic flow chart of a method for generating a word segmentation training set according to an embodiment of the present invention. The method includes:
[0021] S11: Obtain a training corpus, segment the same training corpus with each of several different tokenizers, and obtain the word segmentation result corresponding to each tokenizer....
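Step S11 can be illustrated with a minimal sketch that runs several tokenizers over one corpus and keeps each tokenizer's output separately. The three tokenizers, the function name segment_with_all, and the sample sentence are hypothetical stand-ins chosen for illustration; the source does not name the tokenizers it uses, and the steps that follow S11 are not covered here.

```python
import re
from typing import Callable, Dict, List

# A tokenizer maps one sentence to a list of tokens.
Tokenizer = Callable[[str], List[str]]


def whitespace_tokenizer(sentence: str) -> List[str]:
    """Hypothetical tokenizer 1: split on runs of whitespace."""
    return sentence.split()


def char_tokenizer(sentence: str) -> List[str]:
    """Hypothetical tokenizer 2: one token per non-space character."""
    return [ch for ch in sentence if not ch.isspace()]


def regex_tokenizer(sentence: str) -> List[str]:
    """Hypothetical tokenizer 3: word runs and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def segment_with_all(
    corpus: List[str], tokenizers: Dict[str, Tokenizer]
) -> Dict[str, List[List[str]]]:
    """Segment the same corpus with every tokenizer (as in S11).

    Returns a mapping from tokenizer name to that tokenizer's
    segmentation of each sentence, in corpus order.
    """
    return {
        name: [tokenize(sentence) for sentence in corpus]
        for name, tokenize in tokenizers.items()
    }


# Assumed sample corpus, for illustration only.
corpus = ["word segmentation, done twice"]
results = segment_with_all(
    corpus,
    {
        "whitespace": whitespace_tokenizer,
        "char": char_tokenizer,
        "regex": regex_tokenizer,
    },
)

for name, segmented in results.items():
    print(name, segmented)
```

Keeping the results keyed by tokenizer name preserves which tokenizer produced which segmentation, so the outputs for the same sentence can be lined up by index later.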