Word-tag-based word labeling method and device, server, and storage medium
A word tagging technology, applied in the computer field, that addresses low tagging efficiency and accuracy, the lack of guidance in the division process, and the limited classification of words to be tagged, so as to reduce manpower consumption, improve accuracy and recall, and improve efficiency.
Examples
Embodiment 1
[0025] Figure 1 shows the implementation flow of the word-tag-based word tagging method provided by Embodiment 1 of the present invention. For convenience of description, only the parts related to this embodiment are shown, detailed as follows:
[0026] In step S101, the input text document is searched for words to be tagged.
[0027] In this embodiment, the words to be tagged are new words that need tagging, such as "嘎舞", "freestyle", and similar words that appear on new network media such as Weibo and Facebook. Collecting data from such media yields the text documents used as input. As an example, original data is collected on the Weibo platform, and the portion of the original data with the latest publishing time is set as the input text document.
[0028] In the embodiment of the present invention, word segmentation processing can be performed ...
Embodiment 2
[0036] Figure 2 shows the implementation flow of the word classifier training process in the word-tag-based word tagging method provided by Embodiment 2 of the present invention. For convenience of description, only the parts related to this embodiment are shown, detailed as follows:
[0037] In step S201, the pre-built training data set is searched for sample words.
[0038] In this embodiment, word segmentation can be performed on the training data set. The segmented training data set is then searched for words whose occurrence frequency exceeds a preset frequency threshold and which do not appear in the known thesaurus; these words are set as sample words, that is, the new words in the training data set. As an example, original data is collected on the Weibo platform, and the portion of the original data whose release time falls in the middle period is set as the traini...
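The sample-word search described in [0038] can be illustrated with a minimal sketch. It assumes the text has already been segmented into token lists (the segmentation tool is not named in the source), and the names `find_sample_words`, `segmented_docs`, `known_thesaurus`, and `freq_threshold` are hypothetical, chosen only for illustration. It is not the patent's implementation.

```python
from collections import Counter
from typing import Iterable, List, Set


def find_sample_words(
    segmented_docs: Iterable[List[str]],
    known_thesaurus: Set[str],
    freq_threshold: int,
) -> List[str]:
    """Return tokens that occur more than freq_threshold times
    and are absent from known_thesaurus (hypothetical helper)."""
    # Count every token across all segmented documents.
    counts = Counter(token for doc in segmented_docs for token in doc)
    # Keep tokens above the threshold that the thesaurus does not contain.
    candidates = [
        word
        for word, n in counts.items()
        if n > freq_threshold and word not in known_thesaurus
    ]
    # Sort by descending frequency, then alphabetically, for a stable result.
    return sorted(candidates, key=lambda w: (-counts[w], w))


# Toy, already-segmented data (made up for illustration only).
docs = [
    ["this", "freestyle", "is", "great"],
    ["freestyle", "battle", "tonight"],
    ["great", "freestyle", "battle"],
]
thesaurus = {"this", "is", "great", "tonight", "battle"}
print(find_sample_words(docs, thesaurus, freq_threshold=2))
```

In this toy data only "freestyle" both exceeds the threshold and is missing from the thesaurus, so it is the single candidate returned. The same filter could presumably serve the search for words to be tagged in step S101, applied to the input text document instead of the training data set.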
Embodiment 3
[0069] Figure 3 shows the structure of the word labeling device provided by Embodiment 3 of the present invention. For convenience of description, only the parts related to this embodiment are shown, including:
[0070] The word search unit 31 is configured to search for words to be tagged in the input text document.
[0071] In this embodiment, the words to be tagged are new words that need tagging, such as "嘎舞", "freestyle", and similar words that appear on new network media such as Weibo and Facebook. Collecting data from such media yields the text documents used as input. As an example, original data is collected on the Weibo platform, and the portion of the original data with the latest publishing time is set as the input text document.
[0072] In the embodiment of the present invention, word segmentation processing can be performed on the text in t...