Geographical annotation method, device and computer-readable storage medium
A technology based on region and calculation formula, applied in computing, instrument, multimedia data retrieval, etc., can solve the problems of uneven quality of outsourcing personnel, waste of manpower, large errors, etc., to achieve the effect of improving labeling accuracy and labeling efficiency, and reducing information inconsistency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0068] This embodiment provides a region labeling method, such as figure 1 As shown, the method includes:
[0069] S101. Acquire low-weight words and high-weight words in a preset data set, wherein a first weight value of a low-weight word is smaller than a first preset threshold, and a second weight value of a high-weight word is larger than a second preset threshold.
[0070] Wherein, in this step, the preset data set includes multiple documents, and each document has its corresponding regional label. The regional label is used to mark the regional information described by the current document information. The regional label can include multiple levels according to the level of the region. Regional labels, for example, when there are two levels of regional labels, the first-level regional labels can be provinces or municipalities directly under the central government, and the second-level regional labels are corresponding famous cities below the provinces. The first-level re...
Embodiment 2
[0138] This embodiment provides a region labeling method, such as Image 6 As shown, this embodiment will be specifically described below in combination with the practical application of the method.
[0139] Step 601. Obtain the low-weight words and high-weight words in the preset data set. That is, assuming that the preset data set includes multiple documents, each document has its corresponding regional label, and the regional label includes two first-level regional labels, namely "Beijing" and "Fujian", where "Fujian" includes two second-level The geographical labels are "Xiamen" and "Fuzhou". Here, the word segmentation operation is performed on the documents in the preset data set according to the preset word segmentation rules, and the high-weight words in the document with the region label "Beijing" are "Beijing" and "Tiananmen"; the region label is "Fujian" The high-weight words in the documents with the region label "Xiamen" are "Xiamen" and "Gulangyu", and the high...
Embodiment 3
[0147] This embodiment provides a region labeling device 300. It should be noted that the basic principles and technical effects of the region labeling device 300 provided in this embodiment are the same as those of the aforementioned corresponding method embodiments. For a brief description, this embodiment For the parts not mentioned in the example, refer to the corresponding content in the method embodiment. like Figure 7 As shown, the region labeling device 300 includes:
[0148] An acquisition module 301, configured to acquire low-weight words and high-weight words in the preset data set, wherein the first weight value of the low-weight word is less than the first preset threshold, and the second weight value of the high-weight word is greater than the second preset threshold.
[0149] The filtering module 302 is configured to filter out low-weight words in the text to be marked according to the low-weight words in the preset data set.
[0150] The extraction module 3...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com