Method and device for point of interest data error type positioning and duplicate identification

A technology for locating error types in point of interest (POI) data and identifying duplicates, applied in the field of data quality control. It addresses the problems that existing methods demand highly complete field information and judge POI duplication with low accuracy, and it achieves less manual operation, low operation and maintenance costs, and easy operation and learning.

Active Publication Date: 2021-05-04
LIAONING MOBILE COMM
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

Existing similarity calculation algorithms require highly complete POI field information to determine whether POI data is duplicated, yet duplicate POI data is often caused precisely by incomplete POI field information. As a result, existing schemes determine whether POI data is duplicated with low accuracy.


Examples

Embodiment 1

[0110] In Embodiment 1 of the present invention, the basic field includes a name field. The detailed processing flow of the POI data error type positioning method in this scenario is shown in Figure 2 and includes the following steps:

[0111] Step 201: perform word segmentation on the name field of the POI data pair, and obtain the layer number of each word segment that forms the name field;

[0112] Specifically, according to the preset hierarchical parameters of the POI name field, word segmentation is performed on the name field of the POI data pair, and the layer number of each word segment that forms the name field is obtained. The POI data pair consists of the POI data input by the user and the POI raw data corresponding to that input.

[0113] Here, the hierarchical parameters of the POI name field can be set according to the electronic map industry classification standards and ac...
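
The layered word segmentation of Step 201 can be illustrated with a minimal Python sketch. The lexicon `NAME_LAYERS`, its entries and layer numbers, the `unknown_layer` default, and the function name `segment_with_layers` are all hypothetical; the matching strategy (longest phrase first over whitespace-separated words) is an assumption chosen for illustration and is not taken from the patent text.

```python
from typing import Dict, List, Tuple

# Hypothetical hierarchical parameters for a POI name field.
# The entries and layer numbers are illustrative assumptions only.
NAME_LAYERS: Dict[str, int] = {
    "shenyang": 0,
    "heping": 1,
    "middle school": 3,
}


def segment_with_layers(
    text: str, lexicon: Dict[str, int], unknown_layer: int = -1
) -> List[Tuple[str, int]]:
    """Split a field into word segments and attach a layer number to each.

    Uses longest-match-first lookup over whitespace-separated words.
    Words not found in the (non-empty) lexicon get `unknown_layer`.
    """
    words = text.lower().split()
    max_words = max(len(key.split()) for key in lexicon)
    result: List[Tuple[str, int]] = []
    i = 0
    while i < len(words):
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(max_words, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            if phrase in lexicon:
                result.append((phrase, lexicon[phrase]))
                i += n
                break
        else:
            # No lexicon entry starts at this word.
            result.append((words[i], unknown_layer))
            i += 1
    return result


segments = segment_with_layers("Shenyang Heping Middle School", NAME_LAYERS)
```

Applying the same function to both records of a POI data pair yields two layered segment lists that later steps can compare.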

Embodiment 2

[0139] In Embodiment 2 of the present invention, the basic field includes an address field. The detailed processing flow of the POI data error type positioning method in this scenario is shown in Figure 3 and includes the following steps:

[0140] Step 301: perform word segmentation on the address field of the POI data pair, and obtain the layer number of each word segment that forms the address field;

[0141] Specifically, according to the preset hierarchical parameters of the POI address field, word segmentation is performed on the address field of the POI data pair, and the layer number of each word segment that forms the address field is obtained. The POI data pair consists of the POI data input by the user and the POI raw data corresponding to that input.

[0142] Hierarchical parameters of the POI address field:
Layer 0: city name
Layer 1: urban area
Layer 6: former name of the building
Layer 7: building...
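
Once both address fields of a POI data pair carry layer numbers (for instance layer 0 for the city name, layer 1 for the urban area, layer 6 for the former name of the building), the two records can be lined up layer by layer. The sketch below is an illustrative assumption and is not the comparison procedure of the patent. The function names `by_layer` and `differing_layers` and the sample tokens are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Segments = List[Tuple[str, int]]  # (word segment, layer number)


def by_layer(segments: Segments) -> Dict[int, List[str]]:
    """Group word segments by their layer number, keeping input order."""
    layers: Dict[int, List[str]] = {}
    for token, layer in segments:
        layers.setdefault(layer, []).append(token)
    return layers


def differing_layers(
    user: Segments, raw: Segments
) -> Dict[int, Tuple[Optional[List[str]], Optional[List[str]]]]:
    """Return the layers where the two records disagree.

    Each value is (user tokens, raw tokens); None means the layer is
    missing from that record, i.e. the field is incomplete there.
    """
    u, r = by_layer(user), by_layer(raw)
    diffs = {}
    for layer in sorted(set(u) | set(r)):
        if u.get(layer) != r.get(layer):
            diffs[layer] = (u.get(layer), r.get(layer))
    return diffs


# Hypothetical POI data pair: the user input lacks the layer 6 segment.
user_input = [("shenyang", 0), ("heping district", 1)]
raw_data = [("shenyang", 0), ("heping district", 1), ("old trade building", 6)]
diffs = differing_layers(user_input, raw_data)
```

Reporting missing layers separately from mismatched ones matters here because, as the problem statement notes, duplicates often arise from incomplete field information rather than from conflicting values.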

Abstract

The invention discloses a method for locating the error type of point of interest (POI) data. According to the preset hierarchical parameters of the POI basic field, word segmentation is performed on the basic field of a POI data pair, and the layer number of each word segment that forms the basic field is obtained. The POI data pair consists of the POI data input by the user and the POI raw data corresponding to that input, and the basic field includes at least one subfield. According to the layer number of each word segment that forms the subfield, the similarity value between the subfields of the POI data pair is calculated. According to the preset error thresholds corresponding to the different error types of the subfield, when the similarity value between the subfields of the POI data pair is determined to fall within an error threshold, the error type of the subfield of the POI raw data is located. The invention also discloses a device for locating the error type of POI data, and a method and a device for identifying duplicate POI data.
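
The last two steps of the abstract, computing a similarity value from layered word segments and mapping it to an error type through preset thresholds, can be sketched as follows. The similarity formula (share of layers whose segments agree), the error type names, and the threshold ranges in `ERROR_THRESHOLDS` are assumptions made for illustration. The text available here does not give the patent's actual formula or threshold values.

```python
from typing import List, Optional, Tuple

Segments = List[Tuple[str, int]]  # (word segment, layer number)


def layer_similarity(user: Segments, raw: Segments) -> float:
    """Assumed similarity: fraction of layers whose segments agree.

    If a layer holds several segments, only the last one is compared.
    """
    u = {layer: token for token, layer in user}
    r = {layer: token for token, layer in raw}
    layers = set(u) | set(r)
    if not layers:
        return 1.0  # two empty subfields are treated as identical
    same = sum(1 for layer in layers if u.get(layer) == r.get(layer))
    return same / len(layers)


# Hypothetical error types with half-open similarity ranges [low, high).
ERROR_THRESHOLDS = [
    ("wrong_value", 0.0, 0.5),
    ("incomplete_value", 0.5, 1.0),
]


def locate_error_type(
    similarity: float, thresholds=ERROR_THRESHOLDS
) -> Optional[str]:
    """Return the error type whose range contains the similarity value."""
    for name, low, high in thresholds:
        if low <= similarity < high:
            return name
    return None  # no range matched, e.g. similarity == 1.0


user_name = [("shenyang", 0), ("heping", 1), ("middle school", 3)]
raw_name = [("shenyang", 0), ("middle school", 3)]
error_type = locate_error_type(layer_similarity(user_name, raw_name))
```

Keeping the thresholds in a data table rather than in branching code reflects the abstract's description of thresholds being set per subfield and per error type, so each subfield could carry its own table.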

Description

Technical field

[0001] The present invention relates to the field of data quality control, and in particular to a method and device for point of interest data error type positioning and duplicate identification.

Background

[0002] A point of interest (POI) is any geographical object that can be abstracted as a point in a geographic information system, especially a geographical entity closely related to people's lives, such as a school, bank, or gas station. The main purpose of a POI is to improve the ability to describe and query the location of things or events by describing their addresses, thereby improving the accuracy and speed of geographic positioning. To provide users with products that meet their personalized service needs, POI data providers such as Baidu Maps and Dianping establish their own POI databases. A POI database stores a large amount of POI data, and each POI record contains POI information. Aspec...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/29, G06K9/62
CPC: G06F16/29, G06F18/22
Inventor: 王世民
Owner: LIAONING MOBILE COMM