An address resolution method and a device based on a word segmentation algorithm

A technology of address analysis and word segmentation algorithm, which is applied in the field of geographic information services, can solve the problems of long time-consuming analysis and low analysis accuracy, and achieve the effect of increasing analysis accuracy and strong horizontal scalability

Inactive Publication Date: 2019-01-04
成都映潮科技股份有限公司
View PDF12 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to overcome the deficiencies of the prior art and provide an address resolution method and device based on a word segmentation algorithm to solve the problems of long time-consuming analysis and low resolution accuracy existing in the short text area analysis function in the current map tool

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An address resolution method and a device based on a word segmentation algorithm
  • An address resolution method and a device based on a word segmentation algorithm
  • An address resolution method and a device based on a word segmentation algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0023] Such as figure 1 As shown, an address resolution method based on word segmentation algorithm, the method includes:

[0024] Collect the administrative division data of the National Bureau of Statistics, code the geographical names, and establish a regional cascade relationship; the geographical cascade relationship is a four-level cascade relationship, specifically: provinces / municipalities, cities, districts / counties, and streets.

[0025] In this embodiment, the coding can reflect the hierarchical relationship of the region. For example, the id of Beijing is 1101, then find out the id (11) of its superior region through its id, and then find the name of the superior id (Beijing).

[0026] Construct a regional decision tree, take the country as the root node of the regional decision tree, and the provinces / municipalities as its subordinate nodes, and recursively create the child nodes and leaf nodes of the regional decision tree according to the regional cascading rela...

Embodiment 2

[0040] Such as figure 2 Shown, a kind of address analysis device based on word segmentation algorithm, this device comprises:

[0041] The encoding module is used to collect the administrative division data of the National Bureau of Statistics, encode the names of regions, and establish regional cascading relationships;

[0042] Build a regional decision tree module, which is used to build a regional decision tree, with the country as the root node of the regional decision tree, provinces and municipalities as its subordinate nodes, and recursively create the sub-nodes and leaf nodes of the regional decision tree according to the regional cascading relationship;

[0043] Build a custom region dictionary module, which is used to build a custom region dictionary according to the full name and abbreviation of the region and load it into the database;

[0044] The word segmentation processing module is used to obtain regional information, and uses a word segmentation algorithm i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an address analysis method and a device based on a word segmentation algorithm. The method comprises the following steps: collecting the administrative division data of the National Bureau of Statistics and storing the data in a database; encoding the region name; establishing a region cascade relationship; setting up a region cascade relationship; setting up a region cascade relationship; constructing the regional decision tree, taking the country as the root node of the regional decision tree, province/municipality as its lower node, and creating the sub-node and leafnode of the regional decision tree recursively according to the regional cascade relationship; according to the full name and abbreviation of the region, the user-defined region dictionary being built and loaded into the database; acquiring regional information, segmenting the regional information with a word segmentation algorithm and a user-defined regional dictionary to obtain a regional phrase; according to the order of regional phrases after word segmentation and regional decision tree, address information being obtained. The invention solves the problems of low resolution accuracy and long time consumption of the short text region analysis function provided in the current map tool.

Description

technical field [0001] The invention relates to the technical field of geographic information services, in particular to an address resolution method and device based on a word segmentation algorithm. Background technique [0002] At present, many problems will be encountered when extracting detailed and regular regional information such as provinces / municipalities, cities, counties, streets, etc. from the irregular address information filled in by users. For example, the user only fills in the street information. Knowing its specific location, in this case, it needs to be supplemented with a standard geographical address to know the address of which province / municipality and which city it is. [0003] At present, all the functions that can perform short text geographical analysis are built-in functions in the map tool, but its analysis efficiency is low. It can only analyze 2000 address information in an hour, and the number of requests is limited. It takes a lot of time to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/29G06F16/33G06F16/387
Inventor 余刚
Owner 成都映潮科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products