Check patentability & draft patents in minutes with Patsnap Eureka AI!

Address matching algorithm based on deep learning model

A deep learning and address matching technology, applied in biological neural network models, computing, computer components, etc., can solve problems such as difficulty in accurately identifying related relationships, inability to accurately identify the same pointing relationship, and lack of address structure.

Pending Publication Date: 2020-11-03
WUHAN UNIV
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the complexity of the expression of Chinese addresses, most addresses only have a certain spatial orientation in semantics, but lack a standard address structure
Traditional address matching methods mainly focus on the matching relationship between words in address texts, and cannot accurately identify the same pointing relationship of different addresses in different expressions: for example, "2502, Lane 1, Longtengge, Fuyong" and "Baishixia Community, Fuyong Street, Baoan No. 2, Lane 1, Longteng Pavilion, Defeng Road" refers to the same geographical location, but there is not much overlap in literal expression, and it is difficult to accurately identify the relevant relationship by directly matching words
Therefore, in this context, the traditional address matching method is no longer applicable to match multi-source heterogeneous massive address data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Address matching algorithm based on deep learning model
  • Address matching algorithm based on deep learning model
  • Address matching algorithm based on deep learning model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0060] The embodiment of the present invention is based on the address matching algorithm of the deep learning model, such as figure 1 , including the following steps:

[0061] Step 1. Data preprocessing. Perform preprocessing work on the corpus, such as removing duplicate addresses in the corpus, removing spaces and special symbols, and modifying typos in the corpus.

[0062] The corpus used in the embodiment of the present invention is a standard address database. The data set used for address text semantic matching contains 84,474 pairs of tagged address data, and its data structure is shown ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an address matching algorithm based on a deep learning model. The address matching algorithm comprises the following steps: firstly, carrying out word segmentation on an address in a corpus by utilizing a Jieba Chinese word segmentation library; carrying out address word vector training by utilizing a word vector (Word2vec) model; and finally, carrying out address text semantic similarity calculation by utilizing an enhanced sequence inference model (ESIM), and outputting a matching result. The method is different from a traditional address matching algorithm which focuses on directly performing similarity calculation and text matching by utilizing literal overlapping of a matching address, and the algorithm focuses on researching the semantic similarity degree ofan address text and completing a matching task on the basis of the semantic similarity degree. A deep learning algorithm suitable for current massive multi-source heterogeneous address data matching tasks is provided.

Description

technical field [0001] The invention relates to the field of computer deep learning, in particular to a deep learning method for address matching. Background technique [0002] With the rapid development of information technology, the spatio-temporal data generated by various industries such as medical care, communication, and logistics are increasing day by day. According to statistics, more than 80% of human activities and urban information are related to geographic spatial location, and its main link is address information. Therefore, addresses are playing an increasingly important role in people's lives. Existing industry data (such as medical care, public security, etc.) usually store spatial location attributes in the form of address text. To realize centralized management, analysis and information sharing in geographic space, it must first be converted into spatial data . To complete this process, it is necessary to find and obtain the geographic coordinates corresp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/30G06F16/29G06F16/903G06K9/62G06N3/04
CPCG06F40/289G06F40/30G06F16/29G06F16/90344G06N3/045G06F18/214
Inventor 亢孟军刘越苏世亮翁敏林玥叶蕾
Owner WUHAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More