Address text element extraction method based on Hidden Markov and classification algorithm coupling

An address text element extraction technique that couples a hidden Markov model with a classification algorithm. It addresses the weak modeling of the semantic features of address text, strengthens state prediction, and has general reference value for other sequence-state modeling scenarios.

Active Publication Date: 2021-09-03
WUHAN UNIV
Cites: 6 | Cited by: 0

AI Technical Summary

Problems solved by technology

[0004] To address the problem that the hidden Markov model is constrained to a single observation sequence and models the semantic features of address text weakly, the present invention provides a method for extracting address elements from address text by coupling a hidden Markov model with a classification algorithm, which can automatically extract address elements from address text more accurately.
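The single-observation constraint mentioned above can be made concrete with the standard textbook factorization of a hidden Markov model. The notation below (initial distribution \pi, transition matrix a, observation matrix b, hidden states s_t, observations o_t) is the conventional one and is an assumption here, not notation taken from the patent text.

```latex
% Joint probability of a hidden state sequence S = (s_1, ..., s_T)
% and an observation sequence O = (o_1, ..., o_T) in a standard HMM.
P(O, S) = \pi_{s_1}\, b_{s_1}(o_1) \prod_{t=2}^{T} a_{s_{t-1} s_t}\, b_{s_t}(o_t),
\qquad
b_j(o_t) = P(o_t \mid s_t = j).
```

Each factor b_j(o_t) is looked up from a fixed table indexed by one observed symbol, so any additional features of the word at position t have no way to influence the predicted state.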

Examples

Embodiment Construction

[0026] The technical scheme and detailed modeling process of the present invention are described below with reference to the accompanying drawings and examples. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0027] As shown in Figure 1, the technical solution provided by this application mainly includes three modules: data preprocessing and labeling, extraction method modeling, and method evaluation and optimization.

[0028] The data preprocessing and labeling module mainly performs word segmentation on the data. According to the extraction requirements, state labeling is then performed on the segmented data, and the observed state sequence and hidden state se...
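A minimal sketch of what the labeling step might produce, assuming the address text has already been segmented into words and each word has been hand-labeled with a hidden state. The label names (`PROVINCE`, `CITY`, `ROAD`, `NUMBER`), the sample addresses, and the helper names are all hypothetical, and the sketch assumes the observed state is the word itself, which the truncated source text does not confirm.

```python
from collections import Counter

# Hypothetical pre-segmented, hand-labeled address texts: (word, hidden state).
LABELLED = [
    [("湖北省", "PROVINCE"), ("武汉市", "CITY"), ("珞喻路", "ROAD"), ("129号", "NUMBER")],
    [("武汉市", "CITY"), ("八一路", "ROAD"), ("299号", "NUMBER")],
]


def split_sequences(sample):
    """Split one labeled sample into its observed and hidden state sequences."""
    observed = [word for word, _ in sample]
    hidden = [label for _, label in sample]
    return observed, hidden


def count_hmm_statistics(samples):
    """Count initial states and state-to-state transitions over all samples."""
    start_counts = Counter()
    transition_counts = Counter()
    for sample in samples:
        _, hidden = split_sequences(sample)
        start_counts[hidden[0]] += 1
        for prev_state, next_state in zip(hidden, hidden[1:]):
            transition_counts[(prev_state, next_state)] += 1
    return start_counts, transition_counts


def normalise(counts):
    """Turn raw counts into relative frequencies that sum to 1."""
    total = sum(counts.values())
    return {key: value / total for key, value in counts.items()}


start_counts, transition_counts = count_hmm_statistics(LABELLED)
start_probs = normalise(start_counts)
```

Counts of this kind are the usual starting point for estimating the initial and transition probabilities of a hidden Markov model; transition counts would normally be normalised per source state rather than globally.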

Abstract

The invention belongs to the technical field of geographic intelligence and relates to a method for extracting address element information from address text. The method comprises the following steps: S1, pre-defining hidden states and observation states for the word sequence obtained by segmenting an address text, and constructing a hidden Markov model; S2, constructing observation features based on the observation states, and training a multi-class classification model that maps the observation features to the hidden states; and S3, dynamically concatenating, column by column, the probability vectors that the classification model predicts for the hidden states into an observation probability matrix, replacing the static observation probability matrix of the hidden Markov model with it, and thereby constructing a coupled model. The method preserves the sequence modeling capability of the hidden Markov model while using a classification algorithm that fuses multi-dimensional observation features to strengthen how well the observation states indicate the hidden states. The invention can be used to map irregular address text data in the field of volunteered geographic information into structured address element information, and has general reference value for other sequence-state modeling scenarios to which hidden Markov models apply.
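Step S3 can be illustrated with a small sketch: a classifier's per-word probability vectors are stacked column by column into an observation matrix, and Viterbi decoding then uses that matrix in place of a static one. Everything concrete here is an assumption for illustration: the three state names, the probabilities, and `toy_classifier`, a rule-based stand-in for the trained multi-class model, whose actual features and algorithm the abstract does not specify.

```python
import math

STATES = ["CITY", "ROAD", "NUMBER"]           # hypothetical hidden states
START = [0.8, 0.1, 0.1]                       # hypothetical initial probabilities
TRANS = [[0.1, 0.8, 0.1],                     # hypothetical transition matrix,
         [0.1, 0.2, 0.7],                     # TRANS[i][j] = P(state j | state i)
         [0.3, 0.3, 0.4]]


def toy_classifier(word):
    """Stand-in for the trained classifier: returns P(state | word features)."""
    if word.endswith("市"):
        return [0.8, 0.1, 0.1]
    if word.endswith("路"):
        return [0.1, 0.8, 0.1]
    if word[0].isdigit():
        return [0.05, 0.05, 0.9]
    return [1 / 3, 1 / 3, 1 / 3]


def build_observation_matrix(words, classify):
    """Stack one probability vector per word as columns (rows are states)."""
    columns = [classify(word) for word in words]
    return [[column[i] for column in columns] for i in range(len(STATES))]


def _log(p):
    return math.log(p) if p > 0 else float("-inf")


def viterbi(start, trans, obs_matrix):
    """Most likely state index path, using obs_matrix[state][t] as emissions."""
    n, length = len(start), len(obs_matrix[0])
    score = [[float("-inf")] * length for _ in range(n)]
    back = [[0] * length for _ in range(n)]
    for i in range(n):
        score[i][0] = _log(start[i]) + _log(obs_matrix[i][0])
    for t in range(1, length):
        for j in range(n):
            best = max(range(n), key=lambda i: score[i][t - 1] + _log(trans[i][j]))
            score[j][t] = score[best][t - 1] + _log(trans[best][j]) + _log(obs_matrix[j][t])
            back[j][t] = best
    state = max(range(n), key=lambda i: score[i][length - 1])
    path = [state]
    for t in range(length - 1, 0, -1):        # follow back-pointers to the start
        state = back[state][t]
        path.append(state)
    return path[::-1]


words = ["武汉市", "八一路", "299号"]
matrix = build_observation_matrix(words, toy_classifier)
labels = [STATES[i] for i in viterbi(START, TRANS, matrix)]
```

Because the matrix is rebuilt for each input sequence, the classifier can draw on any features of a word, while the transition matrix still supplies the sequence constraint.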

Description

Technical field

[0001] This application belongs to the technical field of geographic intelligence and relates to a method for extracting address elements from address text, in particular to an address text element extraction method based on the coupling of hidden Markov and classification algorithms.

Background art

[0002] With the development of Web technology and Volunteered Geographic Information (VGI), geospatial information that users contribute spontaneously through mobile Internet devices, such as OpenStreetMap and user check-in records, has become an increasingly important data source in the field of geographic information science. Unstructured geographic text is one of the important data types. Mapping such unstructured text data into structured geographic information has become an important research direction in the field of geographic information systems.

[0003] The address text data provided in VGI is created spontaneously by ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/216, G06F40/279, G06F40/289, G06F40/30
CPC: G06F40/216, G06F40/279, G06F40/289, G06F40/30, Y02D10/00
Inventor: 李锐, 刘朝辉
Owner: WUHAN UNIV