Internet-oriented place name extraction and standardization method

A technology of the Internet and place names, applied in structured data retrieval, geographic information databases, instruments, etc., can solve problems such as insufficient standards, homonyms, and description errors, and achieve the effect of improving accuracy

Inactive Publication Date: 2016-01-06
CHINASO INFORMATION TECH
View PDF4 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing geographic location information mining algorithms mainly use keyword matching methods. Due to the problems of description errors, inaccuracies, homonyms, and insufficient standards in the place name and address information in the text under the Internet environment, the location based on keyword matching The accuracy of information mining algorithms is low, which is not enough to meet the requirements of various industries for geographic information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet-oriented place name extraction and standardization method
  • Internet-oriented place name extraction and standardization method
  • Internet-oriented place name extraction and standardization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention aims at the existence mode and structural characteristics of place names and addresses in Internet webpages, uses the recognition rules and dynamic relations of place names and addresses, and conducts identification on the basis of national administrative division information and the national basic place names and addresses database, and studies the expression model and extraction of multi-level place names and addresses Method, through the upper and lower semantic relationship of place names and addresses in the text, and referring to the standard model of place names and addresses, the automatic identification, extraction and standardization of Chinese place names and addresses in the text information of Internet web pages are realized, thereby improving the accuracy of Internet place name and address information extraction and standardization, It provides a technical basis for the address matching process based on place name address information an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an internet-oriented place name extraction and standardization method. According to the method, aimed at existing ways and structural features of place names and addresses in an internet page, identification is performed based on state administrative division information and a nationwide basic place name and address library by utilizing identification rules and dynamic relationships of the place names and the addresses, a multi-stage place name and address expression model and an extraction method are researched, automatic identification, extraction and standardization of Chinese place names and addresses in text information of the internet page are realized with reference to a place name and address standard model through superior-subordinate semantic relationships of the place names and the addresses in the text, and a technical basis is provided for spatial positioning of related geographic information of geographic entities, events and the like.

Description

technical field [0001] The invention relates to a method for extracting and standardizing place names and addresses, in particular to a method for extracting and standardizing place names and addresses based on the characteristics of Internet information text information facing the Internet. The spatial positioning provides the technical basis. Background technique [0002] With the rapid development of Internet technology, the Internet has become the largest gathering place for geographic information. Internet geographic information has entered the era of big data. In the next 10 years, at least 80% of human-computer interaction text data will involve geographic information. The Internet will become a large-scale constantly updated Geographical information databases, how to mine these geographic information and use them in geographic information services is the main problem. [0003] Place name and address data is the most commonly used social public information resource, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/284G06F16/29G06F16/9537
Inventor 杨治安王静蔡地胡威索玉霞杜立佳李秀娟
Owner CHINASO INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products