Method for geographic semantic mining based on text big data

A technique in the field of semantic mining and text data, applied in electronic digital data processing, special data processing applications, instruments, etc. It addresses the scarcity of research on mining geographic semantics from geotagged text and helps strengthen people's understanding of a region.

Inactive Publication Date: 2018-12-04
PEKING UNIV

AI Technical Summary

Problems solved by technology

[0006] In summary, mining people's perception of the geographic semantics of a region is of great significance, and the large volume of text data carrying geographic location tags provides a data basis for such mining, yet few studies have built on it.

Examples

Embodiment Construction

[0034] The present invention will be further elaborated below through specific embodiments in conjunction with the accompanying drawings.

[0035] In this embodiment, the data used are microblog posts with geographic information from Beijing covering the whole of 2016. There are 4,975,416 microblogs in total, and the area within Beijing's Fifth Ring Road is divided into 234 regions.
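The grouping of geotagged posts into regions can be sketched as follows. This is only an illustration: the visible text does not say how the 234 regions are drawn, so the uniform 13 x 18 grid, the bounding-box coordinates, and the names `region_id` and `count_by_region` are all assumptions made for the example.

```python
from collections import Counter

# Rough, illustrative bounding box; these coordinates are NOT from the source.
LAT_MIN, LAT_MAX = 39.75, 40.03
LON_MIN, LON_MAX = 116.20, 116.55
ROWS, COLS = 13, 18  # 13 * 18 = 234 cells; the grid shape is an assumption


def region_id(lat, lon):
    """Return a cell index in [0, ROWS * COLS), or None if outside the box."""
    if not (LAT_MIN <= lat < LAT_MAX and LON_MIN <= lon < LON_MAX):
        return None
    row = int((lat - LAT_MIN) / (LAT_MAX - LAT_MIN) * ROWS)
    col = int((lon - LON_MIN) / (LON_MAX - LON_MIN) * COLS)
    # Guard against floating-point rounding at the upper edge.
    row = min(row, ROWS - 1)
    col = min(col, COLS - 1)
    return row * COLS + col


def count_by_region(points):
    """Count how many (lat, lon) points fall into each grid cell."""
    counts = Counter()
    for lat, lon in points:
        rid = region_id(lat, lon)
        if rid is not None:
            counts[rid] += 1
    return counts
```

Points outside the box are dropped rather than clamped, so posts from beyond the study area do not inflate the edge cells.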

[0036] As shown in Figure 1, the geographic semantic mining method based on text big data of this embodiment comprises the following steps:

[0037] 1) Data crawling:

[0038] Because no public data source is available, this embodiment uses a crawler to collect microblog posts from the web, yielding text data with geographic location tags.
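The crawler itself is not described in the visible text, so the sketch below covers only the step after crawling: keeping the records that actually carry a location tag. It assumes, purely for illustration, that crawled posts are stored one JSON object per line with hypothetical fields `text`, `lat`, and `lon`.

```python
import json


def parse_geotagged(lines):
    """Yield (text, lat, lon) for records that carry a usable location tag."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed rows instead of aborting the whole run
        if not isinstance(record, dict):
            continue
        # Field names are hypothetical; adapt them to the crawler's output.
        text = record.get("text")
        lat = record.get("lat")
        lon = record.get("lon")
        if not text or lat is None or lon is None:
            continue
        try:
            lat_f, lon_f = float(lat), float(lon)
        except (TypeError, ValueError):
            continue  # coordinates present but not numeric
        yield text, lat_f, lon_f
```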

[0039] 2) Text data annotation:

[0040] The text data does not itself carry a geographic semantic topic, so to determine the geographic semantic information contained in the text more accurately, it is necessary to assign a geographic s...

Abstract

The invention discloses a geographic semantic mining method based on text big data. The method uses data crawling to obtain text data with geographic location tags, assigns a geographic semantic topic to a selected part of the text data, preprocesses the text data to generate word vectors, obtains the geographic semantic topics of all the texts by a machine learning method, and finally outputs all the geographic semantic topics in the form of vectors. The method infers the geographic semantics of a region from the text data of that region and provides theoretical support and hypotheses for urban planning, commercial site selection, trip planning and the like. The results also help strengthen people's understanding of a given region and assist in planning travel or leisure activities.
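The flow in the abstract (label a subset, learn from it, classify all texts, output a topic vector per region) can be sketched as below. This is a minimal illustration under stated assumptions, not the patented implementation: the visible text does not name the machine learning method, so a multinomial naive Bayes classifier over bag-of-words counts is swapped in here, whitespace tokenization stands in for real Chinese word segmentation, and all function names are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Whitespace split is a placeholder; Chinese text needs a word segmenter.
    return text.lower().split()


def train_naive_bayes(labeled):
    """labeled: iterable of (text, topic) pairs from the annotated subset."""
    topic_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, topic in labeled:
        topic_docs[topic] += 1
        for tok in tokenize(text):
            word_counts[topic][tok] += 1
            vocab.add(tok)
    return {"topic_docs": topic_docs, "word_counts": word_counts, "vocab": vocab}


def predict(model, text):
    """Return the most probable topic, using add-one (Laplace) smoothing."""
    total_docs = sum(model["topic_docs"].values())
    vocab_size = len(model["vocab"])
    best_topic, best_score = None, -math.inf
    for topic in sorted(model["topic_docs"]):
        counts = model["word_counts"][topic]
        denom = sum(counts.values()) + vocab_size
        score = math.log(model["topic_docs"][topic] / total_docs)
        for tok in tokenize(text):
            if tok in model["vocab"]:  # ignore words never seen in training
                score += math.log((counts[tok] + 1) / denom)
        if score > best_score:
            best_topic, best_score = topic, score
    return best_topic


def region_topic_vectors(model, posts, topics):
    """posts: iterable of (region_id, text). Returns {region: [share per topic]}."""
    tallies = defaultdict(Counter)
    for region, text in posts:
        tallies[region][predict(model, text)] += 1
    vectors = {}
    for region, tally in tallies.items():
        total = sum(tally.values())
        vectors[region] = [tally[t] / total for t in topics]
    return vectors
```

Each region's vector holds the share of its posts assigned to each topic, in the order given by `topics`, so regions with very different post volumes remain comparable.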

Description

Technical field

[0001] The invention relates to data analysis and mining technology, in particular to a geographic semantic mining method based on text big data.

Background art

[0002] Geographic semantics is a semantic description of geographic information, reflecting the characteristics of a region and people's perception of it. Each geographic location has its own semantic information. For example, "Beijing" as a geographic location carries semantic information such as "politics", "tourism", and "culture", while "Zhongguancun" carries semantic information such as "food", "commercial", and "technology". Mining geographic semantics helps strengthen people's understanding of a given location.

[0003] There are many ways to mine geographic semantics. The direct way is through local life information platforms (such as Dianping.com) or POI (Point Of Interest) information on maps, but...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 孙艳春, 刘瑜, 黄罡, 温九, 张乐聪
Owner PEKING UNIV