A Spark cluster parallel computing-based traffic jam point discovery method

A technology of traffic congestion and method discovery, applied in computing, computer components, execution paradigms, etc., can solve problems such as easy to fall into local optimal solutions, low precision, and high time complexity

Active Publication Date: 2019-05-10
GUANGXI TRANSPORTATION SCI & TECH GRP CO LTD +1
View PDF8 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the grid clustering algorithm is the fastest, but the accuracy is not high; the quality of the mean algorithm depends on the selection of the initial clustering center, and it is easy to fall into a local optimal solution; the advantage of the density clustering algorithm is that the clustering effect is good, but the time is complicated high degree

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Spark cluster parallel computing-based traffic jam point discovery method
  • A Spark cluster parallel computing-based traffic jam point discovery method
  • A Spark cluster parallel computing-based traffic jam point discovery method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0091] The specific embodiments of the present invention will be further described below in conjunction with the accompanying drawings, but the present invention is not limited.

[0092] figure 1 A method for discovering traffic congestion points based on spark cluster parallel computing is shown, including the following steps:

[0093] (1) The preprocessing of massive data, including complementing the error between track points and the redundancy of track points within a period of time in the region, the specific process is as follows:

[0094]Scan all data sources, extract the data into the RDD of the Spark cluster to obtain the data set U; divide the data, distribute the divided data to the nodes to obtain the data set {U1, U2, U3...Un}, each node puts The dataset collection is assigned to the Map function, and an interception function is called in the Map function to intercept the last three data fields of each piece of data to obtain the timestamp T, longitude value Long...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a traffic jam point discovery method based on Spark cluster parallel computing. The invention discloses a trajectory big data mining method, relates to the technical field of trajectory big data mining, solves the technical problem of how to quickly cluster massive trajectory data and find a traffic congestion source of an urban hot region, and comprises the following steps: (1) preprocessing the massive data, including complementing errors between trajectory points and eliminating trajectory point redundancy in a certain region for a period of time; (2) adopting a grid-means clustering algorithm to carry out clustering to obtain a plurality of target data samples; (3) clustering by adopting a neighborhood maximum density grid clustering algorithm to obtain an urbanhot traffic road network model; and (4) calculating the adjacent grid density difference by adopting a neighborhood density difference algorithm to obtain a traffic jam source, and storing the obtained result in a Spark memory. According to the method, mass data can be quickly clustered to obtain an urban traffic network model, and quick discovery of a traffic jam source region is realized.

Description

technical field [0001] The invention relates to the technical field of big data mining, in particular to a method for discovering traffic congestion points based on spark cluster parallel computing. Background technique [0002] With the increasing development of urban traffic and the widespread application of vehicle positioning systems, a large amount of trajectory data can be generated every day, but these massive trajectory data have not been well utilized. At the same time, with the rapid development of urban economy and the needs of smart cities, in the face of increasingly severe traffic congestion problems, how to use the current trajectory big data to find the source of traffic congestion and solve urban traffic congestion problems, and promote the progress of smart cities new research hotspot. [0003] Currently, there are many tools for mining and analyzing big data platforms, among which Hadoop and Spark are the main ones. [0004] Compared with Spark, because ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/448G06K9/62G06Q50/26
Inventor 刘阳何倩李双富李祖文江炳城杨辉黄焕徐红
Owner GUANGXI TRANSPORTATION SCI & TECH GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products