Method for extracting road traffic information from Internet unstructured text

A road traffic, unstructured technology, applied in the field of traffic information, can solve the problems of lack of implicitness and omission of road traffic information elements, no consideration of road traffic information description, failure to correctly identify road positioning description information, etc., to facilitate automatic processing Effect

Inactive Publication Date: 2014-06-25
INST OF GEOGRAPHICAL SCI & NATURAL RESOURCE RES CAS
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing road traffic information system can only deal with structured data expressed in two-dimensional form, and it is necessary to use information extraction technology to extract structured road traffic information from unstructured text on the Internet
The existing information extraction technology does not consider the characteristics of road traffic information description, cannot correctly identify the road location description information based on the linear reference method from the Internet unstructured text, and lacks the ability to deal with the hidden road traffic information elements in the Internet unstructured text description. Ability to include and omit phenomena

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for extracting road traffic information from Internet unstructured text
  • Method for extracting road traffic information from Internet unstructured text
  • Method for extracting road traffic information from Internet unstructured text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0042] Such as figure 1 Shown is a flowchart of a method for extracting road traffic information from Internet unstructured text in an embodiment of the present invention, including the following steps:

[0043] Step 1. Define the data structure of road traffic information, which is convenient for organizing and managing road traffic information in the form of a two-dimensional table. The data structure is composed of information elements and specific element attributes of information elements, and can be used to express the types of road traffic information There are road condition information, road traffic restriction information, road traffic control information, road traffic accident information, and road en...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for extracting road traffic information from an Internet unstructured text. The method comprises the steps of defining a data structure of the road traffic information and a description feature word type of the road traffic information, expanding a few manually-established basic extraction modes to obtain an extraction mode bank, generating a feature word type sequence after the input Internet unstructured text is preprocessed, obtaining a matched extraction mode of the input text according to the similarity of the feature word type sequence, utilizing the matching extraction mode for extracting a positioning information element and a type information element of the road traffic information from the Internet unstructured text, utilizing a regular expression and a judgment rule for extracting a time information element from the input text, and obtaining the road traffic information through the combination of the positioning information element, the type information element and the time information element. By the means of the method for extracting the road traffic information from the Internet unstructured text, real-time processing can be carried out on the unstructured text collected from the Internet, the road traffic information can be extracted, and the traffic information collecting means are enriched.

Description

technical field [0001] The invention relates to the field of traffic information, in particular to a method for extracting road traffic information from Internet unstructured text. Background technique [0002] The continuous increase in the number of motor vehicles in cities has made urban road traffic problems increasingly prominent, and the public's demand for real-time road traffic information is also more urgent. Road traffic information mainly includes road traffic flow, road conditions, traffic restrictions, traffic control, traffic events, traffic weather and road surface environment information. Existing real-time road traffic information collection technologies, such as fixed sensor technology (induction coil, video surveillance and microwave detection), floating vehicle technology with GPS and wireless communication equipment, mobile communication terminal signaling analysis technology, etc. It has been widely used, but it cannot collect road traffic information ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/30G06F16/313G06F40/211
Inventor 陆锋仇培元张恒才
Owner INST OF GEOGRAPHICAL SCI & NATURAL RESOURCE RES CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products