Part-of-speech tagging-based internet news related place name identification method and system

A part-of-speech tagging and recognition technology, applied in geographic information databases, instruments, computing, etc., that addresses the low accuracy of place name extraction and achieves simple implementation with good application and promotion value.

Active Publication Date: 2019-11-01
INSPUR SOFTWARE CO LTD

AI Technical Summary

Problems solved by technology

[0007] The technical task of the present invention is to address the above-mentioned existing problems and provide a method for identifying place-names involved in Internet news based o

Examples

Embodiment

[0036] As shown in Figure 1, the part-of-speech tagging-based method for identifying place names involved in Internet news uses the overall reporting area of a news media column to supplement the context information of a news item, which helps the place name disambiguation program judge place names correctly. Part-of-speech tagging converts the news content into a sequence of pure noun phrases, and place name recognition is performed on that sequence. The recognition results are then reduced twice to eliminate inaccurate place names, and finally the two reduction results are combined in a weighted summary to confirm the place name.
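The flow in [0036] can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the patented implementation: the input is already tagged as `(word, tag)` pairs, noun tags are assumed to start with `n`, `GAZETTEER` is a hypothetical toy dictionary, and the two reduction steps (which the text does not detail) are represented only by their outputs, two score maps passed to `weighted_confirm` with hypothetical weights.

```python
from collections import Counter

# Hypothetical toy gazetteer; a real system would load an
# administrative-division dictionary instead.
GAZETTEER = {"Jinan", "Shandong", "Qingdao"}


def noun_sequence(tagged_tokens):
    """Keep only words whose tag starts with 'n' (assumed noun-tag prefix)."""
    return [word for word, tag in tagged_tokens if tag.startswith("n")]


def recognize_places(nouns, gazetteer=GAZETTEER):
    """Count how often each gazetteer entry occurs in the noun sequence."""
    return Counter(word for word in nouns if word in gazetteer)


def weighted_confirm(first, second, w_first=0.5, w_second=0.5):
    """Combine two reduced candidate score maps by a weighted sum.

    `first` and `second` stand in for the results of the two reduction
    steps; the weights are illustrative assumptions. Returns the
    best-scoring place, or None when there are no candidates.
    """
    scores = {}
    for place in sorted(set(first) | set(second)):
        scores[place] = (w_first * first.get(place, 0)
                         + w_second * second.get(place, 0))
    return max(scores, key=scores.get) if scores else None
```

For example, a tagged sentence that mentions "Jinan" twice and "Shandong" once yields a noun sequence in which "Jinan" has the higher count, so it wins the weighted summary unless the second reduction result strongly favors another candidate.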

[0037] Specifically, the method includes the following steps:

[0038] S1. Determine the overall reporting area of the media column: the place name that accounts for an absolute majority of all the place names referred to by the news under this column.

[0039] If the propo...
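Step S1 can be sketched as a simple frequency count over the place names found in a column's news items. This is a hedged illustration only: the function name `overall_reporting_area` and the `min_share` threshold are hypothetical, and the source text is truncated before it states the actual proportion rule, so the default of 0.5 is an assumption standing in for "absolute majority".

```python
from collections import Counter


def overall_reporting_area(place_names, min_share=0.5):
    """Return the place name whose share of all mentions exceeds min_share.

    `place_names` is the flat list of place names recognized across all
    news items of one column. `min_share` is an assumed threshold, not a
    value taken from the source. Returns None when no place dominates.
    """
    counts = Counter(place_names)
    if not counts:
        return None
    place, hits = counts.most_common(1)[0]
    return place if hits / len(place_names) > min_share else None
```

A column with no dominant place returns `None`, in which case no area-level context would be available to supplement individual news items.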

Abstract

The invention discloses a part-of-speech tagging-based method and system for identifying place names involved in Internet news, and belongs to the technical field of natural language processing. The method comprises the steps of: supplementing the context information of a news item by using the overall reporting area of the news media column, which assists a place name disambiguation program in judging place names correctly; converting the news content into a sequence of pure noun phrases by part-of-speech tagging; performing place name recognition on the noun phrase sequence; reducing the recognition results twice to eliminate inaccurate place names; and finally combining the two reduction results in a weighted summary to confirm the place name. The method is easy to understand and simple to implement, effectively addresses the low accuracy of extracting place names involved in news, and has good application and promotion value.

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, and specifically provides a method and system for identifying place names involved in Internet news based on part-of-speech tagging.

Background

[0002] Place name recognition is a subtask of entity recognition in natural language processing. Traditional place name recognition builds a hierarchical place name model from an administrative-division dictionary and combines it with a word segmentation algorithm and place name disambiguation. Typically, the forward maximum matching segmentation algorithm is used to match the longest possible place name. Mainstream disambiguation methods also introduce context information through hierarchical clustering during recognition to resolve address ambiguity.

[0003] However, the curre...
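The forward maximum matching segmentation mentioned in [0002] can be sketched as follows. This is a generic textbook version, not code from the patent: the function name `forward_max_match` and the toy dictionary in the usage note are assumptions chosen for illustration.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Segment text left to right, always taking the longest dictionary hit.

    At each position the longest candidate (up to max_len characters) is
    tried first; if nothing in the dictionary matches, a single character
    is emitted so the scan always advances.
    """
    if max_len is None:
        # Longest dictionary entry bounds the window; 1 if dictionary is empty.
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens
```

With a dictionary containing both "山东" and "山东省", the input "山东省济南市" is split at the longer entries, which is the "maximize the match" behavior the background section describes.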

Application Information

IPC(8): G06F17/27, G06F16/29, G06F16/9537
CPC: G06F16/29, G06F16/9537
Inventors: 苏坤雄, 彭光
Owner: INSPUR SOFTWARE CO LTD