Method for extracting entity address message in text context

A technology of address information and physical address, which is applied in the direction of instruments, calculations, electrical digital data processing, etc., and can solve problems such as inability to extract

Inactive Publication Date: 2009-09-02
PEKING UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method can only deal with fixed-format addresses in the text, and cannot extract address description information in formats other than templates.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for extracting entity address message in text context
  • Method for extracting entity address message in text context
  • Method for extracting entity address message in text context

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Below we use a specific example to illustrate how to implement the method described in the present invention to calculate the spatial correlation between addresses and entities in web pages. Assume that the sentence in black below is about the content of several webpages of the entity "Punk Beauty Salon", and the part in italics is the marked address part.

[0037] Fenfangxueyan (Chongwenmen), Room 710, Building A, New World Taihua Apartment, No. 5, Chongwenmenwai Street, Chongwen District. Ruibaona Skin Care, Room 414, Jinfenghe Property Complex Building, No. 8, Xinjiekouwai Street, Xicheng District, City, Dreams come true, Beauty and Body, 8D. Punk Beauty Salon, No. 3, Science and Technology Exhibition Center, No. 48, North Third Ring West Road, Haidian District, North Floor, Building 8, Huaqing Jiayuan, Wudaokou, Haidian District

[0038] Exploring the creepy city of rats in South Asia·Beijing man openly proposed to Yang Lijuan and willing to support his mother·The l...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method for extracting entity address information in a text context, which collects a collection of web pages containing entity names, calculates the initial correlation degree and corrects the initial correlation degree to obtain the final correlation degree, and finally ranks according to the correlation degree , and return the top results to the user; the present invention can effectively find out the address information related to the physical space specified by the user from the addresses included in the web page, and assist the user to locate.

Description

technical field [0001] The invention relates to the field of text information extraction, in particular to a method for extracting entity address information in a text context. Background technique [0002] It is an important task in the field of text information extraction to find descriptive information from text and connect it to a given entity to form a complete description of the entity. Because entities such as institutions, events, and people generally have their address description information, this information plays a very important role in the positioning of entities. How to effectively extract the address description information related to a given physical space from the text context is a necessary and practical work. However, in the context of textual context to extract address description information related to entities, there is still little related research work in China, and there is a lack of effective extraction methods. A common practice is to extract ad...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 罗英伟汪小林周晓鲁许卓群
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products