Automatic foreign name identification and control method based on context semantics

A foreign name recognition method, applied in the fields of instruments, computing, and electrical digital data processing. It addresses two problems: dedicated research on foreign name recognition is scarce, and the recognition effect needs to be improved. The method reduces recognition errors and improves the recognition effect.

Inactive Publication Date: 2013-03-06
EAST CHINA NORMAL UNIVERSITY

AI Technical Summary

Problems solved by technology

At present, there are many studies on Chinese names and good results have been achieved, while there are rel



Examples


Example Embodiment

[0022] Other features, purposes and advantages of the present invention will become more apparent from the following description of foreign name recognition, read with reference to the drawings:

[0023] Figure 1 shows a flow chart of the method for automatic recognition of foreign names based on contextual semantics according to the first embodiment of the present invention. Specifically, the figure shows four steps. Step S201 analyzes the text to be recognized and obtains the set of candidate foreign name strings. Step S202 uses the foreign name rule set to correct and screen the candidate foreign name string set, yielding the first intermediate foreign name string set. Step S203 uses probability statistics and probability models to further filter the first intermediate foreign name string set, yielding the recognized foreign name set. Finally, step d confirms names that have not yet been confirmed according to the recognized forei...
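As a rough illustration of steps S201 and S202, the following Python sketch collects candidate strings and screens them with a few rules. The source does not disclose how candidates are extracted or what the rule set contains, so everything here is an assumption made for illustration: the small character inventory `TRANSLIT_CHARS`, the functions `extract_candidates` and `apply_rules`, and the three rules themselves.

```python
# Hypothetical sketch of steps S201 and S202; not the patented procedure.

SEPARATOR = "\u00b7"  # middle dot commonly written between parts of a foreign name

# Assumed, deliberately tiny sample of characters often used in transliterated
# names. A real system would need a far larger inventory.
TRANSLIT_CHARS = set("乔治布什托尼莱尔克林顿奥巴马斯特罗")


def extract_candidates(text):
    """Step S201 (sketch): collect maximal runs of transliteration characters."""
    candidates, run = [], []
    for ch in text + "\n":  # the trailing newline flushes the final run
        if ch in TRANSLIT_CHARS or ch == SEPARATOR:
            run.append(ch)
        else:
            if run:
                candidates.append("".join(run))
            run = []
    return candidates


def apply_rules(candidates, min_len=2):
    """Step S202 (sketch): correct and screen candidates with simple rules."""
    kept = []
    for cand in candidates:
        cand = cand.strip(SEPARATOR)  # correction: drop stray leading/trailing dots
        if len(cand.replace(SEPARATOR, "")) < min_len:
            continue  # screening: too short to be a plausible name
        if SEPARATOR * 2 in cand:
            continue  # screening: consecutive separators are malformed
        kept.append(cand)
    return kept
```

Calling `apply_rules(extract_candidates(text))` on a sentence that contains both the common word 马上 and the name 乔治·布什 would drop the single character 马 (too short) and keep the full name, which is the kind of early screening the rule set is described as performing.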


Abstract

The invention provides an automatic foreign name recognition method based on context semantics for natural language processing systems, developed by studying the characteristics of foreign names and combining them with a statistical probability model. The method comprises the following steps: a. analyzing the text to be recognized and obtaining a candidate foreign name string set; b. correcting and screening the candidate foreign name string set using a foreign name rule set to obtain a first intermediate foreign name string set; c. further screening the first intermediate foreign name string set using probability statistics and the probability model to obtain a recognized foreign name set; and d. determining the not-yet-recognized foreign names according to the recognized foreign name set. The method makes full use of the contextual features of names and the character features of foreign names, greatly reduces recognition errors caused by word segmentation, largely avoids misidentifying other named entities as person names, and improves the recognition effect.
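Steps c and d can be sketched in Python as follows. The abstract does not specify the probability model, so this sketch substitutes a plain character-level score (the geometric mean of per-character name probabilities) and ignores context features. The probability table `CHAR_NAME_PROB`, the threshold, and the functions `name_score`, `filter_by_probability` and `confirm_remaining` are all hypothetical.

```python
import math

# Hypothetical sketch of steps c and d; not the patented model.

SEPARATOR = "\u00b7"  # middle dot between parts of a foreign name

# Assumed probabilities that a character occurs inside a foreign name.
# In practice these would be estimated from an annotated corpus.
CHAR_NAME_PROB = {"乔": 0.9, "治": 0.8, "布": 0.3, "什": 0.2, "马": 0.2, "上": 0.01}
DEFAULT_PROB = 0.05  # assumed fallback for characters missing from the table


def name_score(candidate):
    """Geometric mean of per-character name probabilities (separators ignored)."""
    chars = [c for c in candidate if c != SEPARATOR]
    if not chars:
        return 0.0
    log_sum = sum(math.log(CHAR_NAME_PROB.get(c, DEFAULT_PROB)) for c in chars)
    return math.exp(log_sum / len(chars))


def filter_by_probability(candidates, threshold=0.3):
    """Step c (sketch): keep candidates whose score reaches the threshold."""
    return {c for c in candidates if name_score(c) >= threshold}


def confirm_remaining(unconfirmed, recognized):
    """Step d (sketch): accept a leftover candidate if it is a part of a recognized name."""
    parts = {p for name in recognized for p in name.split(SEPARATOR)}
    return {c for c in unconfirmed if c in parts}
```

With these assumed numbers, the full name 乔治·布什 passes the threshold while the bare surname 布什 does not. Step d then recovers 布什 because it is a component of an already recognized name, whereas 马上 stays rejected. This mirrors the idea of using the recognized set to settle names that the statistical filter alone leaves undecided.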

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to foreign name recognition technology within named entity recognition.

Background

[0002] Named entity recognition is an active research topic and a foundational task in natural language processing. It is of great significance to the field and has been applied in many of its areas, such as information retrieval, information extraction and machine translation. Named entities generally include person names, place names, organization names, dates, times, etc. Among the various named entity recognition tasks, person name recognition has always held an important position, and its recognition effect has an important impact on Chinese word segmentation. Person names in Chinese text include Chinese names and foreign names. At present, there are many studies on Chinese names and good results have been achieved, while there are relatively few special studies on forei...


Application Information

IPC(8): G06F17/27
Inventor: 王祖兴, 吕钊, 顾君忠
Owner EAST CHINA NORMAL UNIVERSITY