Constraint model-based short-text temporal expression recognition method for english social media

A social media and constraint model technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as poor effect, short text length, large noise, etc., to achieve accurate recognition, improved accuracy, and improved accuracy Effect

Active Publication Date: 2018-12-25
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But first of all, the traditional named entity method is aimed at many categories, including person names, organization names, place names, etc. In order to meet the needs of identifying various entities, its feature extraction has a certain degree of universality, and there is no feature for time expressions, resulting in It does not recognize temporal expressions well
Secondly, most of the traditional named entity recognition methods are aimed at formal texts, but the actual network texts are randomly generated by users, and the text length is generally short, and most of them are informal texts, which are noisy and have irregular structure patterns, which lead to traditional naming. Entity methods are less effective at identifying entities in web text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Constraint model-based short-text temporal expression recognition method for english social media
  • Constraint model-based short-text temporal expression recognition method for english social media
  • Constraint model-based short-text temporal expression recognition method for english social media

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0042] Recognizing time expressions can extract time information in text content sent by people, and has very important applications in event detection, automatic question answering, information extraction, etc. The traditional named entity recognition method is the main method to solve the temporal entity recognition in social network texts. Because named entity recognition needs to recognize many entity categories, it is not targeted in the feature formulation, and there is interference between different types of entities, resulting in The recognition accuracy of each type of entity is not high, so the extracted temporal entity is not accurate. On the other hand, in social networks, since the text information usually exists in the form of short text, and the format is not standardized, there is a lot of noise in the text, and the traditional feature fo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for recognizing short text time expression of English social media based on a constraint model, which belongs to the field of text time information extraction. Aiming at the problem that the features of the existing identification method are not pertinent to the time expression and the number is small, the invention constructs the feature set from three aspects of the word structure, the grammatical structure and the combined features, so that the precision of the machine learning model for the time expression is greatly improved; as that condition random fieldshare the transfer characteristics in the solution proces, resulting in an inaccurate identification of the boundaries of the time expression, in the process of solving, the feature set of the extracted current position is used to modify the transfer feature of the current position, and the method of constrained random field is proposed to solve the problem, so as to improve the accuracy of the identified time expression and make the boundary recognition of the time expression more accurate.

Description

technical field [0001] The invention belongs to the field of text time information extraction, in particular to a method for recognizing short text time expressions in English social media based on a constraint model. Background technique [0002] With the rapid development of the Internet and smart mobile terminals, social media, a new product, is also developing rapidly at the same time. More and more people are used to publishing various text information on social media, and the time information in the text is very important. A part of time information in text is also often referred to as time expression. The time expression can express the relationship between the text content and time sent by people, so that the timeliness of the text content sent by people can be judged, and it plays a very important role in the application of event detection, automatic question answering, information extraction, etc. How to extract time expressions from short texts in social media ha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/21
CPCG06F40/117G06F40/284
Inventor 费高雷亓克娜胡光岷
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products