Knowledge graph construction-oriented text time extraction and standardization method

A time extraction and knowledge graph technology, applied in special data processing applications, instruments, electronic digital data processing, etc. It addresses the difficulty of manually formulating an accurate and comprehensive rule system, the conversion of relative time, the completion of incomplete time expressions, and the resulting poor normalization of time phrases.

Active Publication Date: 2018-07-20
TONGJI UNIV

AI Technical Summary

Problems solved by technology

Time information extraction has been widely studied, generally using rule-based or machine learning-based methods. Rule-based methods are simple, but it is difficult to manually formulate an accurate and comprehensive rule system.
Machine learning-based methods require labeled training data of a certain scale, and manual labeling is costly.
In addition, because time information takes flexible and diverse forms, mapping Chinese time information is difficult: relative time expressions must be converted and incomplete time expressions must be completed, and failures at either step lead to poor normalization of time phrases.
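
To make the two mapping problems concrete, here is a minimal Python sketch of relative time conversion and incomplete time completion. It is an illustration under stated assumptions, not the patented method: the function name `normalize`, the lookup table, the regular expression, and the rule of borrowing the missing year from a reference date (for example, the publication date of a news text) are all hypothetical.

```python
import re
from datetime import date, timedelta
from typing import Optional

# Hypothetical lookup table: relative day words mapped to an offset in days.
RELATIVE_DAY_OFFSETS = {"前天": -2, "昨天": -1, "今天": 0, "明天": 1, "后天": 2}

# Hypothetical pattern: a date whose year may be missing,
# e.g. "2018年3月5日" (complete) or "3月5日" (incomplete).
PARTIAL_DATE = re.compile(r"(?:(\d{4})年)?(\d{1,2})月(\d{1,2})日")


def normalize(phrase: str, reference: date) -> Optional[date]:
    """Map a time phrase to an absolute date, or return None if unsupported."""
    # Relative time conversion: shift the reference date by a known offset.
    offset = RELATIVE_DAY_OFFSETS.get(phrase)
    if offset is not None:
        return reference + timedelta(days=offset)

    # Incomplete time completion: fill a missing year from the reference date.
    match = PARTIAL_DATE.fullmatch(phrase)
    if match is None:
        return None
    year_text, month_text, day_text = match.groups()
    year = int(year_text) if year_text else reference.year
    try:
        return date(year, int(month_text), int(day_text))
    except ValueError:
        # Impossible calendar dates such as "2月30日" are rejected.
        return None
```

With a reference date of 2018-07-20, `normalize("昨天", ...)` yields 2018-07-19 and `normalize("3月5日", ...)` yields 2018-03-05. Borrowing the year from the reference date is only one possible completion rule; a real system might instead choose the nearest past or future date depending on context.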



Examples


Embodiment

[0040] The invention discloses a text time extraction and standardization method for knowledge graph construction. It extracts, maps, and semantically models time information, thereby providing a general way of expressing time information for knowledge graph construction. Knowledge graphs generally store processed and normalized knowledge tuples, and a large proportion of that knowledge is event-based and time-sensitive. However, existing knowledge graphs give little consideration to time information. Extracting and normalizing text time and integrating it into knowledge tuples is therefore of great significance for building a knowledge graph with complete time information.

[0041] The present invention analyzes in depth how time information is expressed in natural language and constructs a time semantic understanding model oriented to natural language text, thereby providing a general time expression mode.

[0042] The purpose of t...


Abstract

The invention relates to a text time extraction and standardization method oriented to knowledge graph construction. The method comprises the following steps: constructing a time information knowledge base; according to the time expression templates in the time information knowledge base, extracting time phrases, prepositions and temporal locality nouns from the text to be identified, and automatically mapping the time phrases into absolute time expressions in sequence; judging the semantic types of the time phrases by using a time semantic modeling algorithm; and outputting a time semantic model quintuple TSM=(AT, RTP, PP, PD, ST). Compared with the prior art, the method supplements the time information of knowledge tuples in a knowledge graph, solves the problem of time information mapping, and forms a high-quality time information base, among other advantages.
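
The template-driven extraction step can be sketched in Python as follows. This is a hedged illustration only: the template list is a tiny hypothetical stand-in for the time information knowledge base, and the preposition list, the locality noun list, and the names `TimeMention` and `extract_mentions` are assumptions. This excerpt does not define the five components of the TSM quintuple, so the sketch stops at extraction and does not attempt to build one.

```python
import re
from typing import List, NamedTuple, Optional

# Hypothetical stand-in for the time expression templates of the knowledge base.
# Longer templates come first so a full date is preferred over a partial one.
TIME_TEMPLATES = [
    r"\d{4}年\d{1,2}月\d{1,2}日",
    r"\d{1,2}月\d{1,2}日",
    r"前天|昨天|今天|明天|后天",
]
# Hypothetical word lists (assumptions, not taken from the patent).
PREPOSITIONS = ["自从", "从", "在", "到", "于"]
LOCALITY_NOUNS = ["之前", "之后", "以前", "以后", "以来", "前", "后"]


class TimeMention(NamedTuple):
    preposition: Optional[str]    # preposition directly before the phrase, if any
    phrase: str                   # the time phrase matched by a template
    locality_noun: Optional[str]  # locality noun directly after the phrase, if any
    start: int                    # character offset of the phrase in the text


_MENTION = re.compile(
    "(?P<prep>" + "|".join(PREPOSITIONS) + ")?"
    "(?P<time>" + "|".join(TIME_TEMPLATES) + ")"
    "(?P<loc>" + "|".join(LOCALITY_NOUNS) + ")?"
)


def extract_mentions(text: str) -> List[TimeMention]:
    """Return time mentions in the order they appear in the text."""
    return [
        TimeMention(m.group("prep"), m.group("time"), m.group("loc"), m.start("time"))
        for m in _MENTION.finditer(text)
    ]
```

For the sentence "他从2018年3月5日以来一直在上海,昨天回到北京。" this returns two mentions in text order: "2018年3月5日" with preposition "从" and locality noun "以来", then "昨天" with neither. Keeping the preposition and locality noun next to each phrase preserves the cues a later step would need to decide whether the phrase denotes a point in time, a starting boundary, or an ending boundary.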

Description

Technical field

[0001] The present invention relates to an information extraction method in the field of big data and natural language processing, in particular to a text time extraction and standardization method oriented to knowledge graph construction.

Background technique

[0002] Knowledge tuples, the basic elements of a knowledge graph, can be divided into commonsense knowledge and event-based knowledge. Event-based knowledge is strongly time-sensitive, and only when time information is integrated into a knowledge tuple can the tuple fully and accurately express the knowledge it contains. Event-based knowledge tuples extracted from news texts have particularly high timeliness requirements. Accurately grasping the temporal context in which knowledge holds and obtaining its time semantics is of great significance for improving knowledge tuples and building knowledge graphs.

[0003] Time exists objectively, but it needs to be described with the help of natural language. Temporal inform...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/367, G06F40/289, G06F40/30
Inventor: 向阳, 贾圣宾, 吕东东, 陈晓军
Owner: TONGJI UNIV