Entity relationship extracting method of Zang language

An entity relationship, Tibetan language technology, applied in special data processing applications, instruments, electrical and digital data processing, etc., can solve problems such as structural representation without knowledge, inability to achieve in-depth information mining, and lack of comprehensive and accurate relevant information. , to achieve the effect of improving the accuracy

Active Publication Date: 2015-07-29
MINZU UNIVERSITY OF CHINA
View PDF4 Cites 58 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In Tibetan, usually called (Dalai Lama) for (The Dalai Lama), while current search engines do not show the relationship between the two
Moreover, all search results are mainly displayed in text containing keywords, without knowledge structure representation
Therefore, we cannot get comprehensive and accurate relevant information, let alone realize the in-depth mining of information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity relationship extracting method of Zang language
  • Entity relationship extracting method of Zang language
  • Entity relationship extracting method of Zang language

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

[0022] The invention establishes a Tibetan entity relationship classification model through the lexical semantic features of the Tibetan entity relationship and the sentence feature vector representation, so as to realize the extraction of the Tibetan entity relationship.

[0023] figure 1 It is a flow chart of the Tibetan entity relationship extraction method of the present invention, as shown in the figure, the method includes the following steps:

[0024] Step 101, extract training corpus.

[0025] Specifically, the training corpus is extracted from the Tibetan-Chinese text corpus information.

[0026] A text corpus of 5,000 sentences annotated with semantic roles in Tibetan is derived from the Minority Language Sub-Center of the National Language Resources Monitoring and Research Center. The corpus is processed t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an entity relationship extracting method of the Zang language. The method comprises the following steps: extracting training linguistic data from the Zang-Chinese text linguistic data information; constructing a Zang word vector model; acquiring an entity relationship characteristic vector from the Zang word vector model; using the entity relationship characteristic vector as an input to construct an entity relationship classification model based on a neural network; and applying multiple layers of characteristic extractions to the entity relationship characteristic vector, thereby finally acquiring a Zang language entity relationship classification. The extraction of the Zang language entity relationship is achieved by constructing the Zang word vector model, researching and solving lexical semantic characteristics and sentence characteristic vector expression methods of the Zang language entity relationship, and further constructing the Zang language entity relationship classification model, accordingly increasing the accuracy in the Zang language entity relationship classification, and providing technical supports and services to the researches in the fields of the Zang language knowledge mapping, question-answering system, information extraction, information search, and the like.

Description

technical field [0001] The invention relates to a method for extracting a Tibetan entity relationship, in particular to a method for extracting a Tibetan entity relationship based on a word vector. Background technique [0002] With the rapid popularization of the Internet, especially the rapid increase of Internet users in developing countries, the number of non-English text resources on the Internet has increased rapidly, and its growth rate has far exceeded the speed of 10 years ago. Published in multiple languages. According to a survey by the Minority Language Sub-Center of the National Language Resources Monitoring and Research Center of Minzu University of China: by the end of December 2011, the total number of websites in the mainland’s minority languages ​​was about 1,250, including 840 websites in Uyghur and 146 in Tibetan. and 136 Mongolian websites. "Compared with the growth rate of Internet users nationwide, the growth rate of Internet users of ethnic minoriti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
Inventor 孙媛
Owner MINZU UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products