Tibetan Entity Relationship Extraction Method

An entity relationship extraction method in the field of Tibetan language technology, applied in instrumentation, computing, and electrical digital data processing. It addresses the lack of structured knowledge representation in search results, the inability to mine information in depth, and the inability to obtain comprehensive and accurate relevant information. Its effect is improved accuracy of Tibetan entity relationship classification.

Active Publication Date: 2018-08-07
MINZU UNIVERSITY OF CHINA
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

In Tibetan, (The Dalai Lama) is usually referred to as (Dalai Lama), but current search engines do not show the relationship between the two names.
Moreover, search results are mainly displayed as text containing the keywords, without any structured knowledge representation.
As a result, users cannot obtain comprehensive and accurate relevant information, let alone mine the information in depth.

Examples

Embodiment Construction

[0021] The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

[0022] The invention builds a Tibetan entity relationship classification model from the lexical semantic features of Tibetan entity relationships and a sentence feature vector representation, and uses this model to extract Tibetan entity relationships.
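As a rough illustration of how lexical features and a sentence feature vector might be combined into one input for such a classifier, here is a minimal sketch. It is not the patented method: the toy embeddings, the placeholder tokens, the mean-pooled sentence vector, and the names `WORD_VECTORS`, `lookup`, `mean_vector`, and `relation_features` are all assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical 2-dimensional toy embeddings for placeholder tokens.
# A real system would use vectors trained on Tibetan text.
WORD_VECTORS: Dict[str, List[float]] = {
    "tok_a": [1.0, 0.0],
    "tok_b": [0.0, 1.0],
    "tok_c": [1.0, 1.0],
}
DIM = 2


def lookup(token: str) -> List[float]:
    """Return the vector for a token, or a zero vector if it is unknown."""
    return WORD_VECTORS.get(token, [0.0] * DIM)


def mean_vector(tokens: List[str]) -> List[float]:
    """Average the word vectors of a sentence (an assumed sentence feature)."""
    if not tokens:
        return [0.0] * DIM
    vectors = [lookup(t) for t in tokens]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def relation_features(tokens: List[str], e1: int, e2: int) -> List[float]:
    """Concatenate the two entity vectors (lexical features) with the
    sentence vector to form a single feature vector."""
    return lookup(tokens[e1]) + lookup(tokens[e2]) + mean_vector(tokens)


# Entities are assumed to sit at token positions 0 and 2.
features = relation_features(["tok_a", "tok_b", "tok_c"], 0, 2)
```

Concatenation keeps the entity-level and sentence-level information in separate slots of the vector, so a downstream classifier can weight them independently.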

[0023] Figure 1 is a flow chart of the Tibetan entity relationship extraction method of the present invention. As shown in the figure, the method includes the following steps:

[0024] Step 101: extract the training corpus.

[0025] Specifically, the training corpus is extracted from the Tibetan-Chinese text corpus information.

[0026] A text corpus of 5,000 Tibetan sentences annotated with semantic roles is obtained from the Minority Language Sub-Center of the National Language Resources Monitoring and Research Center. The corpus is processed t...

Abstract

The invention relates to a method for extracting Tibetan entity relationships, which comprises the following steps: extracting a training corpus from Tibetan-Chinese text corpus information; constructing a Tibetan word vector model; obtaining entity relationship feature vectors through the Tibetan word vector model; and using the entity relationship feature vectors as input to build a neural-network-based entity relationship classification model, which performs multi-layer feature extraction on the feature vectors and finally outputs the Tibetan entity relationship classification. By establishing a Tibetan word vector model, the invention provides a way to represent the lexical semantic features and sentence feature vectors of Tibetan entity relationships. By establishing a Tibetan entity relationship classification model, it then extracts Tibetan entity relationships and improves the accuracy of Tibetan entity relationship classification. This provides technical support and services for research in areas such as Tibetan knowledge graphs, question answering systems, information extraction, and information retrieval.
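The classification stage described in the abstract can be pictured with a small feed-forward network. The sketch below is only an assumption about what "multi-layer feature extraction" could look like: the source does not specify the network architecture, so the layer sizes, the ReLU and softmax functions, the random untrained weights, and the labels in `RELATION_LABELS` are all hypothetical.

```python
import math
import random
from typing import List, Tuple

Layer = Tuple[List[List[float]], List[float]]  # (weights, bias)

# Hypothetical relation labels; the source does not list its label set.
RELATION_LABELS = ["relation_a", "relation_b", "no_relation"]


def init_layer(n_in: int, n_out: int, rng: random.Random) -> Layer:
    """Create a dense layer with small random weights and zero bias."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x: List[float], weights: List[List[float]], bias: List[float]) -> List[float]:
    """Compute weights @ x + bias for one input vector."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x: List[float]) -> List[float]:
    return [max(0.0, v) for v in x]


def softmax(x: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(x)
    exps = [math.exp(v - peak) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features: List[float], layers: List[Layer]) -> List[float]:
    """Pass a feature vector through the hidden layers, then softmax."""
    hidden = features
    for weights, bias in layers[:-1]:
        hidden = relu(dense(hidden, weights, bias))
    weights, bias = layers[-1]
    return softmax(dense(hidden, weights, bias))


rng = random.Random(0)
# Two hidden layers (6 -> 8 -> 4) followed by an output layer (4 -> 3).
layers = [init_layer(6, 8, rng), init_layer(8, 4, rng),
          init_layer(4, len(RELATION_LABELS), rng)]
probs = classify([1.0, 0.0, 1.0, 1.0, 0.5, 0.5], layers)
predicted = RELATION_LABELS[probs.index(max(probs))]
```

Because the weights here are random and untrained, `predicted` carries no meaning. The sketch only shows how a feature vector flows through stacked layers to a probability distribution over relation classes.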

Description

Technical field

[0001] The invention relates to a method for extracting Tibetan entity relationships, and in particular to a method for extracting Tibetan entity relationships based on word vectors.

Background technique

[0002] With the rapid popularization of the Internet, and especially the rapid increase of Internet users in developing countries, the number of non-English text resources on the Internet has grown rapidly, at a rate far exceeding that of 10 years ago, and content is published in multiple languages. According to a survey by the Minority Language Sub-Center of the National Language Resources Monitoring and Research Center of Minzu University of China, by the end of December 2011 the total number of websites in mainland China's minority languages was about 1,250, including 840 websites in Uyghur, 146 in Tibetan, and 136 in Mongolian. "Compared with the growth rate of Internet users nationwide, the growth rate of Internet users of ethnic minoriti...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30, G06F17/27
Inventor: 孙媛
Owner: MINZU UNIVERSITY OF CHINA