Cross-language knowledge graph-oriented Chinese-Thai entity alignment method

A knowledge graph and cross-language technology, which is applied in cross-language knowledge graphs and Chinese-Thai entity alignment for cross-language knowledge graphs, can solve the problems of low alignment accuracy and low alignment degree of non-famous entities, and reduce the difficulty of construction , improve the quality of alignment, and overcome the effect of low translation accuracy

Inactive Publication Date: 2020-09-29
北京天云信安科技有限公司
View PDF9 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a Chinese-Thai entity alignment method for cross-lingual knowledge graphs in order to solve the problem of low alignment accuracy of non-famous entities in bilingual sentences in the prior art during the construction of cross-language knowledge graphs
This method can achieve bilingual entity alignment more effectively and accurately, and solve the problem of low level of entity alignment in cross-language knowledge graph construction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language knowledge graph-oriented Chinese-Thai entity alignment method
  • Cross-language knowledge graph-oriented Chinese-Thai entity alignment method
  • Cross-language knowledge graph-oriented Chinese-Thai entity alignment method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0034] This example takes the Chinese-Thai bilingual dataset as an example, uses Python as the development language, and uses Pycharm software as the development environment.

[0035] refer to image 3 , a Chinese-Thai entity alignment method for cross-language knowledge graphs, including the following steps:

[0036] 1) Acquisition of bilingual datasets: Obtain Chinese-Thai bilingual alignment data from Wikidata, YAGO multilingual knowledge base or major Chinese-Thai bilingual websites. The datasets are all aligned Chinese-Thai bilingual sentences, and entities in Chinese sentences can be found in Find its relatively aligned entity in the Thai sentence, as shown in Table a, for example, the Chinese entity in the 1-A sentence in Chinese can find the aligned Thai entity in the 1-B sentence in Thai;

[0037] Table a Chinese-Thai alignment sentence data example

[0038]

[0039] 2) Construction and training of the machine translation model: construct the Transformer translat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-language knowledge graph-oriented Chinese-Thai entity alignment method, which is characterized by comprising the following steps of: (1) bilingual data set acquisition;(2) machine translation model constructing and training; (3) entity extraction; and (4) entity translation and matching. The method can achieve bilingual entity alignment more effectively and more accurately, and solves the problem that the entity alignment degree of cross-language knowledge graph construction is low at present.

Description

technical field [0001] The invention relates to the field of artificial intelligence, belongs to cross-language knowledge graph technology, and specifically relates to a Chinese-Thai entity alignment method for cross-language knowledge graphs. Background technique [0002] With the continuous development of artificial intelligence, knowledge is particularly important in all fields of artificial intelligence. In recent years, the construction of cross-lingual knowledge graphs has become a hot research field. Although there are more and more sentences about bilingual alignment on the Internet, due to the low degree of alignment of these entities, the accuracy of multilingual entity alignment is often not satisfactory, and the construction of cross-lingual knowledge graphs is therefore limited. [0003] Entities mainly include person names, place names, organization names, etc. Generally speaking, the more commonly used entity alignment method at this stage is to first perform...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/295G06F16/33G06F16/36G06N3/04G06N3/08
CPCG06F40/58G06F16/3344G06F16/367G06F40/295G06N3/08G06N3/048G06N3/045
Inventor 黄永忠吴辉文庄浩宇徐鑫宇张晨昊
Owner 北京天云信安科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products