International organization science and technology term topic sentence extraction method driven by multiple machine translation engines

A translation engine and technology of technical terminology, applied in the field of topic sentence extraction of scientific and technical terms in international organizations, can solve problems such as intractability, difficulty in cross-language knowledge discovery and monitoring, and plummeting accuracy

Pending Publication Date: 2021-11-09
TIANJIN NORMAL UNIVERSITY
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing machine translation is mainly based on statistical machine translation. Generally, it can translate a single sentence in a general field with high accuracy. However, there are big problems in the translation of professional knowledge such as terminology. Most of them only translate the superficial meaning of the corresponding words. The overall information of the sentence is missing, resulting in insufficient translation accuracy for professional documents
This is also one of the main difficulties of existing machine translation engines, which are difficult to apply to cross-language knowl

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • International organization science and technology term topic sentence extraction method driven by multiple machine translation engines
  • International organization science and technology term topic sentence extraction method driven by multiple machine translation engines
  • International organization science and technology term topic sentence extraction method driven by multiple machine translation engines

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0072] The present invention provides a method for extracting topic sentences of scientific and technological terms of international organizations driven by multiple machine translation engines. The third is to segment and block to compare text similarity, and judge the fluency and syntactic compliance of a single sentence; the fourth is to extract and output the best translation. It mainly includes the following steps:

[0073] Step A, call a variety of machine translation engines to translate the input content, and output the results by sentence to form the original translation;

[0074] Specifically include the following steps:

[0075] A01, input the target content extracted from various documents, including papers, patents, and scientific reports, into the machine translation system. The input content includes but not limited to text, voice, documents, and images;

[0076] A02, with the sentence as the basic unit, the machine translation service engine will convert the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an international organization science and technology term topic sentence extraction method based on a multi-machine translation engine, and relates to the technical field of information science and knowledge engineering. According to the method, on the basis of a cross-language term knowledge base, by calling various machine translation engines and adopting a natural language chunk processing technology, the technical steps of automatic term sentence recognition, topic statement chunk analysis, translation connection and fusion and the like of the cross-language science and technology text are designed. Therefore, term knowledge point rapid identification, subject sentence automatic detection and high-quality translation fusion generation are carried out on international organization science and technology knowledge, so that subject sentence extraction accuracy and fluency are guaranteed, finally science and technology information issued by an international organization is dynamically monitored, and the knowledge processing requirement of a user for cross-language professional science and technology knowledge is met.

Description

technical field [0001] The invention relates to the technical fields of information science and knowledge engineering, in particular to a method for extracting topic sentences of scientific and technological terms of international organizations driven by multiple machine translation engines. Background technique [0002] Machine translation, that is, translating text in one language into another by computer, has become one of the important methods to solve the multilingual barrier at present. As early as 2013, Google Translate provided translation services up to one billion times a day, which is equivalent to the annual human translation volume in the world, and the number of words processed is equivalent to one million books. [0003] International organizations have published a large number of professional knowledge documents, which are highly authoritative and rich in knowledge content. However, due to language barriers, it is difficult for domestic users to quickly under...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/58G06F40/211G06F40/284G06F40/30
CPCG06F40/58G06F40/30G06F40/284G06F40/211
Inventor 宋培彦鞠佳辰冯超慧
Owner TIANJIN NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products