Method and system for translating integral patent documents

A full-text patent document translation technology, applied in the field of machine translation, that addresses the inconsistent translation of complex noun phrases and the time-consuming, labor-intensive repeated analysis of the same phrase.

Active Publication Date: 2014-01-01
中国专利信息中心

AI Technical Summary

Problems solved by technology

[0004] In prior-art translation, complex noun phrases are therefore not translated consistently, and the same phrase is analyzed and translated repeatedly, which is time-consuming and laborious.



Examples


Example 1

[0025] [Example 1] A system for controlling the flight of an aircraft, characterized in that,...

[0026] "A kind of" (rendered as the article "A" in the example above) and "characterized by" (rendered as "characterized in that") serve as the initial boundary markers. The template {kind of}+{phrase A}+{characterized by} then extracts the phrase between them, "a system for controlling the flight of an aircraft".
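The template in [0026] can be illustrated with a small regular-expression sketch. This is only an illustration under assumptions: the English marker strings, the function name `extract_by_template`, and the decision to return the text between the markers (without the leading article) are hypothetical and not taken from the patent, which operates on the original-language markers.

```python
import re

# Hypothetical English renderings of the boundary markers (assumption).
START_MARKERS = ("A kind of", "An", "A")
END_MARKERS = ("characterized in that", "characterized by")


def extract_by_template(sentence, starts=START_MARKERS, ends=END_MARKERS):
    """Return the text between a start marker and an end marker, or None."""
    # Try longer start markers first so "A kind of" wins over "A".
    start_alt = "|".join(re.escape(s) for s in sorted(starts, key=len, reverse=True))
    end_alt = "|".join(re.escape(e) for e in ends)
    pattern = re.compile(
        rf"^\s*(?:{start_alt})\s+(?P<phrase>.+?)\s*,?\s*(?:{end_alt})",
        re.IGNORECASE,
    )
    match = pattern.search(sentence)
    return match.group("phrase") if match else None


claim = "A system for controlling the flight of an aircraft, characterized in that,..."
print(extract_by_template(claim))
```

The lazy `.+?` group stops at the first end marker, so the phrase does not swallow the characterizing part of the claim.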

[0027] Phrases can also be extracted by rule, that is, by combining part-of-speech (POS) tagging features with prefixes and suffixes. An example of a written rule: (-1)CAT(V)+(0) CAT[N]+(1)Suffix→NP[0,1].

[0028] [Example 2] ...Provide part-of-speech tagging method

[0029] Here the suffix is "method", and the part-of-speech tagging result is: provide/v part-of-speech/n tagging/nv method/n.

[0030] Combining the suffix "method" with "part-of-speech/n tagging/nv" yields the phrase "part-of-speech tagging method".
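The rule-based extraction of Example 2 might be sketched as follows. The reading of the rule used here is an assumption: a verb-tagged token, followed by one or more noun-tagged tokens (tags beginning with "n"), followed by a suffix word, yields a noun phrase covering the nouns and the suffix. The suffix list and the function names `parse_tagged` and `extract_np` are hypothetical.

```python
# Hypothetical suffix list; the patent does not enumerate one here (assumption).
SUFFIXES = {"method", "system", "device"}


def parse_tagged(text):
    """Split 'word/tag word/tag ...' into (word, tag) pairs."""
    tokens = []
    for item in text.split():
        word, _, tag = item.rpartition("/")
        tokens.append((word, tag))
    return tokens


def extract_np(tokens, suffixes=SUFFIXES):
    """Collect noun phrases of the form: verb, noun run, suffix word."""
    phrases = []
    for i, (_, tag) in enumerate(tokens):
        if not tag.startswith("v"):
            continue  # the token before the phrase must be a verb
        j = i + 1
        # Consume noun-category tokens until a suffix word is reached.
        while j < len(tokens) and tokens[j][1].startswith("n") and tokens[j][0] not in suffixes:
            j += 1
        if j > i + 1 and j < len(tokens) and tokens[j][0] in suffixes:
            phrases.append(" ".join(word for word, _ in tokens[i + 1 : j + 1]))
    return phrases


tagged = parse_tagged("provide/v part-of-speech/n tagging/nv method/n")
print(extract_np(tagged))  # ['part-of-speech tagging method']
```

The verb itself is used only as a left boundary and is not included in the extracted phrase.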

[0031] The phrase...


Abstract

The invention discloses a method and a system for translating full-text patent documents. The method comprises: acquiring candidate phrases by a template-based process, a rule-based process, or a weight-based process; refining the candidates by a phrase frequency process, a modified phrase frequency process, a memory reference process, or the like, to obtain the recognized noun phrases (RNPs); marking the RNPs in the full text, translating each RNP, and storing the relevant information in a phrase memory; translating the full text sentence by sentence, where phrases marked as RNPs are not expanded for analysis and their translations are fetched directly from the phrase memory; and, once the full text has been translated, outputting the translated text in order according to the title information of the original text. Because the complex noun phrases commonly used in patent documents are acquired in advance, the analysis time for sentences containing them is shortened, translation speed increases, and these phrases are translated consistently.
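The phrase-memory step described in the abstract can be sketched roughly as below. Everything here is an assumption made for illustration: the placeholder scheme, the function name `translate_document`, and the two callables standing in for the phrase and sentence translators are hypothetical, and the patent's actual marking format is not shown in this excerpt.

```python
def translate_document(sentences, rnps, translate_phrase, translate_sentence):
    """Translate each RNP once, then reuse it while translating sentences."""
    # Phrase memory: source phrase -> translation, filled once per phrase.
    memory = {}
    for phrase in rnps:
        if phrase not in memory:
            memory[phrase] = translate_phrase(phrase)

    outputs = []
    for sentence in sentences:
        marked = sentence
        slots = {}
        # Mark longer phrases first so nested phrases are not split.
        for k, phrase in enumerate(sorted(memory, key=len, reverse=True)):
            if phrase in marked:
                tag = f"__RNP{k}__"
                marked = marked.replace(phrase, tag)
                slots[tag] = memory[phrase]
        # The sentence translator sees an opaque tag, not the phrase itself.
        translated = translate_sentence(marked)
        for tag, target in slots.items():
            translated = translated.replace(tag, target)
        outputs.append(translated)
    return outputs, memory
```

In this sketch the sentence translator must pass the placeholder tags through unchanged. Consistency follows because every occurrence of a marked phrase is filled from the same memory entry.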

Description

Technical field

[0001] The invention relates to machine translation technology, in particular to a machine translation method and a translation system for full-text patent documents.

Background

[0002] Machine translation is the use of computers to translate text from one natural language to another. Research approaches fall into rule-based methods and statistical methods. Rule-based systems have a long development cycle and demand considerable funding and manpower, so their progress is slow. Statistical methods, by comparison, offer a short development cycle and handle large-scale corpora easily. Among statistical machine translation methods, phrase-based translation is well developed. At present, however, when translating specialized text such as patent documents, longer phrases are often divided into several shorter phrases for translation. For example, "The ultra-low temperature he...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/28, G06F17/27
Inventor: 任智军, 李进, 蒋宏飞, 杨婧
Owner: 中国专利信息中心