Large data real-time storage method based on translation document

A big data and data technology, applied in the field of big data real-time storage based on translation files, can solve the problems of low storage speed and low efficiency, achieve fast storage speed, solve data storage, and improve the effect of data call speed

Inactive Publication Date: 2016-01-13
成都优译信息技术股份有限公司
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, directly storing data in the memory has a low storage speed; sequentially searching the hard disk data until the same content is found and called again, its efficiency is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0019] A method for real-time storage of big data based on translation files, comprising:

[0020] A. Obtain aligned corpus data;

[0021] B. Establishing a database index for aligning corpus data;

[0022] C. Distributed storage of the aligned corpus data according to the database index.

[0023] For example: the system receives a set of aligned corpus data, and submits the data to the back-end index server through the front-end page. Write to a different memory machine. Using this method, data integrity can be guaranteed, the average performance of sequential data writing can reach 7w / s, the average performance of random reading can reach 1.6w / s, and the concurrent reading and writing can reach 5K / s when reading and writing at a ratio of 1:10. up to 5W / s.

[0024] Example

[0025] On the basis of the above-mentioned big data real-time storage method based on translation files, this embodiment is optimized, that is, in step C, when the data is stored, the aligned corpus ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a large data real-time storage method based on a translation document. The method comprises: acquiring aligned corpus data; establishing a database index for the aligned corpus data; and according to the database index, carrying out distributed storage on the aligned corpus data. The method disclosed by the present invention is not only fast in storage speed but also high in calling efficiency.

Description

technical field [0001] The invention relates to the field of storage methods, in particular to a method for real-time storage of big data based on translation files. Background technique [0002] With the continuous advancement of science and technology, international exchanges are becoming more frequent, the world economy is becoming more and more open, globalization is getting deeper and deeper, and there are more and more translations between documents and materials in various languages, especially English and Chinese between. Translating documents involves all aspects of life: trade, law, electronics, communications, computers, machinery, chemicals, petroleum, medicine, food and other fields. [0003] Translation belongs to the service industry, and the service industry should always be customer-oriented. Today, when the amount of translation is increasing and the number of words in the document is increasing, how to improve the translation speed and meet the needs of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/901
Inventor 王榆升张马成王兴强
Owner 成都优译信息技术股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products