Semantics-based sci-tech information processing method and system

A processing method and technology, applied in the field of data processing, can solve problems such as low efficiency and accuracy, and achieve the effect of solving information overload, eliminating obstacles to content understanding, and improving acquisition accuracy.

Inactive Publication Date: 2017-04-19
THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP
View PDF8 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Therefore, the technical problem to be solved by the embodiments of the present invention is that the collection, processing and analysis of scientific and technological information in the prior art is mainly manual, and the efficiency and accuracy are not high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantics-based sci-tech information processing method and system
  • Semantics-based sci-tech information processing method and system
  • Semantics-based sci-tech information processing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0064] This embodiment provides a semantic-based method for processing scientific and technological information, which is especially suitable for intelligent retrieval and analysis of scientific and technological information, such as figure 1 As shown, the semantic-based scientific and technological information processing method includes the following steps:

[0065] S1. Acquiring website data. The website data may include various contents, which mainly includes webpage content of the website.

[0066] S2. According to the Chinese-English bilingual parallel corpus, the above-mentioned website data is translated into Chinese / English through a decoding algorithm. The Chinese / English translation can be from Chinese to English, or from English to Chinese. The Chinese-English bilingual parallel corpus is a large-scale corpus containing a large number of Chinese-English sentence pairs, which is the basis for building a translation system. Through the steps of corpus cleaning, Chine...

Embodiment 2

[0094] Corresponding to Embodiment 1, this embodiment provides a semantic-based scientific and technological information processing system, such as figure 2 shown, including:

[0095] Obtaining module 1, used to obtain website data;

[0096] The translation module 2 is used to translate the above website data into Chinese / English through a decoding algorithm based on the Chinese-English bilingual parallel corpus;

[0097] The summary module 3 is used to generate a summary according to the translated website data;

[0098] A classification module 4, configured to classify according to the above summary and generate a classification label;

[0099] The storage module 5 is used to store the above-mentioned translated website data, abstracts and classification labels into the full-text search database, refer to as image 3 In the system structure diagram shown, the translation module, summary module, and classification module are respectively connected to the full-text search ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a semantics-based sci-tech information processing method and system, and belongs to the technical field of data processing. The method comprises the following steps: acquiring network data; according to a Chinese-English bilingual parallel corpus, translating the network data into Chinese / English by means of a decoding algorithm; generating an abstract according to the translated network data; performing classification according to the abstract, and generating a class tag; and storing the translated network data, the abstract and the class tag into a full-text retrieving database. According to the method and system disclosed by the present invention, by using technologies such as automatic search of sci-tech information, automatic abstracting of the sci-tech information and automatic classification of texts, sci-tech information related to scientific development, technical innovation and recent news can be automatically acquired by means of a public information channel from the Internet, so that acquisition accuracy is improved, the cross-language content understanding barrier is eliminated, the problem of information overload is solved, and the efficiency of reading and understanding information of the user is increased.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a semantic-based scientific and technological information processing method and system. Background technique [0002] Scientific and technological information refers to useful knowledge about scientific development, technological innovation, and the latest developments obtained through public information channels. The collection of scientific and technological information has always been highly valued by countries all over the world, because scientific and technological information work shoulders important responsibilities in all aspects of scientific research and production at home and abroad. The basis of science and technology information research is the collection and analysis of information resources. With the development of computer information technology, computer information retrieval system provides a favorable platform for scientific and technological informati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/345G06F16/35G06F16/951G06F40/289
Inventor 袁林韩国辉贲兴龙陈晓琳梁增玉马旭冯燕来王睿苏雪阳黄明魁
Owner THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products