A method for extracting semantically similar and grammatically regular sentences from academic literature

A technique for retrieving semantically similar sentences, applied in the computer field. It addresses the difficulty of expressing ideas professionally and of judging whether a sentence follows grammatical norms, saving time and effort.

Active Publication Date: 2018-07-10
南京中智腾飞航空科技研究院有限公司

AI Technical Summary

Problems solved by technology

This method addresses the difficulty that researchers and students who are non-native English speakers face in expressing their ideas professionally, or in judging whether a sentence is grammatically standard, when writing scientific and technical documents.

Examples

Embodiment 1

[0068] The journal ACS Nano, published by the American Chemical Society (ACS), is used as the database source. The database contains a portion of the electronic papers published in ACS Nano and is used with the authorization of the publisher. The first author's country and the keywords are extracted from every paper in the database. For example, the author of the document "Rational Design of Hybrid Graphene Films for High-Performance Transparent Electrodes" is from Rice University in the United States, and the keywords are: graphene, transparent electrode, metal grid, flexible. Since the author is from the United States, the English proficiency weight of the author of this document is set to 1, i.e. Q_c = 1. The abstracts and body text of all documents in the database are then extracted and divided into sentences, and the main components of each sentence are extracted. The main components of a sentence refer to extracting the subject, predicate, object, attributive and adverbial of the sentence as the main grammatical components of ...
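The preprocessing described in this embodiment (splitting document text into sentences and attaching the author weight Q_c to each one) can be sketched roughly as follows. This is only an illustrative sketch: the `SentenceRecord` type, the function names, the regex-based splitter and the sample body text are all assumptions and do not come from the patent. Extraction of subject, predicate, object, attributive and adverbial would need a syntactic parser and is not attempted here.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class SentenceRecord:
    text: str     # one sentence taken from an abstract or body text
    source: str   # title of the document the sentence came from
    q_c: float    # English proficiency weight of the first author's country


def split_sentences(text: str) -> List[str]:
    # Naive splitter (assumption): break on whitespace that follows
    # ., ! or ? and precedes an uppercase letter.
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [p for p in parts if p]


def build_records(title: str, body: str, q_c: float) -> List[SentenceRecord]:
    # Every sentence inherits the weight assigned to its document.
    return [SentenceRecord(s, title, q_c) for s in split_sentences(body)]


# Hypothetical body text; only the title and Q_c = 1 come from the embodiment.
records = build_records(
    "Rational Design of Hybrid Graphene Films for "
    "High-Performance Transparent Electrodes",
    "Graphene films are flexible. They can serve as transparent electrodes.",
    q_c=1.0,
)
```

A production system would likely replace the regex with a proper sentence tokenizer, since abbreviations such as "et al." or "Fig." defeat this simple rule.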

Embodiment 2

[0072] Optical Engineering journals indexed by the Optical Society of America (OSA) are used as the database, which includes a portion of the electronic papers published in those journals. The first author's country and the keywords are extracted from every paper in the database. For example, the author of the document "Two-color infrared counter-countermeasure based on the signal ratio between two detection bands for a crossed-array tracker" is from Pukyong National University in South Korea. The keywords are: infrared seeker; two-color counter-countermeasure; crossed-array tracker. The author's English proficiency weight is set to Q_c = 0.5. The author of the document "Countermeasure effectiveness against a man-portable air-defense system containing a two-color spinscan infrared seeker" is from Cranfield University in the United States. The keywords are: man-portable air-defense; simulation; infrared; electro-optics; countermeasures. The English proficiency weight of th...
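The country-based weighting used in the embodiments can be illustrated with a simple lookup table. The sketch below is an assumption about how such a table might be organized: only the two values shown (1 for the United States in Embodiment 1, 0.5 for South Korea in Embodiment 2) appear in the text, while the fallback value and the names `COUNTRY_WEIGHTS`, `DEFAULT_WEIGHT` and `english_proficiency_weight` are hypothetical.

```python
# Weights taken from the embodiments; any other entry would be an assumption.
COUNTRY_WEIGHTS = {
    "United States": 1.0,  # value used in Embodiment 1
    "South Korea": 0.5,    # value used in Embodiment 2
}

# Assumption: countries that are not listed fall back to this value.
DEFAULT_WEIGHT = 0.5


def english_proficiency_weight(country: str) -> float:
    """Return Q_c for the first author's country."""
    return COUNTRY_WEIGHTS.get(country.strip(), DEFAULT_WEIGHT)
```

Keeping the weights in a single table makes it straightforward to adjust them per country without touching the matching logic.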

Abstract

The invention discloses a method for extracting sentences with similar meanings and standard grammar from academic documents. The method comprises the following steps: first, the database to be searched is defined as a collection of published academic documents; then, similarity is calculated based on the field keywords of the sentence input by the user, the countries of the documents' authors, and the main components of the input sentence and of the sentences in the database; finally, sentences with higher similarity are extracted according to the weights of the defined elements and presented to the user for reference, together with their document sources, so that the user obtains sentences with the appropriate meaning. The method allows authors who are non-native English speakers to quickly obtain reference sentences with standard expression when writing scientific papers in English.
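One way to picture the weighted ranking described in the abstract is the sketch below. It is not the patent's formula, which is not given in this excerpt. The Jaccard overlap measure, the linear combination, the weight values `w_kw`, `w_country` and `w_comp`, the dictionary layout and the sample candidates are all assumptions chosen for illustration.

```python
def jaccard(a, b):
    # Overlap of two term collections, from 0.0 (disjoint) to 1.0 (identical).
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score(query_keywords, query_components, doc,
          w_kw=0.3, w_country=0.2, w_comp=0.5):
    # Assumed linear combination of the three elements named in the abstract:
    # field keywords, author country weight Q_c, and main sentence components.
    kw = jaccard(query_keywords, doc["keywords"])
    comp = jaccard(query_components, doc["components"])
    return w_kw * kw + w_country * doc["q_c"] + w_comp * comp


def top_matches(query_keywords, query_components, docs, k=3):
    # Return the k best sentences together with their document sources.
    ranked = sorted(
        docs,
        key=lambda d: score(query_keywords, query_components, d),
        reverse=True,
    )
    return [(d["sentence"], d["source"]) for d in ranked[:k]]


# Hypothetical candidate sentences with pre-extracted components.
candidates = [
    {"sentence": "Graphene films serve as transparent electrodes.",
     "source": "Paper A", "q_c": 1.0,
     "keywords": ["graphene", "transparent electrode"],
     "components": ["graphene films", "serve", "transparent electrodes"]},
    {"sentence": "The seeker tracks infrared targets.",
     "source": "Paper B", "q_c": 0.5,
     "keywords": ["infrared seeker"],
     "components": ["seeker", "tracks", "infrared targets"]},
]

best = top_matches(
    ["graphene", "flexible"],
    ["graphene films", "act", "transparent electrodes"],
    candidates,
    k=1,
)
```

Returning the source alongside each sentence mirrors the abstract's requirement that the user receives both the reference sentence and the document it came from.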

Description

Technical field

[0001] The invention belongs to the technical field of computers, and in particular relates to a method for matching English sentences, especially a method for matching sentences with standard grammar.

Background technique

[0002] Sentence similarity has important application value in bilingual translation, automatic question answering, plagiarism checking and other fields. There are many calculation methods for sentence similarity, and different application fields have different emphases. Some focus on the degree of matching of the surface content of the text, as in plagiarism checking of papers; others focus on the similarity of the underlying semantics of sentences, as in bilingual translation and automatic question answering. Take the plagiarism check of papers as an example for a brief explanation: the databases included in the plagiarism check of papers are mainly published documents, patents, works, web pages, etc., and must include all liter...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/27
Inventor: 孙维国, 李墨
Owner: 南京中智腾飞航空科技研究院有限公司