Method for extracting sentences with similar meanings and standard grammar from academic documents

A technology for finding semantically similar sentences in academic literature, applied in the computer field. It addresses the difficulty of expressing ideas professionally and of judging whether a sentence is grammatically standard, saving the writer time and energy.

Active Publication Date: 2016-06-15
南京中智腾飞航空科技研究院有限公司

AI Technical Summary

Problems solved by technology

This method addresses the problem that non-native English-speaking researchers and students find it difficult to express their ideas professionally, or to judge whether a sentence follows grammatical norms, when writing scientific and technical documents.



Examples


Embodiment 1

[0068] The journal ACS Nano, published by the American Chemical Society (ACS), is used as the database source. The database contains some of the electronic papers published in ACS Nano and is used with the authorization of the ACS. The first author's country and the keywords are extracted from every paper in the database. For example, the author of the document "Rational Design of Hybrid Graphene Films for High-Performance Transparent Electrodes" is from Rice University in the United States, and the keywords are: graphene, transparent electrode, metal grid, flexible. Since the author is from the United States, the English proficiency weight of this document's author is set to 1, i.e. Q_c = 1. The abstracts and body texts of all documents in the database are then extracted and divided into sentences, and the main components of each sentence are extracted. The main components of a sentence are its subject, predicate, object, attributive and adverbial, which are extracted as the main grammatical components of the sentenc...
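The indexing step described in this embodiment can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the record layout, the regex-based sentence splitter and the default weight for unlisted countries are all assumptions. Only the two weight values (United States: 1, South Korea: 0.5) appear in the embodiments.

```python
import re

# Weights taken from the embodiments; any other country falls back to
# DEFAULT_WEIGHT, which is an assumption made for this sketch.
COUNTRY_WEIGHT = {"United States": 1.0, "South Korea": 0.5}
DEFAULT_WEIGHT = 0.5


def proficiency_weight(country):
    """Return the English proficiency weight Q_c for the first author's country."""
    return COUNTRY_WEIGHT.get(country, DEFAULT_WEIGHT)


def split_sentences(text):
    """Naive sentence splitter (assumption): break after '.', '!' or '?'
    when followed by whitespace and a capital letter."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [p for p in parts if p]


def build_records(doc):
    """Turn one document into per-sentence records that keep the source,
    the keywords and the author's weight alongside each sentence."""
    q_c = proficiency_weight(doc["country"])
    return [
        {"sentence": s, "keywords": doc["keywords"], "q_c": q_c, "source": doc["title"]}
        for s in split_sentences(doc["text"])
    ]


if __name__ == "__main__":
    doc = {
        "title": "Rational Design of Hybrid Graphene Films for "
                 "High-Performance Transparent Electrodes",
        "country": "United States",
        "keywords": ["graphene", "transparent electrode", "metal grid", "flexible"],
        # Placeholder text written for this example, not quoted from the paper.
        "text": "Graphene films are flexible. They can serve as electrodes.",
    }
    for record in build_records(doc):
        print(record["q_c"], record["sentence"])
```

Extracting the main grammatical components of each sentence would need a syntactic parser and is left out of this sketch.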

Embodiment 2

[0072] The journal Optical Engineering, as included by the Optical Society of America (OSA), is used as the database, which contains some of the electronic papers published in Optical Engineering. The first author's country and the keywords are extracted from every paper in the database. For example, the author of the document "Two-color infrared counter-countermeasure based on the signal ratio between two detection bands for a crossed-array tracker" is from Pukyong National University in South Korea, and the keywords are: infrared seeker; two-color counter-countermeasure; crossed-array tracker. The English proficiency weight of this author is set to Q_c = 0.5. The author of the document "Countermeasure effectiveness against a man-portable air-defense system containing a two-color spinscan infrared seeker" is from Cranfield University in the United States, and the keywords are: man-portable air-defense; simulation; infrared; electro-optics; countermeasures. The English proficiency weight of the author of this document is set to...
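The keyword lists in the two embodiments use different separators (commas in one, semicolons in the other), so a keyword comparison needs a normalization step first. The sketch below shows one way to do that and to compare two keyword sets. The function names are hypothetical, and the Jaccard index is used here as a stand-in measure; the source does not state which keyword similarity formula the method uses.

```python
import re


def parse_keywords(raw):
    """Split a raw keyword string on ';' or ',' and normalize each entry
    to lower case with surrounding whitespace removed."""
    return {k.strip().lower() for k in re.split(r"[;,]", raw) if k.strip()}


def keyword_overlap(a, b):
    """Jaccard index of two keyword sets (an assumed measure, see lead-in)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    tracker_paper = parse_keywords(
        "infrared seeker; two-color counter-countermeasure; crossed-array tracker"
    )
    manpads_paper = parse_keywords(
        "man-portable air-defense; simulation; infrared; electro-optics; countermeasures"
    )
    # Hypothetical field keywords supplied by a user for their own sentence.
    user_keywords = parse_keywords("infrared, simulation")

    print(keyword_overlap(user_keywords, tracker_paper))
    print(keyword_overlap(user_keywords, manpads_paper))
```

Exact string matching treats "infrared seeker" and "infrared" as different keywords, so a practical system might also want stemming or phrase-level matching.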



Abstract

The invention discloses a method for extracting sentences with similar meanings and standard grammar from academic documents. The method comprises the following steps. First, the database to be searched is defined as a collection of published academic documents. Next, similarity is calculated between the sentence input by the user and the sentences in the database, based on the field keywords of the input sentence, the countries of the documents' authors, and the main components of the input sentence and of the database sentences. Finally, using the weights defined for these elements, the sentences with higher similarity are extracted for the user's reference, giving the user sentences with the appropriate meaning together with their document sources. The method lets non-native English authors quickly obtain reference sentences with standard expressions when writing scientific documents.
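As a rough illustration of the ranking step, the sketch below combines the three elements named in the abstract (field keywords, author country weight, main sentence components) into one weighted score and returns the best matches with their sources. Everything specific here is an assumption: the linear combination, the weights w_kw, w_comp and w_qc, the Jaccard set overlap, and the Candidate structure are illustrative choices, not the formula defined in the patent.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    sentence: str
    source: str            # title of the document the sentence came from
    keywords: frozenset    # keywords of that document
    components: frozenset  # main components of the sentence, already extracted
    q_c: float             # English proficiency weight of the first author


def jaccard(a, b):
    """Set overlap in [0, 1]; stands in for the patent's similarity measures."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def score(query_keywords, query_components, cand, w_kw=0.3, w_comp=0.5, w_qc=0.2):
    """Weighted sum of keyword overlap, component overlap and author weight.
    The three weights are placeholders chosen for this sketch."""
    return (
        w_kw * jaccard(query_keywords, cand.keywords)
        + w_comp * jaccard(query_components, cand.components)
        + w_qc * cand.q_c
    )


def top_matches(query_keywords, query_components, candidates, n=3):
    """Return the n highest-scoring (sentence, source) pairs."""
    ranked = sorted(
        candidates,
        key=lambda c: score(query_keywords, query_components, c),
        reverse=True,
    )
    return [(c.sentence, c.source) for c in ranked[:n]]


if __name__ == "__main__":
    # Toy data invented for this example.
    candidates = [
        Candidate("The film shows high conductivity.", "Paper A",
                  frozenset({"graphene"}),
                  frozenset({"film", "show", "conductivity"}), 1.0),
        Candidate("The tracker uses two detectors.", "Paper B",
                  frozenset({"infrared"}),
                  frozenset({"tracker", "use", "detector"}), 0.5),
    ]
    query_kw = frozenset({"graphene"})
    query_comp = frozenset({"film", "show", "transparency"})
    for sentence, source in top_matches(query_kw, query_comp, candidates):
        print(source, "|", sentence)
```

Returning the source title with each sentence mirrors the abstract's requirement that the user sees where each reference sentence came from.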

Description

Technical field

[0001] The invention belongs to the technical field of computers, and in particular relates to a method for matching English sentences, especially a method for matching sentences with standard grammar.

Background technique

[0002] Sentence similarity has important application value in bilingual translation, automatic question answering, plagiarism checking and other fields. There are many methods for calculating sentence similarity, and different application fields place the emphasis differently. Some focus on how closely the surface content of the text matches, as in plagiarism checking of papers; others focus on the similarity of the underlying semantics of the sentences, as in bilingual translation and automatic question answering. Take plagiarism checking of papers as an example for a brief explanation: the databases included in the plagiarism check of papers are mainly published documents, patents, works, web pages, etc., and must include all liter...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/27
Inventor: 孙维国, 李墨
Owner: 南京中智腾飞航空科技研究院有限公司