Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

58 results about "Plagiarism detection" patented technology

Plagiarism detection is the process of locating instances of plagiarism within a work or document. The widespread use of computers and the advent of the Internet have made it easier to plagiarize the work of others.

Software similarity detection method based on dynamic control flow graph sequence birthmark

ActiveCN108830049AAvoid lack of source codeAvoid the difficult problem of reverse disassemblyProgram/content distribution protectionGraph sequencePlagiarism detection
The invention discloses a software similarity detection method based on dynamic control flow graph sequence birthmark. The method comprises the following steps: firstly assembling a starting address of a basic block in the plug-in program record program execution process and a branch hopping address at the ending of the basic block under a dynamic plug-in platform DynamoRIO; and then analyzing a log file, constructing a program dynamic control flow graph, and endowing the weight; establishing a weight sequence birthmark set WSB, and serving the length ratio of the WSB as parameter to compute the similarity of each pair of programs. By adopting the dynamic plug-in analysis and extracting the feature of the software in operation, the problems that the source code is absent and the reverse disassembling is difficult in the software plagiarism detection can be avoided; only the basic block starting address and the branch hopping condition are recorded in the dynamic plug-in analysis, and the expenditure is less in comparison with the birthmark based on the dynamic data flow tracking and like technology; the influence by unrelated interference information in the dynamic operation can beresisted, and the program similarity can be detected even if the software encrypts by using an encryption shell.
Owner:SICHUAN UNIV +2

Thesis plagiarism detection method and system

The invention provides a thesis plagiarism detection method and system. The method comprises the following steps: recording materials by a comparison library; recording segmented words and corresponding word classes by a segmented word library; carrying out word segmentation by a word segmentation module; generating segmented word class characteristic values by a segmented word characteristic value generation module; determining segmented word free vector dimensions by a segmented word free vector dimension determination module; generating segmented word simplified vector dimensions by a segmented word simplified vector dimension generation module; generating segmented word characteristic vectors by a segmented word characteristic vector generation module; carrying out word segmentation on a to-be-authenticated document by a to-be-authenticated document word segmentation module so as to obtain a segmented word result; determining segmented word free vector dimensions by a to-be-authenticated document segmented word free vector dimension determination module; generating to-be-authenticated document segmented word simplified vector dimensions by a to-be-authenticated document segmented word simplified vector dimension generation module; generating to-be-authenticated document segmented word characteristic vectors by a to-be-authenticated document segmented word characteristic vector generation module; and carrying out similarity comparison.
Owner:湖南通远网络股份有限公司

Video plagiarism detection method and device, equipment and medium

The invention discloses a video plagiarism detection method, and the method comprises the steps: obtaining at least one base library video and query video, carrying out interval frame extraction to acquire a plurality of base library images and a plurality of query images; and inputting the plurality of base library images and the plurality of query images into a convolutional neural network for feature extraction to acquire base library video frame features and query video frame features; obtaining the similarity between each query video frame feature and each base library video frame feature, and taking the base library video frames of which the similarity is higher than a first preset threshold as neighbor matching frames; classifying the neighbor matching frames according to the codingidentifiers to generate at least one base library video frame set; selecting the base library video corresponding to the at least one base library video frame set as a candidate video; and forming avideo pair by the query video and each candidate video, and searching a suspected plagiarism fragment in each matched video pair through a network flow algorithm. In addition, the invention also provides a video plagiarism detection device, equipment and a medium.
Owner:深圳神目信息技术有限公司

Query generation method for source retrieval based on machine learning in plagiarism detection

The invention discloses a query generation method for source retrieval based on machine learning in plagiarism detection, relates to the technical field of information retrieval, in particular to a query generation technology in an information retrieval technology, and solves the problems of dependency on expert experience and lack of continuous improvement capability in a method for performing query generation by adopting a heuristic-based method in a source retrieval technology of the prior art. The method comprises the steps of obtaining a group of alternative query sets defined in the specification by adopting n existing query generation methods for a suspicious document fragment sk; sorting all alternative queries in the set to obtain a sorting list; and taking first m queries of the sorting list as queries, defined in the specification, of the suspicious document fragment sk. According to the method, an inherent research thought for the query generation method in the technical field of existing source retrieval is overcome, and a characteristic that different source retrieval methods have different source retrieval performances on the same suspicious document fragment is fully utilized.
Owner:HEILONGJIANG INST OF TECH

Software local plagiarism detection method based on dynamic instruction dependency graph birthmark

ActiveCN108399321ANot easy to confuse and destroyImproved ability to combat deep obfuscationProgram/content distribution protectionDynamic instrumentationPlagiarism detection
The present invention provides a software local plagiarism detection method based on the dynamic instruction dependency graph birthmark. The method comprises: 1) using dynamic instrumentation to perform instruction level monitoring on a to-be-analyzed program, and capturing an instruction trajectory of each function; 2) for a dynamic instruction trajectory recording each function, carrying out data dependency and control dependency analysis, and constructing a dynamic instruction dependency graph birthmark; 3) calculating the similarity between instruction dependency graph birthmarks, and implementing the measure of similarity between functions; 4) based on the given threshold, constructing a list of suspicious functions for each function in the plaintiff program; 5) extracting the staticfunction call graph of the program, and performing precise pairing of the suspicious functions under the guidance of the calling dependency; and 6) based on the calling dependency, assembling matchedfunction pairs to generate a plagiarism evidence map, and measuring the proportion of suspected plagiarism part. According to the method provided by the present invention, local plagiarism detection is implemented by constructing a function-level birthmark; and the concept of a plagiarism evidence map is proposed for the first time, and the effectiveness of the evidence can be greatly enhanced.
Owner:XIAN UNIV OF POSTS & TELECOMM

Plagiarism source retrieval sorting model construction method and plagiarism source retrieval sorting method

The invention provides a plagiarism source retrieval sorting model construction method and a plagiarism source retrieval sorting method. According to the plagiarism source retrieval sorting model construction method, training samples are utilized to train a predetermined sorting logic regression model through an order pair-based sorting learning manner on the basis of a degree of aggregation between each plagiarism source document of a reference document and the reference document until a value of a predetermined loss function is minimum, the predetermined loss function includes first and second sub-loss functions, the first sub-loss function represents a loss caused by sorting errors of order pairs formed on the basis of the plagiarism source documents and non-plagiarism source documentsof the reference document, and the second sub-loss function represents a loss caused by sorting errors of order pairs formed by plagiarism source documents with different degrees of aggregation. The plagiarism source retrieval sorting method utilizes the above obtained sorting model to resort retrieval results of suspicious documents. The above technology of the invention can more accurately sortthe source retrieval results of the suspicious documents in plagiarism detection.
Owner:HEILONGJIANG INST OF TECH

Cross-linguistic plagiarism detection method based on multiple features

The invention provides a cross-linguistic plagiarism detection method based on multiple features. The method comprises the steps of 1, corpus building; 2, translation feature building, wherein according to the europeanized phenomenon and the translation body problem which generally occur in translated articles, translation feature building is conducted, by means of feature selection, the featuresare cleaned and filtered to obtain the effective features, and noneffective features or the features with unapparent effects are filtered out; 3, feature selection, wherein the effective features areselected from the multiple features for classifier training, and then whether or not the cross-linguistic plagiarism problem exists in a certain article or multiple articles is classified; 4, based onplagiarism detection corresponding to the features, for Chinese features, accurate English feature corresponding is conducted, and according to the translation features and the structural features, plagiarism results are correspondingly filtered and generated, and through WordNet, final confirmation is conducted on the plagiarism results. By means of the method, the cross-linguistic plagiarism problem can be solved according to the multiple kinds of features mined from translation.
Owner:HARBIN ENG UNIV

Multi-language code plagiarism detection method based on pseudo twin network

PendingCN112394973ABreak through the limitations of not considering the structural characteristics of the codeBreakthroughs that are susceptible to redundant codeSoftware maintainance/managementNeural architecturesData packData set
The invention discloses a multi-language code plagiarism detection method based on a pseudo twin network, and the method comprises the steps: 1), obtaining basic data which comprises a pre-training data set and a multi-language code plagiarism detection training data set; 2) preprocessing the pre-training data set to obtain an accurate mark vector; 3) preprocessing the multi-language code plagiarism detection training data set to preliminarily judge whether the code is plagiarism or not; and 4) further judging whether the plagiarism exists in the multi-language code plagiarism detection training data set or not. According to the method, the limitation that code structure characteristics are not considered when codes are taken as texts to be processed in an existing multi-language code plagiarism detection method based on machine learning is broken through; in combination with structural characteristics of codes based on an abstract syntax tree, a convolutional neural network, a bidirectional long-short-term memory artificial neural network and a novel attention neural network are embedded into a pseudo twin network, so that multi-language code plagiarism detection is realized, andthe code plagiarism detection efficiency and precision are effectively improved.
Owner:SHANDONG UNIV OF TECH

Multi-thread program plagiarism detection method based on dynamic birthmarks and related equipment

The embodiment of the invention provides a multi-thread program plagiarism detection method based on dynamic birthmarks and related equipment. The method comprises the steps of inserting a custom function into a to-be-tested program by adopting a dynamic instrumentation technology to obtain a system call sequence; processing the system call sequence by utilizing a D-Kgram algorithm with a variableK value, and respectively generating a plurality of sub-sequences of which the gram lengths are different K values; performing single-thread screening on the plurality of sub-sequences to obtain a feature sub-sequence set; respectively constructing dynamic birthmarks of the original program and the suspicious program; converting the dynamic birthmarks into vectors, and obtaining the similarity between the original program and the suspicious program by using a cosine similarity method; and calculating the mean value of the similarity under multiple inputs, and obtaining a conclusion whether the suspicious program plagiarizes the original program or not according to the detection threshold. According to the method and the related equipment provided by the invention, the influence of threadinterleaving characteristics on the dynamic birthmarks can be effectively avoided, so that the plagiarism detection effect is better.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Test program plagiarism detection method based on test code fragment similarity

The invention relates to a test program plagiarism detection method based on test code fragment similarity. The test program plagiarism detection method comprises the following steps: for each to-be-tested method in a to-be-tested program, firstly, calculating a unique method identifier based on a class name, a method name and a parameter sequence; secondly, extracting all test code fragment setsfrom the test program, wherein each test fragment corresponds to one to-be-tested method; then, analyzing the similarity between the test fragments to obtain a similarity analysis report, and calculating a similarity value between the fragments; and finally, calculating the overall similarity degree value of the test programs by utilizing the similarity value of the test fragments, and judging theplagiarism condition between the test programs more accurately by utilizing the overall similarity degree value of the test programs. The test program plagiarism detection method aims to fill the blank of a test code similarity detection technology, and solves the problems of low precision of test code similarity analysis and low efficiency of test code plagiarism detection mainly depending on manual operation at present, thereby improving the efficiency and precision of test code similarity detection.
Owner:NANJING UNIV

A method and system for detecting plagiarism in papers

The invention provides a thesis plagiarism detection method and system. The method comprises the following steps: recording materials by a comparison library; recording segmented words and corresponding word classes by a segmented word library; carrying out word segmentation by a word segmentation module; generating segmented word class characteristic values by a segmented word characteristic value generation module; determining segmented word free vector dimensions by a segmented word free vector dimension determination module; generating segmented word simplified vector dimensions by a segmented word simplified vector dimension generation module; generating segmented word characteristic vectors by a segmented word characteristic vector generation module; carrying out word segmentation on a to-be-authenticated document by a to-be-authenticated document word segmentation module so as to obtain a segmented word result; determining segmented word free vector dimensions by a to-be-authenticated document segmented word free vector dimension determination module; generating to-be-authenticated document segmented word simplified vector dimensions by a to-be-authenticated document segmented word simplified vector dimension generation module; generating to-be-authenticated document segmented word characteristic vectors by a to-be-authenticated document segmented word characteristic vector generation module; and carrying out similarity comparison.
Owner:湖南通远网络股份有限公司

Formula plagiarism detection method and system

The invention provides a formula plagiarism detection method and system. According to the method and the system, a comparative library is used for recording materials; a segmented word library is used for recording segmented words and corresponding parts of speech; the segmented word library further includes a formula library; a word segmentation module is used for carrying out word segmentation; a segmented word eigenvalue generation module is used for generating eigenvalues of the parts of speech of the segmented words; a segmented word free-vector dimension determining module is used for determining the free-vector dimension of each segmented word; a segmented word simplified-vector dimension generation module is used for generating the simplified-vector dimension of each segmented word; a segmented word feature vector generation module is used for generating a feature vector of each segmented word; a to-be-identified document word segmentation module is used for carrying out word segmentation on a to-be-identified document to obtain a word segmentation result; a to-be-identified document segmented word free-vector dimension determining module is used for determining the free-vector dimension of each segmented word; a to-be-identified document segmented word simplified-vector dimension generation module is used for generating the simplified-vector dimension of each segmented word of the to-be-identified document; a to-be-identified document segmented word feature vector generation module is used for generating the feature vector of each segmented word of the to-be-identified document; and similarity comparison is carried out.
Owner:夏峰
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products