A method for detecting suspected code plagiarism based on a random forest model
A random forest model and code technology, applied to computer parts, character and pattern recognition, instruments, etc., can solve the problems of unproven accuracy, low discrimination, and unstable data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0070] In order to make the present invention more comprehensible, preferred embodiments are described in detail below with accompanying drawings.
[0071] A kind of code plagiarism suspicion detection method based on random forest model provided by the present invention, its specific implementation mode is as follows:
[0072] Extract the feature value according to the code of the topic submitted by the students and the relevant topic information, and enter the data preparation stage. When processing each piece of code, the code and comments are separated, and irrelevant information in the beginning and end of the code, such as newline, indentation and space characters, are removed, which makes the processing of later feature values more convenient. Then we extracted nine attributes as the entry point for model training. These nine attributes are: whether the maximum similarity between the student's code and other students' codes exceeds the similarity threshold (CPMS), the...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com