A Cross-lingual Plagiarism Detection Method Based on Fingerprint Fusion
A detection method and cross-lingual technology, applied in natural language data processing, semantic analysis, digital data information retrieval, etc., can solve problems such as plagiarism and plagiarism
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0069] The following examples describe the present invention in more detail.
[0070] 1. Text preprocessing
[0071] Text preprocessing includes word segmentation technology, part-of-speech tagging, stop word removal, etc. English text needs root restoration, and due to the complexity and polysemy of Chinese, and there are no segmentation marks like spaces in English text, only punctuation marks Segmentation makes the preprocessing of Chinese text more complicated, and the accuracy of text preprocessing also has a great impact on the subsequent experimental results. The Chinese text and the English text need to be preprocessed separately to obtain the noun sequence.
[0072] Input: text information to be analyzed
[0073] Output: Chinese and English feature sets
[0074] Step 1: Chinese text preprocessing. The Chinese text is preprocessed using the Chinese lexical analysis system ICTCLAS of the Chinese Academy of Sciences, and the program directly calls the API of ICTCLAS ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


