Unlock instant, AI-driven research and patent intelligence for your innovation.

Text detection method, device, computing device, and computer-readable storage medium

A text detection and text technology, applied in the computer field, can solve problems such as low efficiency, low accuracy, and complicated operation, and achieve the effects of improving accuracy, improving detection speed, and improving detection efficiency

Active Publication Date: 2022-05-24
北京万方数据股份有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for these text plagiarism check methods, although the text plagiarism check process can be realized, the operation is complicated, time-consuming, labor-intensive, inefficient, and the accuracy rate is relatively low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text detection method, device, computing device, and computer-readable storage medium
  • Text detection method, device, computing device, and computer-readable storage medium
  • Text detection method, device, computing device, and computer-readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0131] The first step, initialization processing, so as to load the content required in the later stage: initialize the word segmentation system, load the comparison library dictionary (including the main dictionary of the comparison library and the sub-category dictionary of each classification), the class center vector information (each classification information) , document path information and initialize the link metabase.

[0132] The second step is to start segmenting the text submitted for inspection (recording the paragraph ID), segmenting the text (the way of segmenting is consistent with the previous sliding segmentation method), and segmenting each clause. The obtained word segmentation is compared with the main dictionary of the comparison database, and the words existing in the main dictionary are screened out (that is, to ensure that the word segmentation in the sentence submitted for inspection exists in the comparison database). There is also a synonym particip...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a text detection method, device, computing device, and computer-readable storage medium. The text detection method includes: performing clustering preprocessing on the text to be detected to obtain the corresponding word segmentation in each sentence of the text to be detected. Similar class list; determine the similar sentence list corresponding to all clauses in the text to be detected based on the similar class list, and merge each similar sentence in the similar sentence list to obtain a similar segment; determine based on the similar segment The similarity between the text to be detected and the text to which the similar segment belongs. In this application, the effective detection of text is realized, and the detection efficiency is improved; and the difference calculation method using the TF_IDF value not only improves the calculation accuracy, but also greatly improves the detection speed.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular, to a text detection method, apparatus, computing device, and computer-readable storage medium. Background technique [0002] In the existing technology, with the frequent occurrence of fraud incidents in academia, the voice of intellectual property protection is getting louder and louder, and the research of text duplication checking technology has gradually become a research hotspot of relevant experts and scholars. At present, some scholars at home and abroad have proposed methods of text duplication checking. However, for these text duplication checking methods, although the text duplication checking processing can be realized, the operation is complicated, time-consuming and labor-intensive, and the efficiency is low, and the accuracy rate is relatively low. SUMMARY OF THE INVENTION [0003] The present application provides a text detection method, apparatus,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F16/36G06F40/211G06F40/279G06F40/247
CPCG06F40/211G06F40/279G06F40/247
Inventor 于洋刘磊徐香义柏少乾
Owner 北京万方数据股份有限公司