Text tracing method, device, and storage medium

A text tracing technology, applied in special data processing applications, instruments, and electrical digital data processing. It addresses problems such as inaccurate results, single-function similarity methods, and the inability to judge numerical data sensitively, with the effect of improving comprehensiveness and accuracy and reducing chance effects.

Active Publication Date: 2019-05-21
北京中科闻歌科技股份有限公司
6 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, Jaccard similarity is insensitive to how often a shared word occurs, while cosine similarity responds to the direction of the vectors but is insensitive to their numerical values.
[0005] Existing text similarity comparison methods are therefore limited in function: each judges accurately in only one respect and cannot compare texts in all respects, which makes the final result inaccurate.
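The two limitations described in [0004] can be illustrated with a small sketch. It assumes whitespace tokenization, Jaccard computed on sets of distinct words, and cosine computed on word-count vectors; the function names and sample sentences are hypothetical, and reading the cosine limitation as insensitivity to vector magnitude is an interpretation of the translated text.

```python
import math
from collections import Counter


def jaccard(a_tokens, b_tokens):
    """Jaccard similarity on the sets of distinct words."""
    a, b = set(a_tokens), set(b_tokens)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(a_tokens, b_tokens):
    """Cosine similarity on word-count vectors."""
    a, b = Counter(a_tokens), Counter(b_tokens)
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical sample texts.
base = "data leak report".split()
# Same vocabulary as base, but very different word frequencies.
skewed = "data data data data leak report".split()
# Same word proportions as base, with every count doubled.
doubled = base * 2

jaccard_skewed = jaccard(base, skewed)   # word sets are identical
cosine_skewed = cosine(base, skewed)     # direction of the count vector changed
cosine_doubled = cosine(base, doubled)   # only the magnitude changed

print(jaccard_skewed, cosine_skewed, cosine_doubled)
```

Jaccard reports a perfect match for `skewed` because it only sees which words occur, and cosine reports a (near) perfect match for `doubled` because scaling every count leaves the direction unchanged.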

Examples

Embodiment Construction

[0036] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0037] In order to facilitate the understanding of the embodiments of the present invention, further explanations will be given below with specific embodiments in conjunction with the accompanying drawings, which are not intended to limit the embodiments of the present invention.

[0038] Figure 1 is a schematic flow chart of a text traceabilit...

Abstract

Embodiments of the invention relate to a text tracing method, device, and storage medium. The method comprises: generating a bag-of-words model from a to-be-judged text and a source text, the source text comprising a plurality of texts; performing similarity calculation on the bag-of-words model with each of a plurality of preset similarity algorithms, thereby determining, for each algorithm, a first similarity sequence of the texts in the source text relative to the to-be-judged text; determining a second similarity sequence of the texts in the source text relative to the to-be-judged text based on a first formula and the first similarity sequences; and selecting, based on the second similarity sequence, the texts in the source text that meet a homology condition as the homologous texts of the to-be-judged text. Using multiple similarity algorithms compensates for the deficiencies of any single algorithm, which reduces the chance effects introduced by a particular calculation method and improves the comprehensiveness and accuracy of the result.
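The steps in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not disclose the "first formula" or the "homology condition", so the sketch substitutes a mean of per-algorithm ranks and a top-k cutoff, and it uses Jaccard and cosine as the preset algorithms. All function names, parameters, and sample texts are hypothetical.

```python
import math
from collections import Counter


def bag_of_words(texts):
    """Count vectors over the shared vocabulary of all texts."""
    counts = [Counter(t.lower().split()) for t in texts]
    vocab = sorted(set().union(*counts))
    return [[c[w] for w in vocab] for c in counts]


def jaccard(u, v):
    """Jaccard similarity on which vocabulary entries are present."""
    inter = sum(1 for x, y in zip(u, v) if x and y)
    union = sum(1 for x, y in zip(u, v) if x or y)
    return inter / union if union else 0.0


def cosine(u, v):
    """Cosine similarity on the count vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def rank_positions(scores):
    """1-based rank of each source text (1 = most similar)."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0] * len(scores)
    for position, i in enumerate(order, start=1):
        ranks[i] = position
    return ranks


def trace(query, sources, algorithms=(jaccard, cosine), top_k=1):
    vectors = bag_of_words([query] + list(sources))
    q, source_vecs = vectors[0], vectors[1:]
    # First similarity sequences: one ranking per algorithm.
    first = [rank_positions([alg(q, s) for s in source_vecs])
             for alg in algorithms]
    # ASSUMPTION: mean rank stands in for the undisclosed "first formula".
    combined = [sum(r[i] for r in first) / len(first)
                for i in range(len(sources))]
    # Second similarity sequence: sources ordered by combined rank.
    second = sorted(range(len(sources)), key=lambda i: combined[i])
    # ASSUMPTION: the "homology condition" is being among the top_k.
    return [sources[i] for i in second[:top_k]]


# Hypothetical sample data.
sources = [
    "the cat sat on the mat",
    "stock prices fell sharply today",
    "a cat sat on a mat",
]
query = "the cat sat on the mat today"
result = trace(query, sources)
print(result)
```

Combining ranks rather than raw scores is one simple way to merge algorithms whose scores are not on comparable scales; the patent's actual formula may differ.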

Description

Technical field

[0001] Embodiments of the present invention relate to the field of text data processing, and in particular to a text source tracing method, device, and storage medium.

Background

[0002] Text tracing is mainly used in academic integrity detection, search engine optimization, and similar fields. Its purpose is to judge whether the content of a text was copied or adapted from one or more other texts. Homologous texts are identified mainly by comparing text similarity.

[0003] Existing schemes usually use Jaccard similarity or cosine similarity for text tracing. Jaccard similarity measures the similarity and difference between finite sample sets. The Jaccard coefficient is defined as the ratio of the size of the intersection to the size of the union of the two sets being compared; the larger the value, the higher the similarity. The cosine similarity measures the difference between the two vectors by calcu...
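For reference, the two measures named in [0003] have the following standard definitions. The symbols are introduced here for illustration and do not come from the patent text: A and B are the word sets of the two texts, and a and b are their word-count vectors over a shared vocabulary of n words.

```latex
J(A, B) = \frac{|A \cap B|}{|A \cup B|}, \qquad 0 \le J(A, B) \le 1

\cos(\mathbf{a}, \mathbf{b})
  = \frac{\mathbf{a} \cdot \mathbf{b}}{\lVert \mathbf{a} \rVert \, \lVert \mathbf{b} \rVert}
  = \frac{\sum_{i=1}^{n} a_i b_i}
         {\sqrt{\sum_{i=1}^{n} a_i^2} \, \sqrt{\sum_{i=1}^{n} b_i^2}}
```

Because J depends only on set membership, repeating a word does not change it; because the cosine is normalized by both vector lengths, multiplying every count in a vector by the same positive factor does not change it.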

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22, G06F17/27
Inventor: 孔庆超白洁王磊曹家彭鑫汪小东罗引赵菲菲张西娜
Owner: 北京中科闻歌科技股份有限公司