Media text similarity detection method
A text similarity and detection method technology, applied in the field of natural language processing, can solve problems such as low retrieval efficiency and weak semantic features of text fingerprints, and achieve the effect of improving retrieval efficiency, high accuracy and precision
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0047] Embodiment 1: as figure 1 Shown, the present invention is a kind of media text similarity detection method, and concrete implementation steps are as follows:
[0048] Step 1, media text collection. This example crawls the source code of web pages containing self-media manuscripts from mainstream self-media platforms in the Internet, and ensures that the number of each type of self-media manuscripts is even, and then stores the source code of the webpage in the database.
[0049] Step 2, media manuscript preprocessing. Since the source code of the web page containing media text is obtained by using the crawler tool, it is necessary to extract the text content of the web page source code.
[0050] Sub-step 2-1, manuscript web page preprocessing. Because the source code of the webpage containing the self-media text contains multiple tags, the tags corresponding to the manuscript text are inconsistent on different self-media platforms, so it is necessary to analyze diffe...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


