Document repeatability identification method and device, electronic equipment and storage medium
Patent Information
- Authority / Receiving Office
- CN ยท China
- Current Assignee / Owner
- CHINA CONSTRUCTION BANK
- Publication Date
- 2021-06-08
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The embodiments of the present application relate to the technical field of artificial intelligence, in particular to the technical field of natural language processing, and specifically to a method, device, electronic device, and storage medium for identifying repetitiveness of documents. Background technique
[0002] With the development of Internet technology, all walks of life and various documents can be obtained from the Internet. For example, financial institutions access a large number of financial documents from the Internet every day, including market express, financial information, research reports, policy interpretations, announcements, etc. Many documents from different data sources are the same or similar. If it is not filtered, a large number of duplicate documents or similar documents will flood in, which will greatly affect the accurate transmission of information and affect work efficiency. Therefore, it is particularly important to ...