Cross-language document similarity detection method
A detection method and similarity technology, which is applied in the field of cross-language document similarity detection, can solve the problems of cross-language document similarity detection barriers, document similarity detection inapplicability, etc., and achieve the effect of solving changes and deformations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0033] The present invention will be further described in detail with reference to the accompanying drawings and embodiments.
[0034] The cross-language document similarity detection method of the present invention, such as figure 1 As shown, it specifically includes the following steps:
[0035] Step 1. The source document and the target document to be compared are respectively converted into intermediate documents based on words in the same language. The source document and the target document are plain text documents in any language.
[0036] The conversion method is as follows: firstly divide the source document or the target document at the granularity of one or several words; then convert each divided word or phrase into a set Slot composed of an intermediate representation, the intermediate representation Words or phrases in a certain language corresponding to the words or phrases divided into the source document or the target document; finally, an index is built for...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com