Query generation method for source retrieval based on machine learning in plagiarism detection
A query generation technique based on machine learning, applied in the field of information retrieval, that addresses problems such as the lack of a capacity for continuous improvement.
Examples
Specific Embodiment 1
[0053] Specific Embodiment 1. The machine-learning-based query generation method for source retrieval in plagiarism detection described in this embodiment is as follows:
[0054] For a suspicious document fragment s_k, use the n existing query generation methods to obtain a set of candidate queries, then sort all the candidate queries in the set to obtain a sorted list;
[0055] Take the first m queries of the sorted list as the queries for the suspicious document fragment s_k.
[0056] In this embodiment, each candidate query in the candidate query set is extracted from the suspicious document fragment s_k by one of the existing source retrieval query generation methods; for example, one group of candidates consists of the queries extracted from s_k by existing query generation method 1.
[0057] The existing source retrieval query generation methods referred to in this embodiment are known query generation methods, for example: TF, ...
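The steps of this embodiment (pool the candidates from n existing generators, sort them, keep the first m) can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the two generators `tf_queries` and `leading_words_queries`, the function `generate_queries`, and the `score` callback are hypothetical names introduced here, and the scoring function is left as a parameter because the embodiment does not fix how candidates are scored.

```python
from collections import Counter
from typing import Callable, List

# A query generator maps a suspicious document fragment to candidate queries.
QueryGenerator = Callable[[str], List[str]]


def tf_queries(fragment: str, terms_per_query: int = 3) -> List[str]:
    """Hypothetical TF-style generator: one query built from the most frequent words."""
    words = [w.lower() for w in fragment.split() if len(w) > 3]
    top = [w for w, _ in Counter(words).most_common(terms_per_query)]
    return [" ".join(top)] if top else []


def leading_words_queries(fragment: str, length: int = 4) -> List[str]:
    """Hypothetical second generator: the first few words of the fragment."""
    words = fragment.split()
    return [" ".join(words[:length])] if words else []


def generate_queries(
    fragment: str,
    generators: List[QueryGenerator],
    score: Callable[[str], float],
    m: int,
) -> List[str]:
    """Pool candidates from all generators, sort by score (high to low), keep the first m."""
    candidates: List[str] = []
    for generator in generators:
        for query in generator(fragment):
            if query not in candidates:  # drop duplicates produced by different generators
                candidates.append(query)
    ranked = sorted(candidates, key=score, reverse=True)
    return ranked[:m]
```

A call such as `generate_queries(s_k, [tf_queries, leading_words_queries], score, m)` returns at most m queries for the fragment; any scoring function that maps a query string to a number can be plugged in as `score`.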
Specific Embodiment 2
[0058] Specific Embodiment 2. This embodiment further limits the machine-learning-based query generation method for source retrieval in plagiarism detection described in Specific Embodiment 1. In this embodiment, all candidate queries in the set are sorted from high to low according to the source retrieval evaluation index corresponding to each query.
[0059] The source retrieval evaluation index is the index produced by an existing evaluation method applied to the retrieval results of source retrieval; it indicates the quality of the source retrieval. In this embodiment, the basis for sorting the candidate queries is limited to this evaluation index: the queries produced by the query generation methods with relatively high evaluation indices are selected as the final queries, thereby improving the quality of source retrieval.
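As a rough illustration of sorting by an evaluation index, the sketch below ranks candidate queries by the F1 score of the documents each query retrieved against a known set of true sources. The choice of F1 is an assumption made here for concreteness; the embodiment only refers to an index from an existing evaluation method and does not name one. The function names `f1_score` and `rank_by_evaluation` are likewise hypothetical, and the sketch presumes a setting in which the true sources are known.

```python
from typing import Dict, List, Set


def f1_score(retrieved: Set[str], relevant: Set[str]) -> float:
    """F1 of one query's retrieved documents against the true sources (assumed metric)."""
    hits = len(retrieved & relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(retrieved)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


def rank_by_evaluation(results: Dict[str, Set[str]], sources: Set[str]) -> List[str]:
    """Sort queries from high to low by the evaluation index of their retrieval results."""
    return sorted(results, key=lambda q: f1_score(results[q], sources), reverse=True)
```

Here `results` maps each candidate query to the identifiers of the documents it retrieved, and `sources` holds the identifiers of the true source documents; the first m entries of the returned list would then serve as the final queries.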
Specific Embodiment 3
[0060] Specific Embodiment 3. This embodiment further limits the machine-learning-based query generation method for source retrieval in plagiarism detection described in Specific Embodiment 1. In this embodiment, the sorting is carried out by a machine learning method.