Deep Web self-adaptive crawling method based on minimum queryable pattern
A self-adaptive crawling technology based on query patterns, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses problems in existing approaches such as the lack of a sufficient basis for keyword selection, limited processing capacity, and the absence of methods for extracting the minimum queryable patterns of Deep Web query forms.
Embodiment Construction
[0053] A method for crawling the Deep Web based on a minimum queryable pattern, specifically comprising the following steps:
[0054] 1) Generate the minimum queryable pattern set S_mep of the target Deep Web query form;
[0055] 2) Add the seed candidate query q_i to the candidate query set. A candidate query can be expressed as q_i(kv, mep_j), where mep_j is a minimum queryable pattern in S_mep and kv is the keyword vector filled into mep_j;
[0056] 3) For each minimum queryable pattern mep_j in the minimum queryable pattern set, predict its pattern return rate P_new(q(mep_j)), that is, the expected return rate of new records for that minimum queryable pattern;
[0057] 4) For each candidate query q_i(kv, mep_j) in the candidate query set, estimate the conditional return rate P_new(q_i(kv|mep_j)) of its keyword vector kv for new records;
[0058] 5) For each query q_i(kv, mep_j) in the candidate query set, compute the return rate of new records for the query q_i, P n...
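Steps 2) to 4) above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Pattern`, `CandidateQuery`, `score_candidates`, `pattern_rate` and `conditional_rate` are hypothetical, the encoding of a minimum queryable pattern as a set of form field names is an assumption, and both estimators are supplied by the caller because the text does not say how they are computed. The sketch only pairs the two estimates for each candidate query; how step 5) combines them is truncated in the text and is deliberately left out.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, List, Tuple

# Assumption: a minimum queryable pattern (mep) is encoded as the set of
# form field names that are filled together in one query.
Pattern = FrozenSet[str]


@dataclass(frozen=True)
class CandidateQuery:
    """Candidate query q_i(kv, mep_j) from step 2)."""
    kv: Tuple[str, ...]  # keyword vector filled into the pattern
    mep: Pattern         # a member of the pattern set S_mep


def score_candidates(
    s_mep: List[Pattern],
    candidates: List[CandidateQuery],
    pattern_rate: Callable[[Pattern], float],
    conditional_rate: Callable[[CandidateQuery], float],
) -> Dict[CandidateQuery, Tuple[float, float]]:
    """Return (P_new(q(mep_j)), P_new(q_i(kv|mep_j))) per candidate query."""
    # Step 3): one estimate per minimum queryable pattern.
    per_pattern = {mep: pattern_rate(mep) for mep in s_mep}
    scores: Dict[CandidateQuery, Tuple[float, float]] = {}
    for q in candidates:
        if q.mep not in per_pattern:
            raise ValueError("candidate uses a pattern outside S_mep")
        # Step 4): one conditional estimate per candidate query.
        scores[q] = (per_pattern[q.mep], conditional_rate(q))
    return scores


# Usage with placeholder estimators (hypothetical values, not real data).
s_mep = [frozenset({"title"}), frozenset({"author", "year"})]
candidates = [
    CandidateQuery(("database",), s_mep[0]),
    CandidateQuery(("smith", "2009"), s_mep[1]),
]
scores = score_candidates(
    s_mep,
    candidates,
    pattern_rate=lambda mep: 0.5 / len(mep),
    conditional_rate=lambda q: 0.2,
)
```

Passing the estimators in as callables keeps the sketch independent of any particular prediction model, so a real estimator can be substituted without changing the data structures.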