Sorting type hidden network database data acquisition method
An acquisition method and database technology, applied in the field of data acquisition of sorted hidden network database, can solve problems such as lack of research, incomplete solution of crawling word selection, etc., and achieve the effects of reducing repetition rate, improving coverage rate, and low cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] The present invention will be described in detail below with reference to the accompanying drawings and examples.
[0037] The invention provides a method for obtaining data in a sorted hidden network database, which uses a crawling method to obtain data in a target database; during the crawling operation, the crawling keywords are selected using a method based on document frequency estimation , mainly includes four parts: sample data set acquisition, extraction of candidate keyword sets from sample data set, document frequency estimation of candidate keywords, determination of crawled keywords. Finally, according to the obtained crawling keywords, the documents of the sorted hidden web database are crawled to obtain the sorted hidden web database data.
[0038] The specific ideas of the method of the present invention are as follows: first, obtain a certain number of documents from the sorting type hidden network data source DB to form a document sample set D; then, ob...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com