Method capable of combining word vector with bootstrap learning for obtaining and organizing domain entity hyponymy
A technology of word vectors and domains, applied in natural language data processing, special data processing applications, instruments, etc., can solve the problems of low extraction efficiency and high corpus dependence, and achieve the effect of improving accuracy and easy extraction
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] Embodiment 1: as Figure 1-3 As shown, a method for acquiring and organizing the hyponym relationship of domain entities combined with word vectors and bootstrap learning, the specific steps of the method are as follows:
[0057] Step1. Firstly, according to the bootstrap learning method, obtain candidate hyponymy relationship examples from the text in the tourism field;
[0058] Step1.1. First, manually write a crawler program to crawl text information in the tourism field from travel websites and encyclopedia entries;
[0059] The present invention considers that the positions and tags to be crawled in the crawler program are different due to different webpage structures, and there is no ready-made program, so programs need to be written for different tasks of crawling. It is necessary to select the corpus of different travel webpage themes as comprehensively as possible. Such as Baidu Encyclopedia entries, travel webpage information, etc.
[0060] Step1.2, the pre...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com