A method and device for acquiring an entry
An acquisition method and entry technology, which is applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of insufficient collection of entity entries, achieve effective knowledge search, and improve the effect of structured data materials
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0068] figure 1 It is a flow chart of the method for obtaining entries provided by this embodiment, such as figure 1 As shown, the method includes:
[0069] Step S101. Obtain a set of existing entries of the same category in the entry database.
[0070] The entry database may be an encyclopedia entry database, an input method entry database and other classified entry databases. In the present invention, the encyclopedia entry database is used as an example for illustration.
[0071] The classification can adopt the original categories of the classification entry library, including: songs, movies, characters, nature, culture, geography, history, life, society, art, economy, science and technology, sports and other categories, or can be used for existing Some entries are divided into categories using existing classification or clustering methods (such as Bayesian classification method, decision tree method, support vector machine SVM, etc.).
[0072] Obtain the set of existin...
Embodiment 2
[0088] Figure 4 It is a flow chart of the method for obtaining entries provided by this embodiment, such as Figure 4 As shown, the method includes:
[0089] Step S401. Acquiring a collection of existing entries of the same category in the entry database.
[0090] Step S402 , search using the acquired set of existing entries to obtain the anchor text containing the existing entries, and record the location of the webpage where the anchor text of the existing entries is located.
[0091] Step S403 , according to the recorded webpage position, extract the anchor text whose contextual distance from the anchor text of the existing entry satisfies the preset requirement at the corresponding position.
[0092] The above steps S401 to S403 are correspondingly the same as the steps S101 to S103 in the first embodiment, and will not be repeated here.
[0093] Step S404 , comparing the extracted anchor text with the term database to obtain unrecorded anchor text.
[0094] Since the...
Embodiment 3
[0125] Figure 5 It is a schematic diagram of the device for acquiring entries provided in this embodiment. Such as Figure 5 As shown, the device includes:
[0126] Existing entry obtaining module 501 is used to obtain the collection of existing entries of the same category in the entry database.
[0127] The entry database may be an encyclopedia entry database, an input method entry database and other classified entry databases. In the present invention, the encyclopedia entry database is used as an example for illustration.
[0128] The classification can adopt the original categories of the classification entry library, including: songs, movies, characters, nature, culture, geography, history, life, society, art, economy, science and technology, sports and other categories, or can be used for existing Some entries are divided into categories using existing classification or clustering methods (such as Bayesian classification method, decision tree method, support vector ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com