Method and device for generating English concept vectors based on wikipedia link structure
A concept vector and concept technology, applied in the field of English concept vector generation based on the Wikipedia link structure, can solve the problem that the word vector method cannot distinguish the concept of the meaning of the word in essence, and achieve the effect of overcoming polysemy and accurate semantic representation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0064] In order to be able to accurately learn the vector representation of word sense concepts, it is necessary to construct training data with concepts as objects. There are a large number of concept annotations in Wikipedia, and these concept annotations have rich semantic link relationships, which provides the possibility to construct training data for concept vectors.
[0065] The purpose of Embodiment 1 is to provide a method for generating English concept vectors based on the Wikipedia link structure.
[0066] In order to achieve the above object, the present invention adopts the following technical scheme:
[0067] Such as figure 1 as shown,
[0068] A method for generating English concept vectors based on Wikipedia link structure, the method comprising:
[0069] Step (1): constructing a link information base according to the title concept and / or link concept in the English Wikipedia page;
[0070] Step (2): According to whether there is a link concept in the sampl...
Embodiment 2
[0259] The purpose of Embodiment 2 is to provide a computer-readable storage medium.
[0260] In order to achieve the above object, the present invention adopts the following technical scheme:
[0261] A computer-readable storage medium, in which a plurality of instructions are stored, and the instructions are adapted to be loaded by a processor of a terminal device and perform the following processing:
[0262] Build a link information base based on the title concept and / or link concept in English Wikipedia pages;
[0263] According to whether there is a link concept in the sample in the link information base, construct training positive examples and training negative examples respectively, and select a certain number of training positive examples and training negative examples to establish a training data set;
[0264] Establish a concept vector model, which includes an input layer, an embedding layer, a concept vector operation layer and an output layer;
[0265] The conc...
Embodiment 3
[0267] The purpose of Embodiment 3 is to provide a terminal device.
[0268] In order to achieve the above object, the present invention adopts the following technical scheme:
[0269] A terminal device, including a processor and a computer-readable storage medium, the processor is used to implement instructions; the computer-readable storage medium is used to store multiple instructions, and the instructions are suitable for being loaded by the processor and performing the following processing:
[0270] Build a link information base based on the title concept and / or link concept in English Wikipedia pages;
[0271] According to whether there is a link concept in the sample in the link information base, construct training positive examples and training negative examples respectively, and select a certain number of training positive examples and training negative examples to establish a training data set;
[0272] Establish a concept vector model, which includes an input layer...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com