Course knowledge relationship extraction method and system based on sentence bag attention remote supervision
A remote supervision and relationship extraction technology, applied in the field of relationship extraction, can solve problems affecting test results and difficult evaluation of model performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0209] Example 1: Course Knowledge Entity Extraction Experiment
[0210] The present invention at first extracts knowledge entity by three kinds of keyword extraction algorithms of TF-IDF, TextRank, Word2vec word clustering, comprises following steps:
[0211] Step 1, preprocessing the text;
[0212] Step 2, extract knowledge entities through three keyword extraction algorithms: TF-IDF, TextRank, and Word2vec word clustering;
[0213] In the present invention, the text is preprocessed in the step 1, including: transcoding the text format, removing redundant symbols and stop words, and word segmentation operations, and the step 1 further includes:
[0214] Step 1.1, merge the files in the folder and write them into one file, which is convenient for subsequent operations, and modify the encoding of the files to UTF-8, which is more conducive to subsequent program processing after modification;
[0215] Step 1.2, remove the redundant symbols in the text, because these texts nee...
Embodiment 2
[0242] Example 2: Noise reduction experiment of remote supervised extraction
[0243] The second embodiment of the present invention is based on artificially marked knowledge triplets, using the remote supervision method to automatically obtain the training corpus from the course teaching text; then use PCNN to extract sentence features, and use the sentence bag attention mechanism to mark the remote supervision method The large amount of noise existing in the data is denoised. Then step 2 in embodiment 1, concrete steps are as follows:
[0244] Step 3: Manually labeled knowledge triplets;
[0245] Step 4: Automatically obtain the training corpus from the course teaching text using the distance supervision method and the triplet of step 3;
[0246] Step 5: Use PCNN to extract sentence features, and use the sentence bag attention mechanism to denoise a large amount of noise existing in the data marked by the remote supervision method;
[0247] In the present invention, the a...
Embodiment 3
[0300] Example 3: Experiment of course knowledge point relationship extraction
[0301] The third embodiment of the present invention is the extraction of course knowledge point relations. The contextual semantic information is captured through the word vector with attention, and the location information and type information of the entity are fused to construct entity features, which are input into the Bi_LSTM model to obtain knowledge point relation extraction.
[0302] Then step 5 of the embodiment, in the step 6, the Bi_LSTM model obtains knowledge point relationship extraction, and the steps include:
[0303] Step 6: Capture the contextual semantic information through the word vector with attention, and integrate the location information and type information of the entity to construct entity features, and input the Bi_LSTM model to obtain knowledge point relationship extraction.
[0304] The three features of entity-aware attention include: (1) Bi_LSTM hidden layer state H...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


