Method for automatically extracting sentence template
A technology of automatic extraction and extraction method, which is applied in the field of studying the similarity of sentences and structures, and can solve the problems of easy omission, labor, and time-consuming.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] The method of automatically extracting sentence templates includes the following steps:
[0024] (1) Sentence: Divide the text into several sentences according to punctuation marks; and put serial numbers in front of the sentences in order;
[0025] (2) Word segmentation: Use word segmentation technology to divide each sentence obtained from the sentence into small pieces based on each word;
[0026] (3) After the word segmentation is completed, divide the sentence into several groups according to the number of words in the sentence from more to less or from less to more;
[0027] (4) Template extraction: Apply the LCS algorithm to the same group of sentences to obtain the longest common subsequence. While obtaining the longest common subsequence, delete the longest common subsequence whose internal active part has a length of zero to obtain the sentence template.
Embodiment 2
[0029] The method of automatically extracting sentence templates includes the following steps:
[0030] (1) Sentence: Divide the text into several sentences according to punctuation marks; and put serial numbers in front of the sentences in order;
[0031] (2) Word segmentation: Use word segmentation technology to divide each sentence obtained from the sentence into small pieces based on each word;
[0032] (3) Template extraction: On the basis of the word segmentation result, the LCS algorithm is applied to the sentence to obtain the longest common subsequence, that is, the sentence template is obtained.
Embodiment 3
[0034] The method of automatically extracting sentence templates includes the following steps:
[0035] (1) Sentence: Divide the text into several sentences according to punctuation marks; and put serial numbers in front of the sentences in order;
[0036] (2) Word segmentation: Use word segmentation technology to divide each sentence obtained from the sentence into small pieces based on each word;
[0037] (3) After the word segmentation is completed, divide the sentence into several groups according to the number of words in the sentence from more to less or from less to more;
[0038] (4) Template extraction: Apply the LCS algorithm to the sentences in the same group to get the longest common subsequence, which is the sentence template.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com