Method, system, device and medium for screening Chinese nouns based on edit distance
A technology of editing distance and screening method, which is applied in the field of text processing, can solve the problems of complex calculation methods, training corpus, and low accuracy, and achieve the effect of expanding the screening range, increasing the amount of data samples, and high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0070] Please refer to figure 1 , figure 1 It is a schematic flow chart of a Chinese noun screening method based on edit distance. Embodiment 1 of the present invention provides a Chinese noun screening method based on edit distance. The method includes:
[0071] Build a data dictionary, wherein, in the data dictionary, words are stored in groups, and each phrase corresponds to a word quotation and a plurality of similar words;
[0072] Obtaining a reference word, matching the reference word with the index in the data dictionary, if the matching is successful, obtaining a plurality of similar words corresponding to the index;
[0073] Combining a plurality of similar words obtained by matching with the reference word to obtain a screening phrase;
[0074] Compute the similarity between each word in the screened phrase and each word in the screened data set;
[0075] Screening out words corresponding to the similarity greater than a threshold from the screening data set to o...
Embodiment 2
[0095] Please refer to Figure 9 , Figure 9 It is a schematic diagram of the composition of the Chinese noun screening system based on edit distance. Embodiment 2 of the present invention provides a Chinese noun screening system based on edit distance. The system includes:
[0096] A construction unit is used to construct a data dictionary, wherein the words in the data dictionary are stored in groups, and each phrase corresponds to a word quotation and a plurality of similar words;
[0097] The matching unit is used to obtain a reference word, and matches the reference word with the index in the data dictionary, and if the matching is successful, obtains a plurality of similar words corresponding to the index;
[0098] A combination unit, configured to combine a plurality of similar words obtained by matching with the reference word to obtain a screening phrase;
[0099] A computing unit, used to calculate the similarity between each word in the screening phrase group and ea...
Embodiment 3
[0102] Embodiment 3 of the present invention provides a device for screening Chinese nouns based on edit distance, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor executes the The computer program realizes the steps of the Chinese noun screening method based on edit distance.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com