Method for generating distractors of English similar word forms by being combined with word class

A distractor-generation technique that combines word-form similarity with part of speech, applied in the field of electrical digital data processing. It addresses the insufficient accuracy and reasonableness of generated near-form words and the low similarity of the resulting distractors.

Status: Inactive
Publication Date: 2014-07-02
DALIAN UNIV
2 Cites, 8 Cited by

AI Technical Summary

Problems solved by technology

[0003] The traditional algorithm for generating near-form-word distractors mainly uses the edit distance algorithm to calculate word similarity. The edit distance algorithm has inherent defects, so the generated near-form words are not sufficiently accurate or reasonable, and the distractors have low similarity to the source word.


Examples

Embodiment Construction

[0044] The present invention will be further described below in conjunction with the accompanying drawings of the specification.

[0045] The present invention introduces the LCS (longest common subsequence) algorithm on the basis of the edit distance algorithm and fuses the two in normalized form, which improves the accuracy and reliability of the word similarity calculation. On this basis, it uses the part of speech of the English word itself as a further filtering condition to generate more reasonable distractors. Finally, an experimental comparison shows that the algorithm is more accurate and reasonable than the traditional distractor generation algorithm based on edit distance alone.
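
The fusion of edit distance and LCS described above can be illustrated with a minimal sketch. The excerpt does not give the patent's actual normalization or fusion formula, so everything here is an assumption: both measures are scaled to [0, 1] by the longer word's length and combined with a hypothetical `weight` parameter. The function names are illustrative only.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, 1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def fused_similarity(a: str, b: str, weight: float = 0.5) -> float:
    """Assumed fusion: weighted mean of two length-normalized scores.

    This is NOT the patent's formula (which the excerpt omits); it only
    shows one plausible way to normalize and combine the two measures.
    """
    if not a and not b:
        return 1.0
    longest = max(len(a), len(b))
    ed_sim = 1.0 - edit_distance(a, b) / longest
    lcs_sim = lcs_length(a, b) / longest
    return weight * ed_sim + (1.0 - weight) * lcs_sim


print(fused_similarity("sensitive", "sensible"))
```

Under these assumptions, the pair sensitive/sensible has edit distance 3 and LCS length 6 over a longest length of 9, so both normalized scores come out at about 0.67, inside the 0.6-1.0 range used by the method.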

[0046] As shown in Figure 1, the method of the present invention for generating English near-form-word distractors combined with part of speech includes the following steps:

[0047] Select a source word from the word bank as the source word string str1, and other words as the target...

Abstract

The invention relates to a method for generating distractors of English similar word forms combined with word class. The method includes the following steps: selecting a source word from a word bank as the source word character string and using the other words as target word character strings; traversing all words in the word bank and computing the similarity between the source word character string and each target word character string with a normalized fused similarity algorithm; restricting the similarity threshold to the range 0.6-1.0 and taking the words within that range as candidate words; and subjecting the candidate words and the source word from the previous step to a similarity calculation combined with word class, with the similarity threshold a restricted to 0.6-1.0, thereby obtaining the distractors of the source word. This completes one processing pass. Introducing the LCS (longest common subsequence) algorithm into the normalized fusion removes the blindness of relying on a single similarity algorithm to compare English words, improves the reliability and accuracy of generating distractors of similar word forms, and solves the problem of words with the same meaning but different word classes appearing repeatedly.
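
The two-stage filtering in the abstract can be sketched as follows. This is a hedged illustration, not the patented procedure: the abstract does not say how word class enters the second similarity calculation, so the sketch assumes it acts as a plain same-part-of-speech filter. `difflib.SequenceMatcher.ratio()` from the Python standard library is used purely as a stand-in similarity, since the normalized fused algorithm is not spelled out in this excerpt. The names `generate_distractors`, `pos_of`, and the toy word bank are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, Iterable, List


def stand_in_similarity(a: str, b: str) -> float:
    """Stand-in score in [0, 1]; NOT the patent's fused algorithm."""
    return SequenceMatcher(None, a, b).ratio()


def generate_distractors(
    source: str,
    word_bank: Iterable[str],
    pos_of: Dict[str, str],
    similarity: Callable[[str, str], float] = stand_in_similarity,
    low: float = 0.6,
    high: float = 1.0,
) -> List[str]:
    """One processing pass for a single source word."""
    # Stage 1: traverse the word bank and keep words whose similarity
    # to the source falls within the threshold range.
    candidates = [
        w for w in word_bank
        if w != source and low <= similarity(source, w) <= high
    ]
    # Stage 2 (assumption): keep only candidates that share the source
    # word's part of speech, so that derived forms in other word classes
    # are dropped.
    source_pos = pos_of.get(source)
    return [w for w in candidates if pos_of.get(w) == source_pos]


# Hypothetical toy data for illustration only.
toy_pos = {
    "sensitive": "adj",
    "sensible": "adj",
    "sensitivity": "noun",
    "table": "noun",
}
print(generate_distractors("sensitive", list(toy_pos), toy_pos))
```

Passing the similarity function as a parameter keeps the pipeline independent of whichever scoring formula is actually used.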

Description

Technical field

[0001] The invention relates to a natural language processing method, in particular to a method for generating English near-form-word distractors combined with part of speech.

Background art

[0002] In the process of learning English, we often encounter confusable words. Confusable words mainly include synonyms and near-form words, where near-form words are words with similar word forms. For example, the adjective sensitive means "easily affected or responsive", while the adjective sensible means "reasonable". Although sensitive and sensible share a root and a part of speech, the two words are not synonyms but near-form words. In the design of English test questions and other English learning resources, near-form words often appear as distractors alongside the correct word option, which increases the difficulty of the choice and improves learners' mastery of the words.

[0003] The traditional algorithm for generating near-form-word distractors mainly uses the edit distance algorithm to...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/27
Inventor: 盖荣丽, 汪祖民, 孙晓辉
Owner: DALIAN UNIV