Data mining-based word usage knowledge acquisition system and method

A data mining and usage technology, which is applied in special data processing applications, electrical digital data processing, instruments, etc., can solve problems such as difficult effective knowledge, and it is difficult for users to find correct examples of word usage, so as to facilitate usage knowledge and improve user experience. The effect of experience needs

Active Publication Date: 2011-10-12
SHENZHEN SHI JI GUANG SU INFORMATION TECH
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Usually, the usage knowledge of words or phrases can be searched through the Internet, however, it is difficult to rely on the results obtained by general search engines as the effective knowledge we need, because the search results only list the web pages related to the word , rather than considering whether it is relevant in terms of linguistic roles
In addition, a large amount of redundant information in search results makes it difficult for users to find instances of correct word usage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data mining-based word usage knowledge acquisition system and method
  • Data mining-based word usage knowledge acquisition system and method
  • Data mining-based word usage knowledge acquisition system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] figure 1 A system for obtaining word usage knowledge based on data mining in an embodiment is shown, and the system includes an input device 10, a query analysis device 20, a multi-input mode processing device 30, a web page analysis device 40, a usage knowledge extraction device 50 and an output device 60 . in:

[0034] The input device 10 is used for inputting words or phrases to be searched. In one embodiment, the word or phrase to be searched inputted by the input device 10 has multiple modes. For example, if it is necessary to find the usage knowledge of the word "solve", a single word input mode (such as "solve") and a target language collocation mode can be used. (such as "solve problem"), category patterns (such as " difficulty, thing", " n.", etc.), comparison mode (such as "solveproblem / issue") and other modes to search.

[0035] The query analysis device 20 is used to analyze the keywords in the word or phrase to be searched, and send the word or phras...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data mining-based word usage knowledge acquisition system and method. The system comprises an input device, a search analysis device, a multi-input mode processing device, a webpage analysis device, a usage knowledge extraction device and an output device, wherein the input device is used for inputting a word or a phrase to be searched; the search analysis device analyzesa keyword in the word or phrase to be searched, and processes the word and the phrase to be searched in the corresponding input mode processing device according to the analysis result; the multi-input mode processing device analyzes and expands the word or the phrase to be searched by utilizing semantic knowledge and dictionaries to form a search item, and searches the webpage information according to the search item so as to acquire a webpage related to the word or the phrase to be searched; the webpage analysis device analyzes the searched webpage, and converts the webpage into a candidate text; the usage knowledge extraction device processes the candidate text, and extracts context information and typical sentences of the word or the phrase to be searched; and the output device outputsthe context information and the typical sentences. By adopting the device and the method, the word usage knowledge can be acquired accurately.

Description

【Technical field】 [0001] The present invention relates to the technical field of computer information processing, in particular to a system and method for acquiring word usage knowledge based on data mining. 【Background technique】 [0002] When people read, write, and translate in a foreign language, they often encounter words and phrases that are not included in the dictionary, and the translation of the same word or phrase is often different in different contexts. Therefore, how to write authentic words and sentences is the key to each Problems faced by speakers of foreign languages. For Chinese students, due to the differences in Chinese and English culture and language styles, coupled with the lack of knowledge of English collocations (such as: form-name collocation, verb-noun collocation, and verb-introduction collocation), the problem of how to write idiomatic words and sentences is difficult. appear particularly prominent. [0003] The development of the Internet ha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
Inventor 方高林
Owner SHENZHEN SHI JI GUANG SU INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products