Data mining-based word usage knowledge acquisition system and method

A data mining and word usage technology, applied in special data processing applications, electrical digital data processing, instruments, etc. It addresses the problems that effective knowledge is difficult to obtain from general search results and that users find it hard to locate correct examples of word usage, making usage knowledge easier to acquire and improving the user experience.

Active Publication Date: 2013-04-24
SHENZHEN SHI JI GUANG SU INFORMATION TECH

AI Technical Summary

Problems solved by technology

The usage knowledge of a word or phrase can usually be searched for on the Internet. However, the results returned by general search engines are difficult to rely on as the effective knowledge we need, because they only list the web pages related to the word, without considering whether each result is relevant in terms of linguistic role.
In addition, the large amount of redundant information in search results makes it difficult for users to find instances of correct word usage.

Embodiment Construction

[0033] Figure 1 shows a system for acquiring word usage knowledge based on data mining in one embodiment. The system includes an input device 10, a query analysis device 20, a multi-input mode processing device 30, a web page analysis device 40, a usage knowledge extraction device 50 and an output device 60, wherein:

[0034] The input device 10 is used for inputting the word or phrase to be searched. In one embodiment, the word or phrase entered through the input device 10 can take multiple modes. For example, to find the usage knowledge of the word "solve", the search can use the single word input mode (e.g. "solve"), the target language collocation mode (e.g. "solve problem"), the category mode (e.g. " difficulty, thing", " n.", etc.), the comparison mode (e.g. "solve problem / issue") and other modes.
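The four input modes named in [0034] can be told apart by simple surface cues. The following is a minimal sketch, not the patented method: the names `InputMode`, `POS_TAGS` and `classify_query` are hypothetical, and it assumes that a comparison query contains "/", that a category query contains a comma or a part-of-speech tag such as "n.", and that any other multi-word query is a collocation. The text does not specify how category queries are actually written.

```python
from enum import Enum


class InputMode(Enum):
    SINGLE_WORD = "single word"
    COLLOCATION = "target language collocation"
    CATEGORY = "category"
    COMPARISON = "comparison"


# Hypothetical marker set: a part-of-speech tag such as "n." signals category mode.
POS_TAGS = {"n.", "v.", "adj.", "adv.", "prep."}


def classify_query(query: str) -> InputMode:
    """Guess the input mode of a query from simple surface cues (assumed rules)."""
    tokens = query.split()
    # Alternatives separated by "/" are treated as a comparison query.
    if "/" in query:
        return InputMode.COMPARISON
    # A comma-separated category list or a POS tag is treated as category mode.
    if "," in query or any(tok in POS_TAGS for tok in tokens):
        return InputMode.CATEGORY
    # Any remaining multi-word query is treated as a collocation.
    if len(tokens) > 1:
        return InputMode.COLLOCATION
    return InputMode.SINGLE_WORD
```

Under these assumptions, `classify_query("solve problem / issue")` yields the comparison mode, and a real system would hand the query to the matching mode processor at this point.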

[0035] The query analysis device 20 is configured to analyze the keywords in the word or phrase to be searched, and send the word or phrase to be searched t...

Abstract

The invention provides a data mining-based word usage knowledge acquisition system and method. The system comprises an input device, a query analysis device, a multi-input mode processing device, a web page analysis device, a usage knowledge extraction device and an output device. The input device is used for inputting a word or phrase to be searched. The query analysis device analyzes the keywords in the word or phrase to be searched and, according to the analysis result, passes the word or phrase to the corresponding input mode processing device. The multi-input mode processing device analyzes and expands the word or phrase to be searched using semantic knowledge and dictionaries to form a search item, and searches web page information according to the search item to acquire web pages related to the word or phrase. The web page analysis device analyzes the retrieved web pages and converts them into candidate text. The usage knowledge extraction device processes the candidate text and extracts context information and typical sentences for the word or phrase to be searched. The output device outputs the context information and the typical sentences. With this system and method, word usage knowledge can be acquired accurately.
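The last two stages described above, converting a web page into candidate text and extracting context information and typical sentences, can be illustrated with a short sketch. This is an assumption-laden simplification rather than the patent's implementation: `page_to_candidate_text` and `extract_usage` are hypothetical names, "context information" is approximated by counts of words within a fixed window around the target word, and a "typical sentence" is taken to be any sentence that contains the target word.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def page_to_candidate_text(html: str) -> str:
    """Strip markup and collapse whitespace to obtain candidate text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def extract_usage(text: str, word: str, window: int = 2):
    """Return (context word counts, sentences containing `word`).

    Assumption: context is approximated by words within `window`
    positions of the target word; no linguistic analysis is done.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text)
    context = Counter()
    typical = []
    for sentence in sentences:
        tokens = re.findall(r"[a-z']+", sentence.lower())
        if word not in tokens:
            continue
        typical.append(sentence.strip())
        for i, tok in enumerate(tokens):
            if tok == word:
                lo, hi = max(0, i - window), i + window + 1
                context.update(t for t in tokens[lo:hi] if t != word)
    return context, typical
```

A real system would rank sentences and filter context by linguistic role, which is the problem the patent targets. The sketch only shows how data flows from page to candidate text to usage output.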

Description

【Technical field】

[0001] The invention relates to the technical field of computer information processing, in particular to a system and method for acquiring knowledge of word usage based on data mining.

【Background technique】

[0002] When people read, write, and translate in a foreign language, they often encounter words and phrases that are not included in the dictionary, and the same word or phrase often has different translations in different contexts. Therefore, how to write idiomatic words and sentences is a problem faced by everyone who uses a foreign language. For Chinese students, the differences between Chinese and English culture and language styles, coupled with a lack of knowledge of English collocations (such as adjective-noun, verb-noun and verb-preposition collocations), make this problem particularly prominent.

[0003] The development of the Internet has provided us with unpre...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 方高林
Owner: SHENZHEN SHI JI GUANG SU INFORMATION TECH