Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Dictionary creation device for monitoring text information, dictionary creation method for monitoring text information, and dictionary creation program for monitoring text information

A text information and dictionary technology, which is applied in electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as difficulty, omission, and dictionary time-consuming, and achieve the effect of high-precision detection.

Inactive Publication Date: 2015-06-03
NEC CORP
View PDF9 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Generating a dictionary with introspection in a dictionary-based text information monitoring system is time-consuming, prone to omissions, and thus difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dictionary creation device for monitoring text information, dictionary creation method for monitoring text information, and dictionary creation program for monitoring text information
  • Dictionary creation device for monitoring text information, dictionary creation method for monitoring text information, and dictionary creation program for monitoring text information
  • Dictionary creation device for monitoring text information, dictionary creation method for monitoring text information, and dictionary creation program for monitoring text information

Examples

Experimental program
Comparison scheme
Effect test

application example 5-

[0132] Only for phrases whose usefulness is not less than the threshold, the characteristic degree calculation unit 3 calculates the characteristic degree of the phrase, and the detection condition determination unit 22 determines whether the phrase is suitable for the detection condition.

[0133] Compared with application example 2, a specific description is given. Figure 8 is another example of the usefulness and score for each phrase.

[0134] Similar to application example 2, the usefulness calculation unit 21 calculates that the usefulness of "Trojan horse" is 4.5, the usefulness of "Trojan" is 1.5, the usefulness of "Trojan horse" is 1.5, and the usefulness of "Trojan horse infection" is 5 , the usefulness of "Trojan horse infection" is 3, the usefulness of "infection" is 1, and the usefulness of "email" is 1.

[0135] The characteristic degree calculation unit 3 calculates, for example, only the characteristic degrees of phrases having a usefulness of 3 or more: "Tro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The purpose of the present invention is to create a dictionary for monitoring text information such that it is possible to achieve high-precision detection compared to prior art. A feature degree calculation unit (3) compares the statistics of a positive example group and a negative example group, and calculates the degree by which a given phase appears in the positive example group as the feature degree. A usefulness degree calculation unit (21) calculates a usefulness degree by using indices pertaining to the length of the phrase, the frequency at which the phrase appears within the positive example group, and the inclusion relationship between phrases for each phrase extracted by means of a phrase extraction unit (1). A detection condition determination unit (22) uses the usefulness degree calculated by means of the usefulness degree calculation unit (21) and the feature degree calculated by means of the feature degree calculation unit (3) to evaluate the appropriateness of each phrase as a detection condition by means of the product of the usefulness degree and the feature degree, and determines that the phrase is appropriate as a detection condition when the value of the product is greater than a threshold value.

Description

technical field [0001] The present invention relates to a dictionary creation device for monitoring text information, a dictionary creation method for monitoring text information, and a dictionary creation program for monitoring text information. Specifically, the present invention relates to such a dictionary creation device for monitoring text information, a dictionary creation method for monitoring text information, and a dictionary creation program for monitoring text information by which high A dictionary that monitors textual information with precision. Background technique [0002] In order to monitor rumors and the like on the Internet, text information monitoring technology that detects information content to be monitored appearing in a large amount of text has become important. The text information monitoring system employed in the present invention monitors text information on the basis of dictionaries. In other words, as a text information monitoring technique,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/242G06F16/374
Inventor 大西贵士土田正明石川开
Owner NEC CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products