Information mining method and device

A technology of information mining and text information, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as no collection of vocabulary collocation pairs, cumbersome and complicated sorting process, high requirements for domain knowledge and language ability, and achieve The effect of accurate information mining

Active Publication Date: 2016-05-11
SHANGHAI YOUYANG XINMEI INFORMATION TECH CO LTD
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, Chinese word collocations are mainly based on manual collection, and there are no word collocation sets and mature and stable word collocation mining methods for specific fields.
Manual collection of word collocations requires high domain knowledge and language skills of analysts, and the sorting process is also very cumbersome and complicated. Therefore, an automated mining method is urgently needed to establish a collection of word collocations in the field.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information mining method and device
  • Information mining method and device
  • Information mining method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0024] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0025] figure 1 An exemplary system architecture 100 to which embodiments of the information mining method or information mining device of the present application can be applied is shown.

[0026] Such as figure 1 As shown, the system architecture 100 may include terminal ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an information mining method and device. A specific implementation way of the method comprises the following steps: carrying out sentence segmentation on obtained text information to obtain a sub-sentence set; selecting at least one candidate sub-sentence from the sub-sentence set according to the preset public opinion word set; carrying out word segmentation on the at least one candidate sub-sentence on the basis of a domain dictionary, carrying out dependency parsing on various words obtained after word segmentation to obtain at least one candidate word collocation pair; selecting at least one word collocation pair from the at least one candidate word collocation pair as a first word collocation pair set mined from the text information according to the public opinion word set. The implementation way achieves rapid and accurate information mining.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to the field of information technology, especially to an information mining method and device. Background technique [0002] With the rapid development of information technology, the Internet contains a large amount of information content. Public opinion is the abbreviation of "public opinion situation". Social attitudes generated and held by people and their political, social, and moral orientations. The collocation of public opinion words in the information content can reflect the core content of the information, and can be used for logical derivation in text analysis. At present, Chinese word collocations are mainly based on manual collection, and there are no word collocation sets and mature and stable word collocation mining methods for specific fields. Manual collection of word collocations requires high domain knowledge and language ability of analysts, and the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/3325G06F16/3329G06F40/211
Inventor 张新展
Owner SHANGHAI YOUYANG XINMEI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products