Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Knowledge base construction method and device, storage medium and computation equipment

A construction method and knowledge base technology, applied in the knowledge base construction method and device, computing equipment, and storage media fields, can solve problems such as unproposed solutions, and achieve the effects of easy understanding, elimination of unfavorable factors, and strong readability

Active Publication Date: 2017-12-29
JINGZAN ADVERTISING SHANGHAI CO LTD
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, how to process the messy and unordered phrase data (that is, the original phrase) into a standardized and structured industry knowledge base has not yet proposed an effective solution.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Knowledge base construction method and device, storage medium and computation equipment
  • Knowledge base construction method and device, storage medium and computation equipment
  • Knowledge base construction method and device, storage medium and computation equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] As mentioned in the background, in order to identify and extract meaningful data for analyzing a certain topic from massive Internet data, the existing technology performs data cleaning according to predetermined rules, but does not consider the impact of industry differentiation on data cleaning. Therefore, the existing data cleaning methods are difficult to establish a knowledge base describing the general information of industry characteristics according to the industry application field.

[0028] An embodiment of the present invention provides a method for constructing a knowledge base, including: determining an industry standard lexicon, and useful word rules and stop word rules corresponding to the industry standard lexicon; Extracting useful words; performing word segmentation on the original phrase to obtain multiple words; if the multiple words include words that match the standard words in the industry standard thesaurus, then the matched words are classified a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a knowledge base construction method and device, a storage medium and computation equipment. The method comprises the following steps of: determining an industrial standard lexicon and a go-word rule and a stop word rule corresponding to the industrial standard lexicon; extracting a go-word from an original phrase on the basis of the go-word rule; carrying out word segmentation on the original phrases to obtain a plurality of words; if the plurality of words comprise words matched with a standard word in the industrial standard lexicon, combining the matched words according to a position relationship in the original phrase so as to obtain a combined word; and combining the combined word with the go-word to obtain a first new phrase, and adding the first new phrase into a knowledge base. By adoption of the method, disordered text data can be processed into ordered industrial knowledge bases with structurized data formats, so that convenience is brought to the subsequent data processing and benefit is brought to improve the correctness of industrial information and industrial knowledges.

Description

technical field [0001] The invention relates to the field of information processing, in particular to a method and device for constructing a knowledge base, a storage medium, and a computing device. Background technique [0002] Most of the information processed by modern big data comes from the Internet. Internet data includes massive data such as public data on the Internet or data crawled by crawlers. There are various sources and formats of Internet data; the information features are not obvious, irregular, and difficult to read, and there are many disturbing information; there are conflicts and even errors in the data. If these conflicting or wrong "dirty data" appear in the statistical results, not only may cause ambiguity, but may even draw wrong conclusions. Therefore, in the Internet data-based big data processing, the prior art generally adopts data cleaning technology to process dirty data. The so-called data cleaning refers to the process of discovering and co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/374G06F40/289
Inventor 汤奇峰齐炜
Owner JINGZAN ADVERTISING SHANGHAI CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products