Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Data Element Recognition Method Based on Logic and Statistical Technology

A technology of statistical technology and identification method, which is applied in the fields of electronic digital data processing, special data processing applications, unstructured text data retrieval, etc.

Active Publication Date: 2017-06-20
KARAMAY HONGYOU SOFTWARE
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem solved by the present invention is to provide a data element identification method based on logic and statistical technology. This method solves the problem of automatic identification of data elements and lays a good foundation for the later classification of data elements.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Data Element Recognition Method Based on Logic and Statistical Technology
  • A Data Element Recognition Method Based on Logic and Statistical Technology
  • A Data Element Recognition Method Based on Logic and Statistical Technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In order to better understand the technical problems solved by the present invention and the technical solutions provided, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The specific embodiments described here are only used to explain the implementation of the present invention, but not to limit the present invention.

[0043] In a preferred embodiment, figure 1 It exemplarily shows a flow chart of a data element identification method based on logic and statistical techniques; including:

[0044] The whole data element identification method can be divided into three modules, which are data element language rule acquisition module, conceptual data element identification module (word type) and logic and application data element identification module (phrase type). (1) data element language rule acquisition module: based on the data in the data element library, use the length characteristics o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of data identification, and particularly discloses a method for identifying data elements on the basis of logic and statistic technologies. The method includes enabling clients to submit data processing requests for to-be-tested language materials, counting length characteristics of data in data element libraries and lexical rules of data element structures by the aid of computers, classifying the data according to the length characteristics of the data, respectively summarizing the lexical rules of different categories of data element structures, performing word segmentation and part-of-speech tagging processing on the data in the to-be-tested language materials by the aid of lexical analysis tools, and extracting candidate data elements from the to-be-tested language materials by the aid of the length characteristics and the lexical rules; counting distribution conditions of the candidate data elements in the to-be-tested language materials, and dividing the candidate data elements into a data element category and a non-data element category step by step; respectively checking the candidate data elements under the matching effects of sub-string filter algorithms and phrases, filtering vocabulary activeness, further computing domain relevancy and sorting the phrases according to the domain relevancy from high to low. The method has the advantage that the data elements can be automatically identified by the aid of the method.

Description

technical field [0001] The invention relates to the technical field of data identification, in particular to a data element identification method based on logic and statistical techniques. Background technique [0002] Data Element: Also known as a data type, a data unit described by a series of attributes such as definition, identification, representation, and allowable values. In a certain context, it is usually used to construct a semantically correct, independent and unambiguous specific conceptual semantic information unit, which is used to standardize and classify industry data in a unified name, type, and value; the purpose is to establish a Various types of industry database construction and database integration, as well as data integration, exchange, sharing, service and application of data standardization needs. [0003] The 6W principle is a method of thinking and solving problems proposed by the British writer Kipling, who won the Nobel Prize for Literature, nam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/35G06F40/30
Inventor 谭远华朱平杨雪
Owner KARAMAY HONGYOU SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products