Method and system for associating specific names with native places in big data environment

A big data technology for personal names, applied in the fields of database models, relational databases, and electronic digital data processing.

Inactive Publication Date: 2016-11-23
YANGTZE UNIVERSITY

AI Technical Summary

Problems solved by technology

[0017] The prior art offers no technology for associating specific names with native places in a big data environment.


Embodiment Construction

[0062] As shown in Figures 1 to 4, in view of the defects of the prior art, the present invention proposes a method for associating specific names with native places in a big data environment, which includes the following steps:

[0063] S1. Collect name and native place information, including surname, pronunciation, and native place, and perform data fusion, data sampling, and data mining on this information to obtain the collected and mined data; then proceed to step S2 and step S3 at the same time;

[0064] Data sampling and mining include classification, clustering, cross-training, etc.
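
The data fusion part of step S1 can be illustrated with a minimal sketch. The excerpt does not say how fusion is performed, so the approach here (normalize each record, then drop duplicates across sources) is an assumption, and `NameRecord`, `normalize`, and `fuse` are hypothetical names introduced only for illustration. The sample records are made up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NameRecord:
    """Hypothetical record holding the three fields named in step S1."""
    surname: str
    pronunciation: str
    native_place: str


def normalize(rec: NameRecord) -> NameRecord:
    # Assumed cleanup: trim whitespace and lowercase the pronunciation.
    return NameRecord(
        rec.surname.strip(),
        rec.pronunciation.strip().lower(),
        rec.native_place.strip(),
    )


def fuse(*sources):
    """Merge several record lists, keeping the first copy of each record."""
    seen = set()
    fused = []
    for source in sources:
        for rec in source:
            key = normalize(rec)
            if key not in seen:
                seen.add(key)
                fused.append(key)
    return fused


# Made-up toy sources that overlap on one record.
source_a = [NameRecord("Ouyang", "Ouyang ", "Jiangxi")]
source_b = [
    NameRecord(" Ouyang", "ouyang", "Jiangxi"),
    NameRecord("Zhuge", "zhuge", "Shandong"),
]
fused = fuse(source_a, source_b)
```

Using a frozen dataclass makes each record hashable, so a plain set is enough to detect duplicates after normalization.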

[0065] Extraction of association rules: To identify specific names, the embodiment of the present invention first builds a training feature library of specific names manually, and then applies an unsupervised learning method to cluster the specific-name samples in the library. When building the feature database, each name corresponds to a place of...
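
As a rough illustration of what an association rule between a surname and a native place could look like, the sketch below scores each (surname, place) pair with the standard support and confidence measures. This is not the patent's procedure, which relies on a manually built feature library and clustering and is truncated in this excerpt; `mine_rules`, its thresholds, and the toy records are all hypothetical.

```python
from collections import Counter


def mine_rules(records, min_support=0.1, min_confidence=0.6):
    """Return rules (surname, place, support, confidence).

    records: list of (surname, native_place) pairs.
    support    = count(surname and place) / total records
    confidence = count(surname and place) / count(surname)
    """
    total = len(records)
    pair_counts = Counter(records)
    surname_counts = Counter(surname for surname, _ in records)
    rules = []
    for (surname, place), n in pair_counts.items():
        support = n / total
        confidence = n / surname_counts[surname]
        if support >= min_support and confidence >= min_confidence:
            rules.append((surname, place, support, confidence))
    # Strongest rules first: by confidence, then support, then surname.
    rules.sort(key=lambda r: (-r[3], -r[2], r[0]))
    return rules


# Made-up toy data, for illustration only.
records = [
    ("Ouyang", "Jiangxi"), ("Ouyang", "Jiangxi"), ("Ouyang", "Hunan"),
    ("Wang", "Hubei"), ("Wang", "Shandong"), ("Wang", "Henan"),
    ("Sima", "Henan"), ("Sima", "Henan"), ("Sima", "Henan"),
    ("Sima", "Shanxi"),
]
rules = mine_rules(records)
```

On this toy data, the surname spread evenly over several places yields no rule, while the two surnames concentrated in one place each do, which is the intuition behind treating some names as region-specific.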

Abstract

A method for associating specific names with native places in a big data environment comprises the following steps:

S1, name and native place information containing family names, pronunciation of the names and the native places is collected; collected and mined data are obtained after data fusion, data sampling and data mining on the name and native place information, and the method jumps to S2 and S3;

S2, common names in the collected and mined data are removed through screening, and specific name screening and marking are performed on the data after screening removal; a definition of a sample data structure is obtained through classified check of the specific names and the common names, and the method jumps to S4;

S3, the collected and mined data are subjected to feature extraction, association rules are established, and the method jumps to S4;

S4, a specific name set and a feature library are established according to the definition of the sample data structure, the extracted features and the association rules;

S5, an inference model SNNPAR (specific name native place association rules model) is established on the basis of the specific name set and the feature library, and specific name, native place and region inference is performed on the basis of the inference model SNNPAR.
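
The flow from screening (S2) to inference (S5) can be sketched as follows. The abstract does not state how common names are screened or how SNNPAR is structured, so this sketch assumes a simple frequency-share cutoff for screening and a per-surname place count as a stand-in for the model. `screen_specific`, `build_model`, `infer_place`, the `max_share` threshold, and the toy records are all hypothetical.

```python
from collections import Counter, defaultdict


def screen_specific(records, max_share=0.4):
    """S2 sketch: drop surnames whose share of all records exceeds max_share."""
    counts = Counter(surname for surname, _ in records)
    total = len(records)
    return [(s, p) for s, p in records if counts[s] / total <= max_share]


def build_model(specific_records):
    """S3 to S5 sketch: map each remaining surname to its place counts."""
    model = defaultdict(Counter)
    for surname, place in specific_records:
        model[surname][place] += 1
    return model


def infer_place(model, surname):
    """Return (most frequent place, its share) or None for unknown surnames."""
    if surname not in model:
        return None
    place, n = model[surname].most_common(1)[0]
    return place, n / sum(model[surname].values())


# Made-up toy data: one very common surname and two rarer ones.
records = [
    ("Wang", "Hubei"), ("Wang", "Shandong"), ("Wang", "Henan"),
    ("Wang", "Hebei"), ("Wang", "Hubei"),
    ("Ouyang", "Jiangxi"), ("Ouyang", "Jiangxi"),
    ("Zhuge", "Shandong"), ("Zhuge", "Shandong"), ("Zhuge", "Zhejiang"),
]
specific = screen_specific(records)
model = build_model(specific)
```

Calling `infer_place(model, "Zhuge")` then returns the most frequent place recorded for that surname together with its share, while a surname removed during screening returns `None`.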

Description

Technical field

[0001] The present invention relates to the technical field of big data mining, and in particular to a method and system for associating specific names with native places in a big data environment.

Background technique

[0002] With the development of the information age, various industries have produced large volumes of industry-specific big data. Research on big data has immeasurable knowledge value, economic value, and social value for the development of these industries.

[0003] At present, scholars at home and abroad have done little work on the relationship between specific names and native places in a big data environment. The existing work mainly covers the following aspects:

[0004] Recognition of Chinese names: Research on Chinese word segmentation technology is a basic topic of Chinese information processing and is widely used in search engines, machine translation, information extraction, text clustering, and other fields. At present, the ...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/284
Inventor: 王峰
Owner: YANGTZE UNIVERSITY