Recognition method of personal name and country based on web and gbboosting algorithm

A recognition method and algorithm technology, applied in computing, special data processing applications, instruments, etc., can solve the problem that the classification accuracy rate is higher than random guessing, and achieve the effect of easy implementation

Active Publication Date: 2017-07-04
CHONGQING UNIV OF POSTS & TELECOMM
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the classification accuracy of the above classification method in the scene of classification of names by country needs to be further improved, especially when the spelling of names in two countries is similar, the classification accuracy is only higher than random guessing
It can be seen that the above classification algorithm has great limitations in the application of the classification of personal names and countries

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recognition method of personal name and country based on web and gbboosting algorithm
  • Recognition method of personal name and country based on web and gbboosting algorithm
  • Recognition method of personal name and country based on web and gbboosting algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0025] figure 1 It is a macro-flow chart of the method of the present invention. As shown in the figure, the method includes the following steps: Step 1: extracting the names of academics in universities through WEB data extraction technology; Step 2: constructing a GBBoosting algorithm: constructing a weak classifier, each weak classifier The classifier outputs a weak classification hypothesis for the input sample, and forms a strong classifier through the weight fusion of all weak classifiers; Step 3: Identify the country to which it belongs through the GBBoosting algorithm.

[0026] Figure 4 Based on the micro-flow chart of this method, now combine Figure 4 The specific implementation steps of this method are described.

[0027] 1. Extract the names of university scholars through WEB data extraction technology

[0028] 1) Search "u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a name country identification method based on WEB and GBBoosting algorithms, and belongs to the technical field of WEB data mining. The method comprises the steps of I. extracting names of scholars in universities through a WEB data extraction technology; II. constructing a GBBoosting algorithm: constructing weak classifiers, wherein each weak classifier outputs a weak classification hypothesis to an input sample, and a strong classifier is formed through weight fusion of all weak classifiers; and III. identifying countries of the names through the GBBoosting algorithm. The name country identification method based on WEB and GBBoosting algorithms disclosed by the invention effectively solves a problem on classifying names of two countries which are similar in the spelling way; and meanwhile, the method, compared to existing other classifying methods, is easier to implement, and can be better applied to engineering practices such as name country or city country semantic annotation.

Description

technical field [0001] The invention belongs to the technical field of WEB data mining, and in particular relates to a name and country recognition method based on WEB and GBBoosting algorithm. Background technique [0002] With the rapid development of the Internet and the increasing abundance of WEB resources, in order to quickly and accurately mine necessary and meaningful data from massive data information, in recent years, WEB semantic analysis technology and text classification technology have been widely used in the field of WEB data mining. Applications, WEB-based applications have changed users' living habits and working styles to some extent, and are also welcomed and appreciated by more and more users. [0003] Classification methods such as KNN and Bayesian have achieved good classification results in many classification fields. For example, Jiemei et al. applied KNN to the field of image processing, and proposed a KNN-based classification algorithm to detect the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/334G06F16/367G06F40/216
Inventor 苏畅贾文强王裕坤余跃吴琪
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products