Resume analysis method based on n-gram model

An analysis method and resume technology, applied in the field of computer science, can solve the problems of weak text adaptability and low accuracy of information extraction, and achieve the effect of improving job hunting efficiency, good adaptability, and high accuracy

Active Publication Date: 2017-09-08
SOUTHWEAT UNIV OF SCI & TECH
View PDF6 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing resume parsing schemes that apply the above three text information extraction models mostly use simple keyword matching meth

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Resume analysis method based on n-gram model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0021] refer to figure 1 , is a schematic flowchart of the resume parsing method provided by the embodiment of the present invention. The resume parsing method of the present embodiment includes the following steps:

[0022] S1: Collect a predetermined number of resume samples.

[0023] Among them, resume samples can be collected from various recruitment websites. The language of the resume sample is Chinese, and it can also be English or other languages. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a resume analysis method based on an n-gram model. The method comprises the steps that resume samples are collected in advance; commonly used field keywords are classified into different types, and a classification dictionary is formed; the n-gram model is used to conduct statistics of a transfer probability of converting each commonly used field keyword into each sample related word; a target keyword matched with the commonly used field keyword in a to-be-analyzed resume is searched; if the transfer probability corresponding to the target keyword is larger than a preset threshold, the transfer probability corresponding to each commonly used field keyword can be updated according to the target keyword; prefix labels and postfix labels are added to effective keywords in the to-be-analyzed resume; and text contents of the to-be-analyzed resume are extracted by segmentation and then output. According to the invention, the automatic resume analysis can be conducted based on the n-gram model and dictionary segmentation technologies; information extraction accuracy can be increased, and different document formats can be supported; and an abundant talent resource base can be provided for recruitment websites and company HR departments.

Description

technical field [0001] The invention relates to the technical field of computer science, in particular to a resume analysis method based on an n-gram model. Background technique [0002] With the rapid development of Internet technology, the network accommodates a large amount of raw data information of various types. In daily life, a resume is a very common and important text, which contains the author's basic situation, work experience and other information. Therefore, how to automatically, quickly and accurately extract useful information from massive resumes has become an urgent need for HR departments of major recruitment websites, companies and enterprises. [0003] Resume parsing is essentially an application of text information extraction. Currently, there are three main types of text information extraction models: dictionary-based extraction models, rule-based extraction models, and hidden Markov model-based extraction models. [0004] The existing resume parsing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q10/06G06Q10/10G06F17/27
CPCG06F16/35G06F40/289G06Q10/0639G06Q10/1053
Inventor 杨春明张晖李建飞李波赵旭剑
Owner SOUTHWEAT UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products