Job description text similarity calculation method based on topic model

A text similarity and topic model technology, applied in computing, special data processing applications, instruments, etc., that addresses problems such as large deviation in similarity calculation and improves accuracy.

Status: Inactive. Publication date: 2016-07-20
裴克铭管理咨询(上海)有限公司

AI Technical Summary

Problems solved by technology

[0004] To overcome the deficiencies of the prior art, the present invention provides a topic model-based method for calculating the similarity of job description texts. The method overcomes shortcomings of the traditional vector space model in text similarity calculation, such as large deviations, and thereby better achieves automatic identification of positions with overlapping functions.
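The kind of deviation attributed to the traditional vector space model can be illustrated with a minimal sketch. The three job descriptions below are hypothetical, and plain whitespace tokenization with raw term counts is an assumption made for illustration; the patent does not specify the baseline at this level of detail.

```python
import math
from collections import Counter

def bow_cosine(text_a: str, text_b: str) -> float:
    """Cosine similarity of raw term-count vectors (plain vector space model)."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Hypothetical descriptions of essentially the same role, worded differently.
job_a = "recruit staff and manage payroll"
job_b = "hire employees handle salaries"
# A hypothetical unrelated role that shares surface words with job_a.
job_c = "manage warehouse staff and inventory"

print(bow_cosine(job_a, job_b))  # no shared terms, so the score is zero
print(bow_cosine(job_a, job_c))  # shared surface words raise the score
```

Because the score depends only on shared surface terms, the two similar roles score lower than the unrelated pair, which is the behavior a topic-based representation is meant to correct.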


Embodiment Construction

[0022] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0023] As shown in Figures 1 to 4, a topic model-based job description text similarity calculation method includes the following steps.

[0024] Step 1) Input and storage of job description text: the invention allows users to input job description text in two ways. In the first, the user specifies a network address and the system retrieves the text stored on the Internet; in the second, the user inputs the text to be processed directly on the server side. Massive text data is kept in distributed storage.
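The two input modes in Step 1 might be sketched as follows. The function names `fetch_url` and `load_job_text`, the UTF-8 assumption, and the in-memory dictionary standing in for distributed storage are all hypothetical; the patent does not name a concrete API or storage system.

```python
from typing import Callable
from urllib.request import urlopen

def fetch_url(url: str) -> str:
    """Download the text at `url` (assumes a UTF-8 encoded response)."""
    with urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")

def load_job_text(source: str, fetch: Callable[[str], str] = fetch_url) -> str:
    """Return job description text from a network address or from direct input."""
    if source.startswith(("http://", "https://")):
        # Mode 1: the user supplies an address and the system retrieves the text.
        return fetch(source).strip()
    # Mode 2: the user supplies the text itself.
    return source.strip()

# Hypothetical in-memory stand-in for the distributed store.
store: dict[str, str] = {}
store["job-001"] = load_job_text("  Senior accountant, 5 years of experience.  ")
print(store["job-001"])
```

Passing the fetcher as a parameter keeps the address-based mode testable without network access.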

[0025] Step 2) Specific feature extraction: features specific to job description texts are extracted, such as years of work experience, work location, working hours, education background, and major.
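A minimal sketch of what such feature extraction could look like is given below. The regular expressions, the `EDUCATION_LEVELS` vocabulary, and the "Location:" field convention are assumptions chosen for illustration; the patent does not describe its extraction rules.

```python
import re

# Assumed education vocabulary, ordered from highest to lowest level.
EDUCATION_LEVELS = ("doctorate", "master", "bachelor", "college")

def extract_features(text: str) -> dict:
    """Pull a few job-specific fields out of free text with simple patterns."""
    lowered = text.lower()
    years = re.search(r"(\d+)\+?\s*years?", lowered)
    location = re.search(r"location:\s*([a-z ]+)", lowered)
    education = next((lvl for lvl in EDUCATION_LEVELS if lvl in lowered), None)
    return {
        "working_years": int(years.group(1)) if years else None,
        "location": location.group(1).strip() if location else None,
        "education": education,
    }

# Hypothetical job description text.
sample = "Location: Shanghai. Requires 3+ years of experience and a Bachelor degree in finance."
print(extract_features(sample))
```

Fields that cannot be found are returned as `None`, so later steps can decide how to treat missing values.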

[0026] Step 3) Semantic preprocessing: Semantic preprocessing such as sentenc...


Abstract

The invention discloses a job description text similarity calculation method based on a topic model. The method comprises semantic preprocessing, model preprocessing, topic model analysis, clustering analysis, similarity calculation, and related steps. The projection features of job description texts on different topics are extracted and combined with multiple specific features, such as years of work experience, work location, and education background, to produce a vectorized representation of each job description text, on which text similarity calculation, clustering, and similar functions are performed. Because the texts are represented by both semantic features and domain-specific features, the accuracy of similarity calculation for job description texts is greatly improved. The method makes it possible to find jobs with highly overlapping functions in a massive database of posts and job descriptions, helping the relevant departments with analysis and decision making. It overcomes the large deviation of the traditional vector space model when calculating text similarity, and therefore better achieves automatic identification of functionally overlapping jobs.
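The combination of topic projections with specific features can be sketched as below. Everything concrete here is an assumption: the topic distributions are hypothetical values of the kind a topic model might output, the specific features are hypothetical values normalized to [0, 1], `feature_weight` is an illustrative parameter, and the threshold-based pair search is a simple stand-in rather than the clustering analysis the patent refers to.

```python
import math
from itertools import combinations

def cosine(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def job_vector(topic_dist, features, feature_weight=0.5):
    """Concatenate the topic projection with scaled specific features."""
    return list(topic_dist) + [feature_weight * f for f in features]

# Hypothetical jobs: (topic distribution, [normalized years, normalized education]).
jobs = {
    "hr_specialist": job_vector([0.8, 0.1, 0.1], [0.3, 0.67]),
    "recruiter": job_vector([0.7, 0.2, 0.1], [0.3, 0.67]),
    "warehouse_lead": job_vector([0.1, 0.1, 0.8], [0.5, 0.33]),
}

def overlapping_pairs(vectors, threshold=0.95):
    """Return pairs of jobs whose combined vectors are at least `threshold` similar."""
    return [
        (a, b)
        for a, b in combinations(vectors, 2)
        if cosine(vectors[a], vectors[b]) >= threshold
    ]

print(overlapping_pairs(jobs))
```

With these illustrative values, only the two human-resources roles exceed the threshold, since they agree on both topic projection and specific features.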

Description

Technical field

[0001] The invention belongs to the technical fields of information retrieval and text mining, and in particular relates to a method for calculating the similarity of job description texts based on topic models.

Background

[0002] As competition among enterprises intensifies, human resources account for a growing share of enterprises' operating costs. Correspondingly, the deployment and movement of talent within the enterprise is becoming more frequent. Reducing the demand for positions with highly overlapping functions and making full use of existing staff is therefore one of the important ways for enterprises to reduce costs and improve efficiency. As enterprises continue to grow, traditional means of identifying positions with similar functions, such as manual screening and identification, can no longer meet their needs. Therefore, de...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22, G06F17/27
CPC: G06F40/131, G06F40/205, G06F40/247, G06F40/284, G06F40/30
Inventor: 沈启明
Owner: 裴克铭管理咨询(上海)有限公司