Information extraction method and device and electronic equipment

An information extraction and notebook technology, applied in digital data information retrieval, electronic digital data processing, unstructured text data retrieval, etc., can solve the problem of low overall accuracy of resume experience information, poor overall generalization ability, and accurate resume experience information In order to achieve the effect of reducing the probability of missing, improving the accuracy and reducing the false alarm rate

Pending Publication Date: 2022-05-06
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, due to the influence of factors such as the diversification of resume templates and the diversification of personal writing habits, the method of using rules and keyword lists to obtain resume experience information may be able to accurately extract resume templates that are relatively standardized and ideal. Resume experience information, but for the situation that the ideal resume template is not standardized, the accuracy of the extracted resume experience information is low, and the overall generalization ability is poor, which leads to low overall accuracy of resume experience information extraction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information extraction method and device and electronic equipment
  • Information extraction method and device and electronic equipment
  • Information extraction method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049]The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art based on the present application belong to the protection scope of the present invention.

[0050] In order to solve the problem that the overall accuracy of resume experience information extraction is low due to the existing method of obtaining resume experience information through information extraction using rules and keyword tables, an embodiment of the present invention provides an information extraction method, device and electronic equipment .

[0051] An information extraction method provided by an embodiment of the present invention includes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides an information extraction method and device and electronic equipment, and the method comprises the steps: obtaining a target text to be subjected to information extraction, and carrying out the splicing processing of the target text, and obtaining a to-be-recognized text; adopting a first recognition mode to recognize preset key fields and title lines in the to-be-recognized text to obtain a plurality of first key fields and a plurality of target title lines; on the basis of each first key field and each target title line, partitioning the to-be-recognized text to obtain a plurality of text sub-blocks; for each text sub-block, adopting a second identification mode to extract a preset key field in the text sub-block to obtain a second key field; fusing the first key field and the second key field corresponding to each text sub-block to obtain preliminary extraction results corresponding to different text sub-blocks; and performing duplicate removal on the preliminary extraction result to obtain an information extraction result of the target text. According to the embodiment of the invention, the accuracy of information extraction can be improved.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to an information extraction method, device and electronic equipment. Background technique [0002] Information extraction technology extracts structured text information by analyzing and processing structured, semi-structured and unstructured text data, which is a basic and important task link in the field of natural language processing. Resume is very important for knowing a person. Resume parsing is an important task in the field of intelligent recruitment. It automatically and intelligently parses and extracts personal basic information and experience information such as work, projects, internships, and activities in resume documents. Company recruitment, talent assessment and talent management all play an important role and have practical significance. [0003] In practical applications, due to the influence of many factors such as resume format and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/279G06F40/205G06F16/31G06F16/335
CPCG06F40/279G06F40/205G06F16/313G06F16/335
Inventor 弓源李长亮
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products