Information extraction method and device and electronic equipment

An information extraction and notebook technology, applied in digital data information retrieval, electronic digital data processing, unstructured text data retrieval, etc., can solve the problem of low overall accuracy of resume experience information, poor overall generalization ability, and accurate resume experience information In order to achieve the effect of reducing the probability of missing, improving the accuracy and reducing the false alarm rate
CN114444489APending Publication Date: 2022-05-06BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Publication Date
2022-05-06

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The embodiment of the invention provides an information extraction method and device and electronic equipment, and the method comprises the steps: obtaining a target text to be subjected to information extraction, and carrying out the splicing processing of the target text, and obtaining a to-be-recognized text; adopting a first recognition mode to recognize preset key fields and title lines in the to-be-recognized text to obtain a plurality of first key fields and a plurality of target title lines; on the basis of each first key field and each target title line, partitioning the to-be-recognized text to obtain a plurality of text sub-blocks; for each text sub-block, adopting a second identification mode to extract a preset key field in the text sub-block to obtain a second key field; fusing the first key field and the second key field corresponding to each text sub-block to obtain preliminary extraction results corresponding to different text sub-blocks; and performing duplicate removal on the preliminary extraction result to obtain an information extraction result of the target text. According to the embodiment of the invention, the accuracy of information extraction can be improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to an information extraction method, device and electronic equipment. Background technique

[0002] Information extraction technology extracts structured text information by analyzing and processing structured, semi-structured and unstructured text data, which is a basic and important task link in the field of natural language processing. Resume is very important for knowing a person. Resume parsing is an important task in the field of intelligent recruitment. It automatically and intelligently parses and extracts personal basic information and experience information such as work, projects, internships, and activities in resume documents. Company recruitment, talent assessment and talent management all play an important role and have practical significance.

[0003] In practical applications, due to the influence of many factors such as resume format and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More