Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic corpus acquisition method and device, computer equipment and storage medium

An acquisition method and corpus technology, applied in office automation, computing, natural language data processing, etc., can solve the problems of low text acquisition efficiency and poor real-time performance, and achieve the effect of improving real-time performance and acquisition efficiency

Active Publication Date: 2021-09-03
PING AN TECH (SHENZHEN) CO LTD
View PDF9 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The embodiment of the present invention provides a method, device, computer equipment, and storage medium for automatic acquisition of corpus, aiming to solve the problem that the words and communication materials used by the interviewer in the on-site interview scene in the prior art can be pre-installed on the user terminal used by the interviewer. The stored text can also be the text provided in the printed materials. The process of organizing the text is not automatically receiving the text sent by the system on the client side. There are problems of low text acquisition efficiency and poor real-time performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic corpus acquisition method and device, computer equipment and storage medium
  • Automatic corpus acquisition method and device, computer equipment and storage medium
  • Automatic corpus acquisition method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0027] It should be understood that when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of described features, integers, steps, operations, elements and / or components, but do not exclude one or Presence or addition of multiple other features, integers, steps, operations, elements, components and / or collections thereof.

[0028] It should also be understood that the terminology used ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic corpus acquisition method and device, computer equipment and a storage medium, and relates to an artificial intelligence technology, and the method comprises the steps: training an LDA model according to each document which is marked with a topic probability distribution result in advance in a corpus to obtain the LDA model; then, using the LDA model to perform operation according to description text data of the input object to obtain a predicted topic probability distribution result, and finally, according to the predicted topic probability distribution result, selecting corpora with the same topic from a corpus to form a first target corpus subset, and sending the first target corpus subset to a user side. Through the LDA model, the corpus required in the interview process is predicted, and the target corpus is automatically screened and pushed based on the predicted prediction subject, so that the acquisition efficiency of the target text is improved, the target text is pushed more timely, and the real-time performance is improved.

Description

technical field [0001] The present invention relates to the technical field of speech and semantics of artificial intelligence, in particular to a method, device, computer equipment and storage medium for automatic acquisition of corpus. Background technique [0002] At present, in enterprise interview scenarios, interviewers communicate or test with interviewers based on a set of fixed interview procedures, and then obtain evaluation results for interviewers. During this process, the words and communication materials used by the interviewer can be the pre-stored text on the user terminal used by the interviewer, or the text provided in the printed materials. In this way, the user needs to operate the user terminal to edit the text or The text is selected from a large number of text databases in the database, and then it is determined whether to print the text according to the actual use requirements, which leads to the fact that the text finishing process is not the text se...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F40/216G06Q10/10
CPCG06F16/3346G06F40/216G06Q10/1053
Inventor 袁雅云张莉任杰吴志成
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products