Training file generation and evaluation method and device, computer system and storage medium

A training file generation and evaluation technology, applied in the field of machine learning. It addresses two problems: the hit rate of training samples cannot be known, and the quality of training samples therefore cannot be guaranteed. It eliminates human error and ensures both the speed and the quality of training file generation.

Pending Publication Date: 2020-08-25
深圳平安医疗健康科技服务有限公司

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a training file generation and evaluation method, device, computer system and storage medium, which are used to solv

Examples

Embodiment 1

[0057] Referring to Figure 1, the training file generation and evaluation method of the present embodiment comprises:

[0058] S1: The annotation server receives the original file, obtains the domain information and training entities of the original file, processes the original file according to the domain information and training entities to obtain an annotation file, and sends the annotation file to the recognition server; wherein the domain information is information data expressing the field to which the original file belongs, and a training entity is a named entity in the original file;

[0059] S2: The recognition server recognizes the semantics of the annotation file through a preset natural language understanding model, performs sequence labeling on it to obtain a training file, and sends the training file to the hit server;

[0060] S3: The hit server has an intelligent search model and a hit analysis algorithm; the hit server inputs the training file into the intelligent search ...
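Steps S1 and S2 can be illustrated with a minimal Python sketch. The source does not disclose the annotation format or the natural language understanding model, so everything here is an assumption made for illustration: the names `AnnotationFile`, `annotate` and `bio_tags` are hypothetical, exact string matching stands in for the annotation server's processing, and whitespace tokenization with BIO tags stands in for the preset model's sequence labeling.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class AnnotationFile:
    """Hypothetical container for the output of step S1."""
    domain: str                          # domain information of the original file
    text: str                            # content of the original file
    spans: list[tuple[int, int, str]]    # (start, end, entity_type), end exclusive


def annotate(text: str, domain: str, entities: dict[str, str]) -> AnnotationFile:
    """S1 stand-in: locate every training entity in the text by exact match."""
    spans = []
    for surface, entity_type in entities.items():
        start = text.find(surface)
        while start != -1:
            spans.append((start, start + len(surface), entity_type))
            start = text.find(surface, start + len(surface))
    return AnnotationFile(domain, text, sorted(spans))


def bio_tags(annotation: AnnotationFile) -> list[tuple[str, str]]:
    """S2 stand-in: whitespace tokens tagged B-/I-/O from the entity spans."""
    tagged = []
    pos = 0
    for token in annotation.text.split():
        start = annotation.text.index(token, pos)
        end = start + len(token)
        pos = end
        tag = "O"
        for span_start, span_end, entity_type in annotation.spans:
            if span_start <= start and end <= span_end:
                prefix = "B-" if start == span_start else "I-"
                tag = prefix + entity_type
                break
        tagged.append((token, tag))
    return tagged


annotation = annotate(
    "patient took aspirin for chest pain",
    "medical",
    {"aspirin": "DRUG", "chest pain": "SYMPTOM"},
)
training_rows = bio_tags(annotation)
```

In this toy version a token with attached punctuation (for example `aspirin,`) falls outside its entity span and is tagged `O`; a real tokenizer and a trained model would be expected to handle such cases.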

Embodiment 2

[0148] Referring to Figure 8, the training file generation and evaluation device 1 of the present embodiment comprises:

[0149] The annotation server 11, which is used to receive the original file, obtain the domain information and training entities of the original file, process the original file according to the domain information and training entities to obtain an annotation file, and send the annotation file to the recognition server 12; wherein the domain information is information data expressing the field to which the original file belongs, and a training entity is a named entity in the original file;

[0150] The recognition server 12, which is used to recognize the semantics of the annotation file through a preset natural language understanding model, perform sequence labeling on it to obtain the training file, and send the training file to the hit server 13;

[0151] The hit server 13 has an intelligent search model and a hit analysis algorithm, which is used to input the training file into the intel...

Embodiment 3

[0154] In order to achieve the above object, the present invention also provides a computer system comprising a plurality of computer devices 2. The components of the training file generation and evaluation device 1 of Embodiment 2 can be distributed across different computer devices, and each computer device can be a smartphone, tablet computer, laptop computer, desktop computer, rack server, blade server, or tower server (including an independent server, or a server cluster composed of multiple servers) that executes the program. The computer device in this embodiment includes at least, but is not limited to, a memory 21 and a processor 22 that can communicate with each other through a system bus, as shown in Figure 9. It should be pointed out that Figure 9 only shows a computer device with these components; it should be understood that implementing all of the illustrated components is not a requirement and that more or fewer components ...

Abstract

The invention discloses a training file generation and evaluation method and device, a computer system and a storage medium. The method comprises the steps of: receiving an original file, obtaining the domain information and training entities of the original file, and processing the original file according to the domain information and training entities to obtain an annotation file; recognizing the semantics of the annotation file through a preset natural language understanding model, and performing sequence labeling on the annotation file to obtain a training file; and inputting the training file into an intelligent search model corresponding to the domain information to obtain a training result, calculating the training result through a hit analysis algorithm to obtain a hit rate, and summarizing the training file and the hit rate to generate a hit analysis report. The training file is thus obtained automatically, and both the quality and the speed of its generation are ensured. This solves the current problem that the labeling quality of training samples cannot be guaranteed because their real hit rate cannot be obtained.
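The final step, computing a hit rate and summarizing it in a report, can be sketched as follows. The source does not define the hit analysis algorithm, so this sketch assumes the simplest reading: a sample counts as a hit when the search model returns exactly the expected result, and the hit rate is hits divided by total samples. The function `hit_analysis`, the report fields and the dictionary-backed `toy_index` are hypothetical stand-ins, not the patented method.

```python
from __future__ import annotations

from typing import Callable


def hit_analysis(
    samples: list[tuple[str, str]],
    search: Callable[[str], str],
) -> dict:
    """Run each (query, expected) sample through `search` and summarize the hits."""
    misses = []
    for query, expected in samples:
        returned = search(query)
        if returned != expected:
            misses.append({"query": query, "expected": expected, "returned": returned})
    total = len(samples)
    hits = total - len(misses)
    return {
        "total": total,
        "hits": hits,
        # Assumed definition: share of samples whose result matched exactly.
        "hit_rate": hits / total if total else 0.0,
        "misses": misses,
    }


# Toy stand-in for the domain-specific intelligent search model.
toy_index = {"aspirin dosage": "doc-1", "chest pain causes": "doc-2"}
samples = [
    ("aspirin dosage", "doc-1"),
    ("chest pain causes", "doc-2"),
    ("fever in children", "doc-3"),
]
report = hit_analysis(samples, lambda query: toy_index.get(query, ""))
```

Keeping the missed samples in the report is a design choice of this sketch: it lets a reviewer see which training samples failed rather than only the aggregate rate.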

Description

Technical field

[0001] The invention relates to the technical field of machine learning, and in particular to a training file generation and evaluation method, device, computer system and storage medium.

Background technique

[0002] A machine learning model is a general term for an algorithm that performs prediction or classification by mining hidden patterns from a large amount of historical data. In the field of intelligent search, an intelligent search model based on a machine learning model is currently usually trained with labeled sample files, so as to obtain a mature model that can accurately understand the sample data and return accurate retrieval results based on that data.

[0003] Therefore, high-quality sample files are crucial for training the intelligent search model; however, since the current training file generation method cannot know the real hit rate of the training samples, the labeling quality of the training samples cannot b...

Application Information

IPC(8): G06N20/00, G06K9/62
CPC: G06N20/00, G06F18/214
Inventor: 王巍
Owner: 深圳平安医疗健康科技服务有限公司