Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Corpus intention prediction method, corpus labeling method and electronic equipment

A prediction method and corpus technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems that affect labeling efficiency and waste manpower, and achieve the effects of ensuring accuracy, improving accuracy, and ensuring balance

Inactive Publication Date: 2019-11-15
XIAMEN KUAISHANGTONG TECH CORP LTD
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of the labeling work is based on manual labeling. In most cases, the corpus has not been processed in advance, and there will be a large amount of duplicate data. If these duplicate data are not filtered, one will affect the efficiency of labeling, and the other will be a waste of manpower.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Corpus intention prediction method, corpus labeling method and electronic equipment
  • Corpus intention prediction method, corpus labeling method and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the following will describe each embodiment of the present invention in detail with reference to the accompanying drawings. However, those of ordinary skill in the art can understand that, in each implementation manner of the present invention, many technical details are provided for readers to better understand the present application. However, even without these technical details and various changes and modifications based on the following embodiments, the technical solutions claimed in the present application can be realized.

[0019] The first embodiment provided by the present invention is a corpus intent prediction method, which will be described in detail below with reference to figures.

[0020] Please refer to figure 1 , figure 1 A flow chart of the corpus intent prediction method provided by the first embodiment of the present invention is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a natural language processing technology, and provides a corpus intention prediction method, which comprises the following steps: training to obtain N prediction models on thebasis of a preprocessed sample; predicting a corpus to be predicted based on each prediction model to obtain N prediction results; and matching a preset rule based on the N prediction results, and determining intention information corresponding to the to-be-predicted corpus, wherein N is an odd number greater than or equal to 3, and the preset rule comprises the step of determining the same prediction result as intention information corresponding to the to-be-predicted corpus if the same prediction result exists in the N prediction results and the number of the same prediction results is greater than N / 2. Based on the corpus intention prediction method provided by the embodiment of the invention, the intention prediction of the corpus is realized, and the prediction accuracy is improved,so that repeated manual processing work can be greatly reduced. In addition, the invention further provides a corpus labeling method and electronic equipment.

Description

technical field [0001] The invention relates to natural language processing technology, in particular to a corpus intent prediction method, a corpus labeling method and electronic equipment. Background technique [0002] Corpus is the basic resource of corpus linguistics research and the main resource of empirical language research methods. Traditional corpora are mainly used in dictionary compilation, language teaching, traditional language research, statistical or case-based research in natural language processing, etc. With the development of Internet big data and artificial intelligence technology, corpus has also been widely used. [0003] The corpus has three characteristics. The corpus stores the language materials that have actually appeared in the actual use of the language, such as user messages and customer service dialogues obtained directly from the web page. The corpus is the basic resource for carrying language knowledge, but it does not mean Language knowle...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F17/27G06F16/35
CPCG06F16/355G06F18/241G06F18/214
Inventor 陈鑫肖龙源蔡振华李稀敏刘晓葳谭玉坤
Owner XIAMEN KUAISHANGTONG TECH CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products