
Corpus tagging method and corpus tagging device

A corpus tagging technology, applied in the computer field, that addresses low corpus tagging efficiency and low accuracy.

Active Publication Date: 2012-09-12
LESHAN NORMAL UNIV
4 Cites, 42 Cited by

AI Technical Summary

Problems solved by technology

[0009] In the embodiment of the present invention, a credibility indicating unit is added to the conventional computer-aided corpus labeling system. It indicates the credibility of each candidate labeling result for a given corpus, which addresses the low efficiency and low accuracy of corpus labeling.



Examples


Embodiment Construction

[0024] To improve the labeling rate and labeling accuracy of the computer-aided corpus labeling system, in the embodiment of the present invention the corpus labeling device, each time it performs corpus labeling, selects the corpus to be labeled and labels it, and uses the stored historical record of corpus labeling to indicate the credibility of each candidate labeling result for that corpus. This saves human effort and addresses the heavy workload, low efficiency, and low accuracy of labeling large corpora in the prior art.
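As an illustration of paragraph [0024], the sketch below derives a credibility value for each candidate label from a stored labeling history. The text shown here does not state how credibility is computed, so the relative-frequency formula, the function names `build_history` and `credibility`, and the sample records are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def build_history(records):
    """Count how often each token received each label in past labeling."""
    history = defaultdict(Counter)
    for token, label in records:
        history[token][label] += 1
    return history


def credibility(history, token):
    """Return {label: share of past labelings} for a token, or {} if unseen.

    Assumption: credibility is the relative frequency of a label in the
    historical record. The patent text shown here gives no formula.
    """
    counts = history.get(token)
    if not counts:
        return {}
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


# Hypothetical history of (token, part-of-speech label) pairs.
past = [("book", "NOUN"), ("book", "NOUN"), ("book", "VERB"), ("run", "VERB")]
history = build_history(past)
scores = credibility(history, "book")
```

With such scores, a human corrector could review low-credibility results first and skim the rest, which is one way the saving in manual effort described above might be realized.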

[0025] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0026] As shown in Figure 2, in the embodiment of the present invention, the corpus tagging device includes: a tagging unit 20, a credibility indicating unit 21 and a presentation unit 22. The l...
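The three-unit structure named in [0026] might be composed as in the sketch below. The paragraph is cut off before it describes what each unit does, so the roles given here are inferred from the unit names and from [0024]. The class names, the `lexicon` lookup, and the frequency-based score are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class TaggingUnit:
    """Stands in for tagging unit 20: proposes candidate labels."""
    lexicon: dict  # hypothetical: token -> list of candidate labels

    def tag(self, token):
        return self.lexicon.get(token, [])


@dataclass
class CredibilityIndicatingUnit:
    """Stands in for credibility indicating unit 21."""
    history: Counter = field(default_factory=Counter)  # (token, label) -> count

    def indicate(self, token, labels):
        # Assumed scoring: share of past labelings among the candidates.
        total = sum(self.history[(token, label)] for label in labels)
        return {
            label: (self.history[(token, label)] / total if total else 0.0)
            for label in labels
        }


@dataclass
class PresentationUnit:
    """Stands in for presentation unit 22: formats results for a reviewer."""

    def present(self, token, scores):
        ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
        return f"{token}: " + ", ".join(f"{l} ({s:.2f})" for l, s in ranked)


class CorpusTaggingDevice:
    """Wires the three units together in the order tag, score, present."""

    def __init__(self, lexicon, past_records):
        self.tagging_unit = TaggingUnit(lexicon)
        self.credibility_unit = CredibilityIndicatingUnit(Counter(past_records))
        self.presentation_unit = PresentationUnit()

    def process(self, token):
        labels = self.tagging_unit.tag(token)
        scores = self.credibility_unit.indicate(token, labels)
        return self.presentation_unit.present(token, scores)


device = CorpusTaggingDevice(
    {"book": ["NOUN", "VERB"]},
    [("book", "NOUN"), ("book", "NOUN"), ("book", "NOUN"), ("book", "VERB")],
)
line = device.process("book")
```

Keeping the units separate means the scoring rule can be replaced without touching how candidates are generated or displayed.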


Abstract

The invention relates to the technical field of computers and discloses a corpus tagging method and a corpus tagging device. The method includes: each time corpora are tagged, the corpus tagging device selects a corpus to be tagged and tags it, and indicates a credibility for each candidate tagging result of the corpus according to the stored historical tagging records. The manpower for manual correction is thereby reasonably allocated, and the problems of heavy tagging workload, low efficiency and low accuracy for large corpora in the prior art are solved.

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a corpus tagging method and device.

Background

[0002] In linguistics, a corpus is a large collection of texts, and the texts it contains are the corpus material. After the material is organized, it has a predetermined format and markup. A corpus composed of a large amount of material with established formats and tags can be applied to lexicography, language teaching, traditional language research, and statistical or case-based research in natural language processing. Corpora are therefore a basic resource for linguistic research. Corpus annotation is the work of applying word segmentation, part-of-speech tagging, named entity recognition, syntactic processing, information extraction and other processing to the text in the corpus. It is the basis for establishing an accurate corpus and language analysis model. For example, part-of-speech corpus tagging is to tag the part-of-spe...


Application Information

IPC(8): G06F17/27, G06F17/30
Inventor: 金澎, 邱立坤
Owner: LESHAN NORMAL UNIV