Method and device for automatically labeling text

An automatic labeling and text technology, applied in the computer field, can solve the problems of poor user experience, lexical analysis can not give effective information, can not effectively complete the application, etc., to achieve the effect of broad application prospects

Active Publication Date: 2014-03-26
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the process of realizing the present invention, the inventors found that the prior art has at least the following problems: the lexical analysis only stays on the analysis of the literal semantics of the vocabulary, and for deeper sema...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for automatically labeling text
  • Method and device for automatically labeling text
  • Method and device for automatically labeling text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0023] In the description of the present invention, the terms "first", "second", etc. are used for descriptive purposes only, and cannot be understood as indicating or implying relative importance. In the description of the present invention, unless otherwise specified and limited, the terms "connected" and "connected" should be understood in a broad sense, for ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for automatically labeling a text. The method for automatically labeling the text comprises the following steps of identifying vocabularies in the text; labeling identified vocabularies expressing attribute values into formats corresponding to the types which attribute values belong to in a knowledge base; labeling identified notional words into notional knowledge in the knowledge base; on the basis of a result of labeling the notional words, labeling identified pronouns into contents referred to by the pronouns; and on the basis of results of labeling the notional words and the pronouns, labeling identified attribute names into corresponding attribute names in the knowledge base. In the method for automatically labeling the text, which is disclosed by the embodiment of the invention, text is automatically labeled according to the notional knowledge in the knowledge base and the notional knowledge in the knowledge base is deeply integrated, so as to introduce massive structured information in the knowledge base into conventional text processing application and implement reasoning and expansion between the text and the notional knowledge, thereby expanding a very wide application prospect.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and device for automatically marking text. Background technique [0002] Lexical analysis and processing is a basic technology of NLP (Natural Language Processing, traditional natural language processing), and its main functions include WordSeg (Word Segmentation, natural language text segmentation), PosTag (Part-of-Speech Tagging, part-of-speech tagging) And NER (Named Entity Recognition, proper name recognition). After lexical analysis and processing, the text will be divided into vocabulary forms, and each vocabulary will be given a specific part of speech (for example, verb, noun, adjective, etc.) information. A large number of upper-level application technologies, such as search engine technology, in-depth question answering technology, machine translation technology, etc., are all based on the above analysis results. [0003] However, in the process of realizi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
Inventor 孙珂赵世奇忻舟王海峰
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products