
Semantic identifier generating method and device for text set

A technology for generating semantic identifiers, applied in the field of information processing. It addresses the low efficiency and slow speed of marking text sets with semantic identifiers, and improves the efficiency of producing accurate formal semantic identifiers.

Status: Inactive
Publication Date: 2015-03-25
BEIJING QIHOO TECH CO LTD +1
Cites: 7; Cited by: 3

AI Technical Summary

Problems solved by technology

Because there are tens of thousands of text categories, marking each text set with a corresponding semantic identifier through traditional methods is inefficient and slow.




Embodiment Construction

[0025] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0026] According to a first aspect of the present invention, there is provided a method for generating a semantic identifier for a text set. Figure 1 shows a flowchart of a method 100 for generating a semantic identifier for a text set according to an embodiment of the present invention.

[0027] As shown in Figure 1, the method 100 starts at step S110. In step S110, each text in the text set is subjected to at least one of wo...



Abstract

The invention discloses a method and device for generating a semantic identifier for a text set. The method includes: subjecting the texts in the text set to at least one of segmentation, separate word combination, and permutation and combination to obtain candidate semantic identifiers corresponding to the texts; determining the priority of each candidate semantic identifier according to text characteristics, user behavior characteristics, and the length L of the candidate semantic identifier; and taking the one or more candidate semantic identifiers with the highest priority as the formal semantic identifier of the text set.
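
The three steps in the abstract (generate candidates, assign priorities, keep the highest-priority candidates) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: whitespace splitting stands in for a real word segmenter, the text characteristic is the share of texts that yield a candidate, the user behavior characteristic is a hypothetical click count, the length L is measured in words, and the three are combined by simple multiplication. The function names are likewise hypothetical.

```python
from collections import Counter
from itertools import permutations


def candidate_identifiers(text, max_words=2):
    """Return ordered word combinations (up to max_words words) from one text."""
    words = text.split()  # assumption: whitespace split stands in for a segmenter
    candidates = set()
    for n in range(1, max_words + 1):
        for combo in permutations(words, n):
            candidates.add(" ".join(combo))
    return candidates


def rank_candidates(texts, clicks, max_words=2, preferred_len=2):
    """Return all candidates sorted by descending priority (ties broken by name)."""
    # Text characteristic (assumed): how many texts in the set yield the candidate.
    support = Counter()
    for text in texts:
        support.update(candidate_identifiers(text, max_words))

    scores = {}
    for cand, count in support.items():
        text_score = count / len(texts)
        # User behavior characteristic (assumed): clicks recorded for the candidate.
        behavior_score = 1.0 + clicks.get(cand, 0)
        # Length L (assumed, in words): penalize distance from a preferred length.
        length = len(cand.split())
        length_score = 1.0 / (1 + abs(length - preferred_len))
        # Assumed combination rule: plain product of the three factors.
        scores[cand] = text_score * behavior_score * length_score
    return sorted(scores, key=lambda c: (-scores[c], c))


def formal_identifiers(texts, clicks, top_k=1):
    """Keep the top_k highest-priority candidates as the set's identifiers."""
    return rank_candidates(texts, clicks, max_words=2)[:top_k]


if __name__ == "__main__":
    texts = ["women shirt", "women shirt cotton", "shirt women summer"]
    clicks = {"women shirt": 5}  # hypothetical user behavior data
    print(formal_identifiers(texts, clicks))
```

With this sample data, "women shirt" and "shirt women" appear in every text, and the hypothetical click data is what ranks the first above the second. A real system would presumably replace each assumed factor with the characteristics the full description defines.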

Description

Technical field

[0001] The present invention relates to the technical field of information processing, in particular to a method and device for generating semantic identifiers for text sets.

Background technique

[0002] At present, in the field of the Internet, in order to better understand the needs and interests of users, it is often necessary to classify various short texts. For each short text set, the texts in the set are analyzed to generate a semantic identifier corresponding to that text set, and each text set is then marked with its semantic identifier. For example, a collection of shirts is marked with semantic identifiers such as "women in shirts" / "men in shirts"; a certain type of footwear is marked with semantic identifiers such as "female Doudou shoes" or "Oxford shoes". However, because there are tens of thousands of text categories, it is inefficient and slow to mark each text set with a corresponding semantic mark through tra...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/313, G06F40/221
Inventor: 杨诗
Owner: BEIJING QIHOO TECH CO LTD