Short text characteristic extension and fitting characteristic library building method and device

An extension method and technology of an extension device are applied in the field of short text feature extension and fitting text feature library construction, which can solve the problems of inaccurate extension results and high risk of short text feature escaping, and achieve the effect of improving the matching success rate.

Inactive Publication Date: 2014-01-22
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the embodiments of the present invention is to propose a short text feature extension and fitting feature library construction method and device to solve the problems of high escape risk and inaccurate extension results in the process of short text feature extension

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text characteristic extension and fitting characteristic library building method and device
  • Short text characteristic extension and fitting characteristic library building method and device
  • Short text characteristic extension and fitting characteristic library building method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and through specific implementation methods.

[0036] figure 1 is a flow chart of the short text feature extension method according to the first embodiment of the present invention. Such as figure 1 As shown, the method includes:

[0037] Step 101, acquiring short text information to be expanded.

[0038] Specifically, the acquisition of the short text information to be extended may be the short text information directly input by the user acquired in real time, or the short text information currently required to be processed by the computer equipment acquired in real time, or the acquisition pre-stored in the computer equipment. or short text messages in other devices that require extended processing.

[0039] Step 102. Delete items without expressive ability in the short text information to be expanded to obtain fitted short text informatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a short text characteristic extension and fitting characteristic library building method and device. The short text characteristic extension method includes the steps of a, obtaining a short text to be extended, b, deleting an item, without meaning expression ability, in the short text to be extended to obtain a fitting short text, c, inquiring the fitting short text in a fitting characteristic library, returning a characteristic item of the fitting short text as an extension characteristic item if the fitting short text is found in the fitting characteristic library, and executing the step d if the fitting short text is not found in the fitting characteristic library, d, omitting an item with the lowest significance weight in the fitting short text to obtain an omitting short text, e, judging whether a sum of the significance weights of all items in the omitting short text is smaller than a threshold value or not, returning ineffectiveness if the sum of the significance weights of all items in the omitting short text is smaller than the threshold value, and executing the step f if the sum of the significance weights of all items in the omitting short text is larger than the threshold value, and f, inquiring the omitting short text in the fitting characteristic library, returning a characteristic item of the omitting short text as an extension characteristic item if the omitting short text is found, using the omitting short text as the fitting short text if the omitting short text is not found, and executing the step d. By means of the short text characteristic extension and fitting characteristic library building method and device, the transferred meaning risks in the short text characteristic extension process are reduced, and the characteristic extension accuracy is improved.

Description

technical field [0001] The invention relates to computer text processing technology, in particular to a short text feature expansion and fitting text feature database construction method and device. Background technique [0002] With the widespread use of applications such as e-mail, web forums, and microblogs, a large amount of text information data has been generated within the Internet, and these information are usually only fragmentary descriptions or opinion comments, with only a short text content, so called short text. Facing the massive text data generated by the rapid development of the Internet, how to accurately and effectively obtain the required materials and information has become a topic of widespread concern and research in the Internet industry. [0003] Due to the short length of the short text and the weak signal of the concept described, the retrieval results may not be obtained in the short text retrieval, or the obtained retrieval results are not what ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/21
CPCG06F16/334G06F16/3332
Inventor 李大任田浩冼健
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products