Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method and apparatus for detecting similar short messages

A short message and similarity technology, applied in the field of information processing, can solve the problems of new short message recognition lag and achieve the effect of improving recognition efficiency

Active Publication Date: 2016-04-13
BEIJING QIHOO TECH CO LTD
View PDF7 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a method and device for detecting similar text messages, which are used to solve the technical problem of lagging in the identification of new text messages in the prior art, and improve the recognition efficiency of new text messages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for detecting similar short messages
  • Method and apparatus for detecting similar short messages
  • Method and apparatus for detecting similar short messages

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] Please refer to figure 1 , the embodiment of the present application provides a method for detecting similar text messages, the method includes

[0044] S11: Segment the target text message, and obtain the target word vector of the target text message according to each word segment and the corpus word matrix;

[0045] S12: Obtain the similarity between the target word vector and the set word vector, wherein the set word vector is the word vector of at least one or at least one type of reference short message;

[0046] S13: judging whether the similarity is greater than a set threshold;

[0047] S14: If the similarity is greater than the set threshold, determine that the target short message is similar to the at least one or at least one type of reference short message.

[0048] When executing S11 to segment the target short message, all the received short messages can be used as target short messages for word segmentation, or the received short messages can be firstly...

Embodiment 2

[0077] Please refer to image 3 , the embodiment of the present application provides a method for detecting similar short messages in accordance with Embodiment 1, and accordingly provides a device for detecting similar short messages, which includes:

[0078] Word vector acquisition module 31, is used for carrying out word segmentation to target note, and obtains the target word vector of described target note according to each word segmentation and corpus word matrix;

[0079] The similarity calculation module 32 is used to obtain the similarity between the target word vector and the set word vector, wherein the set word vector is the word vector of at least one or at least one type of reference message;

[0080] A judging module 33, configured to judge whether the similarity is greater than a set threshold;

[0081] The first confirming module 34 is configured to determine that the target short message is similar to the at least one or at least one type of reference short ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and an apparatus for detecting similar short messages. The method comprises the following steps of: performing word segmentation for a target short message, and obtaining a target word vector of the target short message according to each segmented word and corpus word matrix; obtaining similarity between the target word vector and a set word vector, wherein the set word vector is the word vector of at least one piece of or at least one type of reference short messages; judging whether the similarity is greater than a set threshold value; and if the similarity is greater than the set threshold value, determining that the target short message is similar to the at least one piece or at least one type of reference short messages. In the technical scheme above, the target short message and the reference short message are converted into word vectors, and similarity between the word vectors of the short message is calculated for obtaining the target short message similar to the reference short message, thus, a new short message is detected, then, a technical problem of lag in identification of the new short message in the prior art is solved, and identification efficiency for the new short message is improved.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to a method and device for detecting similar short messages. Background technique [0002] With the continuous development of science and technology, communication technology has developed rapidly, and there are various ways of communication, including telephone, short message, email and so on. [0003] Short messages are widely used by people because of their short and concise advantages, low cost, etc., and they are also used by lawbreakers because of their wide use and low cost. People often receive fraudulent text messages about stolen bank cards, flight cancellations, point redemption, etc. sent by criminals, and they will be defrauded by criminals if they are not careful. In order to reduce the chances of people being defrauded, the existing technology usually uses marking and screening methods to help users identify fraudulent text messages. The specific proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/284G06F40/30
Inventor 张金晶李强常富洋
Owner BEIJING QIHOO TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products