Text-oriented digital forensic analysis method and device and computer readable medium

A text and digital technology, applied in the field of text-oriented digital forensics analysis and computer-readable media, which can solve the problems of labor-intensive and low-efficiency

Active Publication Date: 2018-11-23
BEIJING UNIV OF TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of this, the object of the present invention is to provide a text-oriented digital forensic analysis method, device and computer-readable medium to solve the problem that the existing technology can only rely on manual browsing of the text content to determine the content of the text to be forensic. Inefficient and labor-intensive technical issues caused by whether the text to be forensic is the target of forensics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text-oriented digital forensic analysis method and device and computer readable medium
  • Text-oriented digital forensic analysis method and device and computer readable medium
  • Text-oriented digital forensic analysis method and device and computer readable medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] figure 1 is a flowchart of a digital forensics method provided according to an embodiment of the present invention, such as figure 1 As shown, the digital forensics method includes the following steps:

[0027] Step S102, preprocessing the text content of the text to be forensics to obtain a plurality of stem words; wherein, the stem words are words other than punctuation marks and stop words in the text to be forensics, and stop words include at least one of the following: adjectives, adverb, pronoun;

[0028]In the embodiment of the present invention, stem words refer to words with actual meaning in the text content, excluding stop words such as adjectives, adverbs, pronouns, etc., and meaningless content such as punctuation marks.

[0029] Step S104, generating the LDA model based on the trained document topic to obtain the feature words in the stem words, obtaining multiple feature words, and determining the feature word vector based on the multiple feature words;...

Embodiment 2

[0063] According to the embodiment of the present invention, there is also provided a digital forensic device, which is used to execute the digital forensic method provided in the above-mentioned content of the embodiment of the present invention. The digital forensic device provided by the embodiment of the present invention will be specifically introduced below.

[0064] Figure 5 is a schematic diagram of a digital forensics device provided according to an embodiment of the present invention, such as Figure 5 As shown, the digital forensics device includes: a preprocessing module 10, an acquisition module 20, a calculation module 30, and a determination module 40, wherein:

[0065] The preprocessing module 10 is used to preprocess the text content of the text to be forensic to obtain a plurality of stem words; wherein, the stem words are words other than punctuation marks and stop words in the text to be forensic, and the stop words include at least one of the following O...

Embodiment 3

[0079] see Image 6 , the embodiment of the present invention also provides a computer 100, including: a processor 60, a memory 61, a bus 62 and a communication interface 63, the processor 60, the communication interface 63 and the memory 61 are connected through the bus 62; the processor 60 is used for Executable modules, such as computer programs, stored in the memory 61 are executed.

[0080] Wherein, the memory 61 may include a high-speed random access memory (RAM, Random Access Memory), and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory. The communication connection between the system network element and at least one other network element is realized through at least one communication interface 63 (which may be wired or wireless), and the Internet, wide area network, local network, metropolitan area network, etc. can be used.

[0081] The bus 62 can be an ISA bus, a PCI bus or an EISA bus, etc. The bus can be divided into ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention provides a text-oriented digital forensic analysis method and a device and a computer readable medium. The method comprises the following steps of: preprocessing the text content of the text to be taken as evidence to obtain a plurality of main words; generating an LDA model based on the trained document theme to obtain feature words in the main words, obtaining a plurality of feature words, and determining feature word vectors based on a plurality of feature words; calculating a semantic similarity between the feature word vector and the preset sensitive word vector, and obtaining a semantic similarity maximum vector based on the semantic similarity; and determining whether the text to be taken as evidence is a forensic target based on the semantic similarity maximum vector. According to the text-oriented digital forensic analysis method and the device and the computer readable medium, the technical problems in the prior art of inefficiency and labor wastage is solved, which are caused by only manually browsing the text content to determine whether the text to be taken as evidence is a forensic target when taking evidence of the text content of the text to be taken as evidence , , thereby realizing the technical effect of saving labor costs and improving the forensic efficiency of the text content.

Description

technical field [0001] The present invention relates to the technical field of digital forensics, in particular to a text-oriented digital forensics analysis method, device and computer-readable medium. Background technique [0002] In recent years, computer technology has developed rapidly, and various electronic devices have appeared in people's life and work, such as computers, tablet computers, smart phones, embedded terminals, etc. These "brained" devices contain a lot of user data. Become an important source of investigation and forensics for digital forensics. Text data is the most basic form of electronic data existence. In addition to simple text data such as text files and table files, user data contained in many applications also exists in the form of text. For example, in social applications and instant messaging applications, the most important data are the public remarks published by users and the content of communications with contacts, and these user data us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 何泾沙黄娜朱娜斐刘公政轩兴刚泽维迪阿贝
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products