• Dec 24, 2021 News!Vol 10, No 1- Vol 10, No 2 has been indexed by IET-(Inspec)   [Click]
  • Aug 27, 2020 News!Welcome Prof. D. P. Sharma from India to join the Editorial board of IJFCC     [Click]
  • Aug 18, 2022 News!Vol.11, No.3 has been published with online version.     [Click]
General Information

Prof. Pascal Lorenz
University of Haute Alsace, France
It is my honor to be the editor-in-chief of IJFCC. The journal publishes good papers in the field of future computer and communication. Hopefully, IJFCC will become a recognized journal among the readers in the filed of future computer and communication.

IJFCC 2013 Vol.2(4): 368-371 ISSN: 2010-3751
DOI: 10.7763/IJFCC.2013.V2.187

Latent Text Mining for Cybercrime Forensics

Raymond Y. K. Lau and Yunqing Xia

Abstract—Recent research reveals that the number of cyber-attacks has been doubled in the past three years. This is a devastating growth of the number of cyber-attacks, and it reveals a serious business problem around the world. Existing intrusion detection systems (IDSs), intrusion prevention systems (IPSs), and anti-malware systems mainly rely on low-level network traffic features or program code signatures to detect cyber-attacks. However, since hackers can constantly change their attack tactics by, it is extremely difficult for existing security solutions to detect cyber-attacks. There are increasing more evidences showing that cybercriminals tend to exchange cybercrime knowledge and transact via online social media. Accordingly, it presents unprecedented opportunities for security intelligence experts to tap into online social media to extract the vital security intelligence for cyber-attack forensics. The main contributions of this paper are the design, development, and evaluation of a Latent Dirichlet Allocation (LDA)-based latent text mining model for cyber-attack forensics. Our preliminary evaluation of the proposed latent text mining model based on a real-world data set crawled from Twitter and Blog sites shows that it significantly outperforms the probabilistic latent semantic indexing (pLSI) method in terms of extracting more relevant and richer concepts describing real-world cyber-attack incidents.

Index Terms—Text mining, latent dirichlet allocation, cyber-attacks, cyber forensics.

Raymond Y. K. Lau is with the City University of Hong Kong, Tat Chee Avenue, Kowloon Tong, Hong Kong SAR (e-mail: raylau@cityu.edu.hk).
Yunqing Xia is with Centre for Speech and Language Technologies, Tsinghua University, Beijing 100084, China (e-mail: yqxia@tsinghua.edu.cn).


Cite: Raymond Y. K. Lau and Yunqing Xia, "Latent Text Mining for Cybercrime Forensics," International Journal of Future Computer and Communication vol. 2, no. 4 pp. 368-371, 2013.

Copyright © 2008-2022. International Journal of Future Computer and Communication. All rights reserved.
E-mail: ijfcc@ejournal.net