
		<paper>
			<loc>https://jjcit.org/paper/243</loc>
			<title>CURATING DATASETS TO ENHANCE SPYWARE CLASSIFICATION</title>
			<doi>10.5455/jjcit.71-1719026602</doi>
			<authors>Mousumi Ahmed Mimi,Hu Ng,Timothy Tzen Vun Yap</authors>
			<keywords>Datasets curation,Feature engineering,Packet analysis,Spyware classification</keywords>
			<citation>1</citation>
			<views>2089</views>
			<downloads>416</downloads>
			<received_date>22-Jun.-2024</received_date>
			<revised_date>  26-Aug.-2024</revised_date>
			<accepted_date>  14-Sep.-2024</accepted_date>
			<abstract>Current  methods  for  spyware  classification  lack  effectiveness  as  well-structured  datasets  are  typically  absent, 
especially  those  with  directionality  properties  in  their  set  of  features.  In  this  particular  research  work,  the 
efficacy  of  directionality  properties  for  classification  is  explored,  through  engineered  features  from  those  on 
existing  datasets.  This  study  curates  two  datasets,  Dataset  A  which  includes  features  extracted  from  only  single 
directional packet flows and Dataset B which includes those from bi-directional packet flows. Classification with 
these features is performed with selected classifiers, where SVM obtained the highest accuracy with 99.88% for 
Dataset  A,  while  the  highest  accuracy  went  to  RF,  DT and XGBoost  for  Dataset  B  with  99.24%.  Comparing 
these results  with those  from  existing research work, the  directional properties in these engineered features are 
able to provide improvements in terms of accuracy in classifying these spywares.</abstract>
		</paper>


