A Novel Approach Exploiting Machine Learning to Detect SQLi Attacks

Submitted by grigby1 on Fri, 02/03/2023 - 3:50pm

Title	A Novel Approach Exploiting Machine Learning to Detect SQLi Attacks
Publication Type	Conference Paper
Year of Publication	2022
Authors	Ashlam, Ahmed Abadulla; Badii, Atta; Stahl, Frederic
Conference Name	2022 5th International Conference on Advanced Systems and Emergent Technologies (IC_ASET)
Date Published	March
Keywords	Analytical models, attacks, CountVectorizer, data mining, Databases, false negative, False Positive, feature extraction, Human Behavior, machine learning algorithms, Metrics, OWASP, Performance analysis, policy-based governance, privacy, pubcrawl, resilience, Resiliency, SQL Injection, SQL injection detection
Abstract	The growing use of Information Technology applications in distributed environments is increasing the number of security exploits. Information about vulnerabilities is also available on the open web in an unstructured format, which developers can use to fix vulnerabilities in their IT applications. SQL injection (SQLi) attacks are frequently launched to exfiltrate data, typically by targeting organisations' back-end servers to compromise their customer databases. There have been a number of high-profile attacks against large enterprises in recent years. With the ever-increasing growth of online trading, SQLi attacks are likely to remain one of the leading routes for cyber-attacks, as indicated by findings reported by OWASP. Various machine learning and deep learning algorithms have been applied to detect and prevent these attacks. However, such preventive attempts have not limited the incidence of cyber-attacks and the resulting database compromises, as reported in the CVE repository. This paper explores the potential of data mining approaches to enhance the efficacy of SQL injection safeguards by reducing the false-positive rate in SQLi detection. The proposed approach uses CountVectorizer to extract features and then applies various supervised machine-learning models to automate the classification of SQLi. The model that returns the highest accuracy is chosen from among the available models. In addition, a new model, PALOSDM (Performance analysis and Iterative optimisation of the SQLI Detection Model), has been created to reduce the false-positive and false-negative rates. Detection accuracy has also improved significantly, from a baseline of 94% to 99%.
DOI	10.1109/IC_ASET53395.2022.9765948
Citation Key	ashlam_novel_2022
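
The abstract names two ingredients: count-based feature extraction and evaluation by false-positive and false-negative rates. The sketch below illustrates both using only the Python standard library. It is not the authors' implementation and not PALOSDM. The tokenizer, the function names, and the example queries are all assumptions made for illustration. In particular, this tokenizer keeps punctuation such as quotes and dashes, whereas scikit-learn's CountVectorizer by default keeps only word tokens of two or more characters.

```python
import re
from collections import Counter

# Assumed tokenization: runs of word characters, or single punctuation marks.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")


def tokenize(query):
    """Lowercase a query and split it into word and punctuation tokens."""
    return TOKEN_RE.findall(query.lower())


def build_vocabulary(queries):
    """Map every distinct token in the corpus to a column index."""
    tokens = sorted({tok for q in queries for tok in tokenize(q)})
    return {tok: idx for idx, tok in enumerate(tokens)}


def vectorize(query, vocabulary):
    """Return a token-count vector for one query; unknown tokens are ignored."""
    vector = [0] * len(vocabulary)
    for tok, count in Counter(tokenize(query)).items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = count
    return vector


def error_rates(y_true, y_pred):
    """Return (false_positive_rate, false_negative_rate).

    Labels are assumed to be 1 for SQLi and 0 for benign input.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    return fpr, fnr


if __name__ == "__main__":
    # Hypothetical toy corpus: one benign query and one injection attempt.
    corpus = ["SELECT name FROM users", "' OR 1=1 --"]
    vocab = build_vocabulary(corpus)
    print(vectorize("' OR 1=1 --", vocab))
    print(error_rates([1, 1, 0, 0], [1, 0, 0, 1]))
```

The count vectors produced this way could be passed to any supervised classifier, and the two error rates could then be compared across candidate models in the way the abstract describes for model selection.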
