This Is Auburn: Electronic Theses and Dissertations


A Study of Adversarial Attacks on Machine Learning-based Fake News Detection Systems


Metadata Field: Value (Language)
dc.contributor.advisor: Dozier, Gerry
dc.contributor.author: Brown, Brandon
dc.date.accessioned: 2023-05-04T13:43:46Z
dc.date.available: 2023-05-04T13:43:46Z
dc.date.issued: 2023-05-04
dc.identifier.uri: https://etd.auburn.edu//handle/10415/8718
dc.description.abstract: Due to the increased use of and reliance on social media, fake news has become a significant problem that can cause great harm to individuals. Because of the dangers fake news poses, techniques must be developed to detect it and keep it from spreading. Currently, fact-checking is an essential damage-control strategy for detecting and mitigating fake news. Websites such as PolitiFact, Snopes, and FactCheck.org use human verifiers to manually fact-check news articles. When only a few articles need to be fact-checked, relying on these websites to take the time to research and debunk fake news is sufficient. However, on social media, where news is generated at an extremely high volume (and with an extremely high velocity), automated approaches are needed for fake news detection. Recently, social media companies have begun to rely on automated systems in the form of machine learning-based fake news detection systems (ML-FNDSs), which are used to classify articles as either news or fake news. Although these ML-FNDSs classify articles effectively, they are susceptible to adversarial attacks, which adversaries use to make an ML-FNDS misclassify its input. Adversarial attacks can be used to make an ML-FNDS accept fake news as news or flag news as fake. In our research, we study two potential vulnerabilities of ML-FNDSs with respect to false positives (i.e., when news is erroneously classified as fake news) and false negatives (i.e., when fake news is erroneously classified as news). In this dissertation, we first introduce the concepts of the Adversarial Universal False Positive (UFP) Attack and the Adversarial Universal False Negative (UFN) Attack. Next, we study the effectiveness of these two attacks on ML-FNDSs based on a single classifier, and finally, we study these attacks on ML-FNDSs based on a set of classifiers (ensemble machines). (en_US)
dc.rights: EMBARGO_GLOBAL (en_US)
dc.subject: Computer Science and Software Engineering (en_US)
dc.title: A Study of Adversarial Attacks on Machine Learning-based Fake News Detection Systems (en_US)
dc.type: PhD Dissertation (en_US)
dc.embargo.length: MONTHS_WITHHELD:60 (en_US)
dc.embargo.status: EMBARGOED (en_US)
dc.embargo.enddate: 2028-05-04 (en_US)
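The abstract describes universal attacks: a single fixed perturbation intended to flip an ML-FNDS's label for any input. As a minimal illustrative sketch (not the dissertation's actual models or attack procedure), assume a hypothetical bag-of-words scorer with hand-set word weights; appending one fixed trigger phrase then acts as a universal false-negative perturbation, making a fake article classify as news:

```python
# Toy sketch of a Universal False Negative (UFN) style attack on a
# hypothetical bag-of-words fake-news scorer. The weights, trigger
# words, and articles here are illustrative assumptions only.

FAKE_WEIGHTS = {
    # words that push the score toward "fake news"
    "shocking": 2.0, "miracle": 2.0, "secret": 1.5,
    # words that push the score toward "news"
    "reuters": -3.0, "official": -2.0, "reported": -1.5,
}

def score(article: str) -> float:
    """Sum the weights of known words; positive means 'fake news'."""
    return sum(FAKE_WEIGHTS.get(w, 0.0) for w in article.lower().split())

def classify(article: str) -> str:
    """Threshold the score into one of the two ML-FNDS labels."""
    return "fake news" if score(article) > 0 else "news"

# A universal perturbation: the SAME appended phrase is meant to flip
# the label regardless of the article it is attached to.
UNIVERSAL_TRIGGER = "reuters official reported"

fake_article = "shocking secret miracle cure discovered"
print(classify(fake_article))                            # fake news
print(classify(fake_article + " " + UNIVERSAL_TRIGGER))  # news
```

A UFP attack is the mirror image: a fixed phrase of fake-news cues appended to legitimate articles so the system flags news as fake. Real attacks search for such triggers against learned models rather than hand-set weights.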

