Explainable Artificial Intelligence with Integrated Gradients for the Detection of Adversarial Attacks on Text Classifiers

Text classifiers are Artificial Intelligence (AI) models used to classify new documents or text vectors into predefined classes. They are typically built using supervised learning algorithms and labelled datasets. Text classifiers produce a predefined class as an output, which also makes them suscep...

Full description

Bibliographic Details
Published in:	Applied System Innovation
Main Authors:	Harsha Moraliyage, Geemini Kulawardana, Daswin De Silva, Zafar Issadeen, Milos Manic, Seiichiro Katsura
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Subjects:	adversarial attacks AI cybersecurity integrated gradients text classification explainable AI
Online Access:	https://www.mdpi.com/2571-5577/8/1/17

Internet

https://www.mdpi.com/2571-5577/8/1/17

Explainable Artificial Intelligence with Integrated Gradients for the Detection of Adversarial Attacks on Text Classifiers

Internet

Similar Items