The explainability of shallow AI-generated text classification models via parts removing

Peredrii O.; Gorokhovatskyi O.

Будь ласка, використовуйте цей ідентифікатор, щоб цитувати або посилатися на цей матеріал: https://repository.hneu.edu.ua/handle/123456789/40084

Назва:	The explainability of shallow AI-generated text classification models via parts removing
Автори:	Peredrii O. Gorokhovatskyi O.
Теми:	explainability black-box shallow ANN perturbation AI-generated content human-written content text chunk text classification explainability index
Дата публікації:	2026
Бібліографічний опис:	Peredrii O. The explainability of shallow AI-generated text classification models via parts removing / O. Peredrii, O.Gorokhovatskyi // Системи управління, навігації та зв’язку. – 2026. -№ 2. – С. 153–159.
Короткий огляд (реферат):	In this paper, we address the explainability problem for the ANNs' classification of AI-generated and human-written text chunks in Ukrainian texts in the IT domain. The objective is to investigate whether the perturbation-based modifications of text chunks that include the removal of sentences, words, and word combinations may be helpful in searching for explanations. We used five shallow ANN models (with an average accuracy of about 0.88) and tested them on a sample of the document containing human-written text and AI-generated fragments generated with GPT-5, Gemini 2.5 Flash, and Claude Sonnet 4.5. The experimental modeling showed that it is not easy to find a single sentence or word that can flip the classification result. We have proposed an explainability index that measures the total influence of all perturbed samples on the classification result, accounting for the fact that short perturbations are more valuable.
URI (Уніфікований ідентифікатор ресурсу):	https://repository.hneu.edu.ua/handle/123456789/40084
Розташовується у зібраннях:	Статті (ІКТ)

Файли цього матеріалу:

Файл	Опис	Розмір	Формат
24.pdf		607,81 kB	Adobe PDF	Переглянути/відкрити

Показати повний опис матеріалу Перегляд статистики

Усі матеріали в архіві електронних ресурсів захищені авторським правом, всі права збережені.