Reformulation is All You Need: Addressing Malicious Textual Features in DNNs

Yi Jiang; Oubo Ma; Yong Yang; Tong Zhang; Shouling Ji

doi:10.1007/s11633-025-1594-9

Yi Jiang, Oubo Ma, Yong Yang, Tong Zhang, Shouling Ji. Reformulation is All You Need: Addressing Malicious Textual Features in DNNsJ. Machine Intelligence Research. DOI: 10.1007/s11633-025-1594-9

Citation:

Yi Jiang, Oubo Ma, Yong Yang, Tong Zhang, Shouling Ji. Reformulation is All You Need: Addressing Malicious Textual Features in DNNsJ. Machine Intelligence Research. DOI: 10.1007/s11633-025-1594-9

Citation:

Yi Jiang, Oubo Ma, Yong Yang, Tong Zhang, Shouling Ji. Reformulation is All You Need: Addressing Malicious Textual Features in DNNsJ. Machine Intelligence Research. DOI: 10.1007/s11633-025-1594-9

Reformulation is All You Need: Addressing Malicious Textual Features in DNNs

Abstract

Abstract

Human language encompasses a wide range of intricate and diverse implicit features, which attackers can exploit to launch adversarial or backdoor attacks, compromising deep neural network (DNN) models for natural language processing (NLP) tasks. Existing model-oriented defenses often require substantial computational resources as the model size increases whereas sample-oriented defenses typically focus on specific attack vectors or schemes, rendering them vulnerable to adaptive attacks. We observe that the root cause of both adversarial and backdoor attacks lies in the encoding process of DNN models, where subtle textual features, negligible for human comprehension, are erroneously assigned significant weights by less robust or trojaned models. On this basis, we propose a unified and adaptive defense framework that is effective against both adversarial and backdoor attacks. Our approach leverages reformulation modules to address potential malicious features in textual inputs while preserving the original semantic integrity. Extensive experiments demonstrate that our framework outperforms existing sample-oriented defense baselines across a diverse range of malicious textual features.

FullText(HTML)

References (47)

Cited By

免责声明：本文中文版本由iFLYTEK翻译自动生成，仅供参考。对于该英文译文的合理性、准确性及完整性，我们不予负责，亦不对由此产生的相关后果承担任何商业及法律责任。

Reformulation is All You Need: Addressing Malicious Textual Features in DNNs

Abstract

Catalog

Export File

Citation

Format

Content