Abstract
Enhancing model robustness under new and even adversarial environments is a crucial milestone toward building trustworthy machine learning systems. Current robust training methods such as adversarial training explicitly uses an “attack” (e.g., L-inf-norm bounded perturbation) to generate adversarial examples during model training for improving adversarial robustness. In this paper, we take a different perspective and propose a new framework called SPROUT, self-progressing robust training. During model training, SPROUT progressively adjusts training label distribution via our proposed parametrized label smoothing technique, making training free of attack generation and more scalable. We also motivate SPROUT using a general formulation based on vicinity risk minimization, which includes many robust training methods as special cases. Compared with state-of-the-art adversarial training methods (PGD-L-inf and TRADES) under L-inf-norm bounded attacks and various invariance tests, SPROUT consistently attains superior performance and is more scalable to large neural networks. Our results shed new light on scalable, effective and attack-independent robust training methods.
| Original language | English |
|---|---|
| Title of host publication | 35th AAAI Conference on Artificial Intelligence, AAAI 2021 |
| Publisher | Association for the Advancement of Artificial Intelligence |
| Pages | 7107-7115 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781713835974 |
| Publication status | Published - 2021 |
| Externally published | Yes |
| Event | 35th AAAI Conference on Artificial Intelligence, AAAI 2021 - Virtual, Online Duration: 2 Feb 2021 → 9 Feb 2021 |
Publication series
| Name | 35th AAAI Conference on Artificial Intelligence, AAAI 2021 |
|---|---|
| Volume | 8B |
Conference
| Conference | 35th AAAI Conference on Artificial Intelligence, AAAI 2021 |
|---|---|
| City | Virtual, Online |
| Period | 2/02/21 → 9/02/21 |
Bibliographical note
Publisher Copyright:Copyright © 2021, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved