https://doi.org/10.1140/epjs/s11734-025-01685-x
Regular Article
Disruption prediction on J-TEXT tokamak using ACO-BP-AdaBoost algorithm coupled with data augmentation
1
School of Electrical Engineering & Automation, Jiangsu Normal University, 221116, Xuzhou, Jiangsu, China
2
State Key Laboratory of Advanced Electromagnetic Technology, International Joint Research Laboratory of Magnetic Confinement Fusion and Plasma Physics, School of Electrical and Electronic Engineering, Huazhong University of Science and Technology, 430074, Wuhan, China
a
zflin@jsnu.edu.cn
b
zhengwei@hust.edu.cn
Received:
6
February
2025
Accepted:
12
May
2025
Published online:
30
May
2025
Accurate prediction of disruptions is essential for ensuring the safe operation of tokamaks. However, achieving high accuracy in data-driven disruption prediction models requires a substantial amount of experimental disruption data, which is not a feasible option for tokamaks, especially for future large devices. Data augmentation by adding Gaussian noise is an effective method to increase the dataset size, which can enhance the model’s robustness against various types of noise and improve the model’s generalization capabilities. Before data augmentation, the disruption prediction model of back propagation (BP) neural network is optimized by the ant colony optimization algorithm (ACO) and the adaptive boosting (AdaBoost) algorithm. The area under the receiver operating characteristic curve (AUC) achieves 0.9382 in this improved model. Based on the ACO-BP-AdaBoost model, different data augmentation strategies with various augmentation ratios are investigated. It is observed that data augmentation leads to an increment in AUC across all ratios. The best performance (AUC = 0.9677) of the model is obtained when the disruptive data are augmented fourfold and the non-disruptive data are doubled. Both the warning time and true positive rate are approaching the minimum requirements of ITER. Even when the training data size is decreased to 30%, the AUC of this model with data augmentation can be higher than 0.93, demonstrating the effectiveness of data augmentation in performance improving under relatively small target samples. However, with only 10% of the training data size, the performance of this improved model decreases significantly, which may be due to insufficient disruptive targets to learn features for prediction. Although the performance of this ACO-BP-AdaBoost model coupled with data augmentation does not completely satisfy ITER requirements, it provides a potential to expand the database by adding Gaussian noise, which may be helpful for disruption prediction.
See Wang et al. (https://doi.org/10.1088/1741-4326/ac3aff) for the J-TEXT Team.
Copyright comment Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
© The Author(s), under exclusive licence to EDP Sciences, Springer-Verlag GmbH Germany, part of Springer Nature 2025
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.