https://doi.org/10.1140/epjs/s11734-025-01777-8
Regular Article
Two-parameter model of synthetic distortions in the problem of assessing the readability of distorted texts
1
Cybersecurity CPS Department, HSE University, Moscow, Russia
2
RUDN University, RUDN, Moscow, Russia
3
Institute of Control Science of RAS, ICS RAS, Moscow, Russia
Received:
28
May
2025
Accepted:
27
June
2025
Published online:
15
July
2025
The paper proposes a two-parameter model of random synthetic text distortions. The model provides for distortions both at the level of text symbols and at the level of words. The distortions introduced by the proposed model are close to the distortions that occur when recognition systems (automatic speech recognition and optical character recognition) operate in noise. The model is used to study the redundancy of text in natural languages and to analyze the possibility of its automatic processing under noise conditions. Estimates of the readability of text distorted using the proposed model are obtained.
Copyright comment Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
© The Author(s), under exclusive licence to EDP Sciences, Springer-Verlag GmbH Germany, part of Springer Nature 2025
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.