Abstract

DEVELOPMENT OF SOFTWARE AND MATHEMATICAL FRAMEWORK FOR ESTIMATION OF DISTORTIONS IN SQL-QUERY RESULTS UNDER CONDITIONS OF DECREASING DATA QUALITY
Abstract: This work addresses the practical application of data management standards to improving data quality in relational DBMSs. It investigates how distortions in a target data set affect the results of SQL queries executed over that set, focusing on distortions detected through data quality indicators for completeness. An evaluation algorithm is proposed that decomposes an SQL query into elementary relational algebra operations (extended projection, restriction, union, and Cartesian product) and tracks how empty values propagate through each operation. The problem of ranking the identified sets of empty values is formulated so that an effective data filling process can be built, and an algorithm that solves it is presented. For testing, an experimental test bed was built on the Open University open data set, and the proposed algorithms were implemented on it. The experimental results confirmed that the algorithm for assessing the propagation of empty values can be applied to determine the order in which they are filled.
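The idea of tracking empty values through elementary operations can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Cell` type, the function names, the table and column names, and the ranking criterion (the number of empty result cells or dropped rows that trace back to each empty source cell) are all assumptions made for illustration. Union is shown as a bag union, and a restriction drops rows whose tested value is empty, mirroring SQL's three-valued logic.

```python
from collections import Counter
from typing import Any, NamedTuple


class Cell(NamedTuple):
    value: Any
    origins: frozenset  # ids of the empty source cells this value depends on


def load(table, rows):
    """Wrap raw rows; each None becomes a tracked origin (table, row, column)."""
    return [
        {
            col: Cell(val, frozenset([(table, i, col)]) if val is None else frozenset())
            for col, val in row.items()
        }
        for i, row in enumerate(rows)
    ]


def extended_projection(rel, exprs):
    """exprs maps an output column to (function, list of input columns)."""
    result = []
    for row in rel:
        new_row = {}
        for out_col, (fn, in_cols) in exprs.items():
            cells = [row[c] for c in in_cols]
            origins = frozenset().union(*(c.origins for c in cells))
            if any(c.value is None for c in cells):
                # An empty input makes the computed value empty as well.
                new_row[out_col] = Cell(None, origins)
            else:
                new_row[out_col] = Cell(fn(*(c.value for c in cells)), origins)
        result.append(new_row)
    return result


def restriction(rel, col, predicate):
    """Keep rows where predicate holds; rows with an empty value are dropped."""
    kept, lost = [], Counter()
    for row in rel:
        cell = row[col]
        if cell.value is None:
            lost.update(cell.origins)  # predicate is unknown, so the row is lost
        elif predicate(cell.value):
            kept.append(row)
    return kept, lost


def union_all(a, b):
    """Bag union: empty values from both inputs are carried over unchanged."""
    return a + b


def cartesian_product(a, b):
    """Pair every row of a with every row of b (column names assumed distinct)."""
    return [{**ra, **rb} for ra in a for rb in b]


def rank_origins(rel, lost=None):
    """Rank empty source cells by how many result cells or rows they distort."""
    impact = Counter(lost or {})
    for row in rel:
        for cell in row.values():
            if cell.value is None:
                impact.update(cell.origins)
    return impact.most_common()


# Hypothetical data: three scores (one empty) and two weights (one empty).
scores = load("scores", [{"sid": 1, "score": 80}, {"sid": 2, "score": None}, {"sid": 3, "score": 70}])
weights = load("weights", [{"weight": 0.5}, {"weight": None}])
projected = extended_projection(
    cartesian_product(scores, weights),
    {"weighted": (lambda s, w: s * w, ["score", "weight"])},
)
ranking = rank_origins(projected)
```

In this toy case the empty weight distorts three result cells and the empty score distorts two, so the empty weight would be ranked first for filling. A real ranking would likely also account for the cost of filling and for query frequency, which this sketch ignores.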
For citation: Dukhovenskiy S. E., Nikulchev E. V. Development of software and mathematical framework for estimation of distortions in SQL-query results under conditions of decreasing data quality // Electronic Scientific Journal IT-Standard. – 2025. – No. 3. – pp. .