Надежность и качество сложных систем (May 2024)

FORMATION OF A METHOD FOR INCREASING THE RELIABILITY OF SPECIALIZED COMPUTERS

  • Aleksandr A. Pavlov,
  • Aleksey N. Tsarkov,
  • Yuriy A. Romanenko,
  • Igor I. Korneev,
  • Alexander Yu. Romanenko,
  • Maxim I. Makeev,
  • Fedor A. Pavlov

DOI
https://doi.org/10.21685/2307-4205-2024-1-7
Journal volume & issue
no. 1

Abstract

Read online

Background. Development of a methodological apparatus for constructing means to ensure the required level of probability of failure-free operation and survivability of self-healing specialized computers (SEVMs). During the research, an analysis of operating conditions and a selection of reliability indicators for computers of aerospace equipment was carried out. Requirements for computer backup methods are formulated. A comparative assessment of the detecting ability and hardware costs was carried out when implementing the majority redundancy method, the duplication method and the use of correcting codes. Materials and methods. The expediency of using the duplication method is substantiated to increase the probability of failure-free operation and survivability of self-healing central computers, using algebraic linear codes to identify a faulty backup channel. Unlike known codes, it is proposed to use an algebraic linear code, in which the values of the check bits correspond to the direct and inverse values of the information bits, which makes it possible to detect errors when reading information from the inverse outputs of the memory, correct single errors, detect double errors and control the logical inversion operation, necessary to represent a negative number in two's complement code, which makes it possible to adapt the code to control arithmetic and logical operations of the computer processor. An assessment was made of the probability of failure-free operation of a duplicated computer, with its general redundancy, with detection and correction of single errors in the backup memory channels and detection of errors in the backup channels of the processor ALU based on the proposed code, and an assessment of the probability of failure-free operation of the computer, with its separate redundancy, with error detection in the backup channels of the duplicated memory based on the Hamming code and error correction in the backup channels of the processor ALU based on the majority method. Results and conclusions. As a result of comparing the probabilities of failure-free operation, it was established that the general redundancy of the computer based on the proposed code, in comparison with the separate redundancy of the memory and ALU of the computer processor, allows for a gain in the probability of failurefree operation of the computer and its functional devices throughout the entire period of operation.

Keywords