Home > OS >  Optimisation and strict aliasing
Optimisation and strict aliasing

Time:05-13

My question is regarding a code fragment, such as below:

#include <iostream>

int main() {
    double a = -50;
    std::cout << a << "\n";
    uint8_t* b = reinterpret_cast<uint8_t*>(&a);
    b[7] &= 0x7F;
    std::cout << a << "\n";
    return 0;
}

As far as I can tell I am not breaking any rules and everything is well defined (as noted below I forgot that uint8_t is not allowed to alias other types). There is some implementation defined behavior going on, but for the purpose of this question I don't think that is relevant.

I would expect this code to print -50, then 50 on systems where the double follows the IEEE standard, is 8 bytes long and is stored in little endian format. Now the question is. Does the compiler guarantee that this happens. More specifically, turning on optimisations can the compiler optimise away the middle b[7], either explicitly or implicitly, by simply keeping a in a register through the whole function. The second one obviously could be solved by specifying volatile double a, but is that needed?

Edit: As an a note I (mistakenly) remembered that uint8_t was required to be an alias for unsigned char, but indeed the standard does not specify such. I have also written the question in a way that, yes the compiler can ahead of time know everything here, but modified to

#include <iostream>

int main() {
    double a;
    std::cin >> a;
    std::cout << a << "\n";
    unsigned char* b = reinterpret_cast<unsigned char*>(&a);
    b[7] &= 0x7F;
    std::cout << a << "\n";
    return 0;
}

one can see where the problem might arise. Here the strict aliasing rule is no longer violated, and a is not a compile time constant. Richard Critten's comment however is curious if the aliased data can be examined, but not written, is there a way one can set individual bytes, while still following the standard?

CodePudding user response:

More specifically, turning on optimisations can the compiler optimise away the middle b[7], either explicitly or implicitly, by simply keeping a in a register through the whole function.

The compiler can generate the double value 50 as a constant, and pass that directly to the output function. b can be optimised away completely. Like most optimisation, this is due to the as-if rule:

[intro.abstract]

The semantic descriptions in this document define a parameterized nondeterministic abstract machine. This document places no requirement on the structure of conforming implementations. In particular, they need not copy or emulate the structure of the abstract machine. Rather, conforming implementations are required to emulate (only) the observable behavior of the abstract machine as explained below.


The second one obviously could be solved by specifying volatile double a

That would prevent the optimisation, which would generally be considered to be the opposite of a solution.


Does the compiler guarantee that [50 is printed].

You didn't mention what compiler you are asking about. I'm going to assume that you mean whether the standard guarantees this. It doesn't guarantee that universally. You are relying on several assumptions about the implementation:

  • If sizeof(double) < 8, then you access the object outside of its bounds, and behaviour of the program is undefined.
  • If std::uint8_t is not an a type alias of unsigned char, then it isn't allowed to alias double, and the behaviour of the program is undefined.

Given the assumptions hold and thus behviour is well-defined, then the second output will be of a double value that is like -50, but whose most significant bit(s from 8th forward) of the byte at position 7 will have been set to 0. In case of little endian IEEE-754 representation, that value would be 50. volatile is not needed to guarantee this, and it won't add a guarantee in case the behaviour of the program is undefined.

  • Related