Possible performance loss with f32 arithmetic

I've tried, out of curiosity, a floating point arithmetic test and found quite a big difference between C++ and Rust.

The code used in rust
```rust
pub struct Stats
{
    x: f32,
    y: f32,
    z: f32
}

pub fn sum(a: &Stats, b: &Stats) -> Stats
{
    Stats {
        x: a.x + b.x,
        y: a.y + b.y,
        z: a.z + b.z
    }
}
```

The code used in C++
```cpp
struct Stats
{
    float x;
    float y;
    float z;
};

Stats sum(const Stats &a, const Stats &b)
{
    return Stats {
        a.x + b.x,
        a.y + b.y,
        a.z + b.z
    };
}
```

Here is a link to a godbolt for side-by-side comparision of assembly output: https://godbolt.org/z/dqc4b74rv

Rust seem to absolutely want the floats back into e* registers instead of keeping them in xmm registers, C++ leaves them into the xmm registers. In some cases it might more advantageous to leave the floats in xmm registers for future operations on them rather then passing them back into the e* registers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Possible performance loss with f32 arithmetic #91447

clang (trunk with `-O0`)

rustc (1.56 with `-C opt-level=0 --emit=llvm-ir`)

rustc (1.47 with `-C opt-level=0 --emit=llvm-ir`)

17 remaining items

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Possible performance loss with f32 arithmetic #91447

Description

Activity

Urgau commented on Dec 2, 2021

clang (trunk with -O0)

rustc (1.56 with -C opt-level=0 --emit=llvm-ir)

rustc (1.47 with -C opt-level=0 --emit=llvm-ir)

Urgau commented on Dec 2, 2021

Yuri6037 commented on Dec 2, 2021

MSxDOS commented on Dec 3, 2021

17 remaining items

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions

clang (trunk with `-O0`)

rustc (1.56 with `-C opt-level=0 --emit=llvm-ir`)

rustc (1.47 with `-C opt-level=0 --emit=llvm-ir`)