Description
I see a huge performance regression on all my projects since #8328 landed (revision a8c3fe4). Things are 50 to 75% slower. I'm fairly sure #8328 is the cause: when I revert the compiler to the revision right before it (67c954e), performance goes back to normal.
For what it’s worth, the affected projects are 100% generic and rely heavily on cross-crate inlining. They do a lot of numeric computation and array indexing. Sorry for being a bit vague, but I cannot run my projects under valgrind because valgrind started to segfault a few days ago (perhaps since jemalloc was re-enabled)…
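To give an idea of the shape of the affected code, here is a minimal, hypothetical sketch (the names and bounds are made up, not taken from the actual projects): a fully generic numeric routine whose speed depends entirely on the small operator impls it calls being inlined across crate boundaries.

```rust
use std::num::Zero;

// Hypothetical example of generic numeric code that relies on
// cross-crate inlining: every `+` and `*` below is a call into the
// operator impls of N, which must be inlined for this to be fast.
#[inline]
fn dot<N: Zero + Add<N, N> + Mul<N, N> + Clone>(a: &[N], b: &[N]) -> N {
    let mut res: N = Zero::zero();
    for i in range(0u, a.len()) {
        res = res + a[i].clone() * b[i].clone();
    }
    res
}

fn main() {
    let a = ~[1.0f64, 2.0, 3.0];
    let b = ~[4.0f64, 5.0, 6.0];
    assert!(dot(a, b) == 32.0);
}
```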
I tried to come up with a small benchmark exhibiting the problem. It is not that significant, but the following already shows a noticeable performance regression:
```rust
extern mod extra;

use std::hashmap;
use extra::test::BenchHarness;

#[bench]
fn bench_insert_std(bh: &mut BenchHarness) {
    let mut m = hashmap::HashMap::with_capacity(32);

    do bh.iter {
        for i in range(0u, 500) {
            m.insert((i, i), i);
        }
    }
}

#[bench]
fn bench_insert_find_remove_std(bh: &mut BenchHarness) {
    let mut m = hashmap::HashMap::with_capacity(32);

    do bh.iter {
        for i in range(0u, 200) {
            m.insert((i, i), i);
        }
        for i in range(0u, 200) {
            assert!(*m.find(&(i, i)).unwrap() == i)
        }
        for i in range(100u, 200) {
            m.remove(&(i, i));
        }
        for i in range(100u, 200) {
            assert!(m.find(&(i, i)).is_none())
        }
        for i in range(0u, 100) {
            m.insert((i, i), i * 2);
        }
        for i in range(0u, 100) {
            assert!(*m.find(&(i, i)).unwrap() == i * 2)
        }
        for i in range(0u, 100) {
            m.remove(&(i, i));
        }
        for i in range(0u, 100) {
            assert!(m.find(&(i, i)).is_none())
        }
    }
}

fn main() {
}
```
Compiled with --opt-level=3.
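(For reference, a benchmark like this is typically built and run with `rustc --test --opt-level=3 bench.rs` followed by `./bench --bench`; the `bench.rs` file name is just an assumption here.)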
With the (new) compiler a8c3fe4, I get:
test bench_insert_find_remove_std ... bench: 89242 ns/iter (+/- 3605)
test bench_insert_std ... bench: 46177 ns/iter (+/- 1555)
With the (old) compiler 67c954e, I get results that are roughly 15 to 20% faster. The asm dump is smaller too:
test bench_insert_find_remove_std ... bench: 73939 ns/iter (+/- 2872)
test bench_insert_std ... bench: 38482 ns/iter (+/- 1005)
Activity
alexcrichton commented on Aug 21, 2013
LLVM may have updated its optimization passes, and we may just need to update the order in which we run ours. @thestinger, you mentioned loop vectorization as being a big thing at lower optimization levels; do you know of any analysis passes that we might be missing by default?
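For reference, a minimal, illustrative example of the kind of loop the loop vectorizer targets; this is not from the benchmark above, and whether the pass actually fires also depends on the surrounding analyses and on bounds checks being optimized away first.

```rust
// Illustrative only: a straight-line reduction that LLVM's loop
// vectorizer can turn into SIMD code, provided the vectorization and
// supporting analysis passes actually run, and provided the slice
// bounds checks are eliminated first.
fn sum(xs: &[u32]) -> u32 {
    let mut acc = 0u32;
    for i in range(0u, xs.len()) {
        acc += xs[i];
    }
    acc
}

fn main() {
    let v = ~[1u32, 2, 3, 4];
    assert!(sum(v) == 10);
}
```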
alexcrichton commented on Aug 21, 2013
After talking with @thestinger on IRC, we've reached the conclusion that one major change we can make is reworking how the LLVM passes are run. I'm not certain that this is the source of the regression you're seeing, but it's the best one that I can think of related to updating LLVM.
After looking into this, we do a large number of things differently from clang.
Long story short, investigation into how we're running LLVM passes shows that it probably needs to be reworked and simplified to use more existing LLVM infrastructure and to match clang more closely. I'm still not certain that this is the cause of this regression, but I'll attempt to determine if it is by making these changes.
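For concreteness, a hedged sketch of what that rework roughly amounts to: rather than hand-assembling a pass list, hand the optimization level to LLVM's PassManagerBuilder and let it populate the pass managers the same way clang does. The LLVM-C entry points named below are real, but the Rust-side declarations and the optimize_like_clang function are purely illustrative and are not rustc's actual passes.rs code.

```rust
use std::libc::{c_int, c_uint};

// Opaque handles standing in for LLVM's C types (illustrative).
pub enum Module {}
pub enum PassManager {}
pub enum PassManagerBuilder {}
pub type ModuleRef = *Module;
pub type PassManagerRef = *PassManager;
pub type PassManagerBuilderRef = *PassManagerBuilder;

// These LLVM-C functions exist; linking against LLVM is needed to build.
extern {
    fn LLVMCreatePassManager() -> PassManagerRef;
    fn LLVMRunPassManager(pm: PassManagerRef, m: ModuleRef) -> c_int;
    fn LLVMPassManagerBuilderCreate() -> PassManagerBuilderRef;
    fn LLVMPassManagerBuilderSetOptLevel(pmb: PassManagerBuilderRef, opt: c_uint);
    fn LLVMPassManagerBuilderUseInlinerWithThreshold(pmb: PassManagerBuilderRef,
                                                     threshold: c_uint);
    fn LLVMPassManagerBuilderPopulateModulePassManager(pmb: PassManagerBuilderRef,
                                                       pm: PassManagerRef);
    fn LLVMPassManagerBuilderDispose(pmb: PassManagerBuilderRef);
}

// Illustrative only: let PassManagerBuilder pick and order the passes,
// the same way clang populates its pass managers at -O2/-O3.
unsafe fn optimize_like_clang(module: ModuleRef, opt_level: uint) {
    let pm = LLVMCreatePassManager();
    let pmb = LLVMPassManagerBuilderCreate();
    LLVMPassManagerBuilderSetOptLevel(pmb, opt_level as c_uint);
    // clang's inliner threshold at -O3 is 275
    LLVMPassManagerBuilderUseInlinerWithThreshold(pmb, 275 as c_uint);
    LLVMPassManagerBuilderPopulateModulePassManager(pmb, pm);
    LLVMPassManagerBuilderDispose(pmb);
    LLVMRunPassManager(pm, module);
}

fn main() {}
```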
cc @Aatch, you were the one who recently added all the PassManager stuff to passes.rs, just wanted to make sure that none of this is alarming to you.

sebcrozet commented on Aug 21, 2013
Thanks for all those investigations!
I merged together a few files from my project to give you a better stand-alone example of the regression.
The example is big (298 lines) and mostly unreadable, but it exhibits a slowdown greater than 85% after the LLVM update. I thought it might be useful for you to have an example of a big slowdown for debugging.
Here is the example: https://gist.github.com/sebcrozet/6300848
Again, compiled with --opt-level=3.
With the compiler after the LLVM update (revision a8c3fe4) I get:
With the compiler before the LLVM update (revision 67c954e) I get:
alexcrichton commented on Aug 23, 2013
Sadly my suspicions were not correct. Your example is fantastic though, and I'm very perplexed as to what's going on with it. I did profile your code, and I get the same 10x slowness with the new LLVM upgrade. What's very suspicious to me are the top functions in the profile of that code. Sorry, but I haven't figured out yet how to get profiles on OS X outside of Instruments...
Before the LLVM upgrades (stage0 today):

After #8700 (with LLVM upgrades and refactored pass handling):

The exact test I ran was the one above, but with the iterations turned up to 1000 so it takes a bit longer. I'm very confused about why the timing machinery shows up so heavily in one of the profiles, so I decided to run with different code.
With this code, I get the following profiles:
Before:


After:
I'm disturbed by the fact that there are transmutes in the profile, but there shouldn't be any timing information weirdness going on here. I'll continue investigating later, but I wanted to post my findings here before I forget them.

graydon commented on Aug 23, 2013
Transmute showing up means a change in inlining.
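A minimal, hypothetical illustration of why: transmute is a bit-for-bit reinterpretation that compiles to nothing, so a tiny wrapper like the one below can only show up in a profile if its call sites stopped being inlined.

```rust
use std::cast;

// A no-op at the machine level: transmute only reinterprets bits.
// With inlining working, this function cannot appear in a profile;
// if it does, calls to it are no longer being inlined.
#[inline(always)]
fn bits_of(x: f64) -> u64 {
    unsafe { cast::transmute(x) }
}

fn main() {
    assert!(bits_of(0.0) == 0);
}
```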
thestinger commented on Aug 23, 2013
I identified the root cause of this issue, so I'm closing this in favour of #8720.