Auto merge of #39642 - stjepang:specialize-slice-partialord, r=alexcrichton - rust

diff options

author	bors <bors@rust-lang.org>	2017-02-11 04:37:27 +0000
committer	bors <bors@rust-lang.org>	2017-02-11 04:37:27 +0000
commit	f140a6c6effa9fe11f97373d995e6c0d977b509f (patch)
tree	4034fda15457a4b02094cd845786789081dc8b68 /src/rustllvm/RustWrapper.cpp
parent	064a0ee131b3129fcad68570975ccc85d0fb54d0 (diff)
parent	a344c126d03729a9d147f18dfc9cc6432bc790fd (diff)
download	rust-f140a6c6effa9fe11f97373d995e6c0d977b509f.tar.gz rust-f140a6c6effa9fe11f97373d995e6c0d977b509f.zip

Auto merge of #39642 - stjepang:specialize-slice-partialord, r=alexcrichton

Specialize `PartialOrd<A> for [A] where A: Ord`

This way we can call `cmp` instead of `partial_cmp` in the loop, removing some burden of optimizing `Option`s away from the compiler.

PR #39538 introduced a regression where sorting slices suddenly became slower, since `slice1.lt(slice2)` was much slower than `slice1.cmp(slice2) == Less`. This problem is now fixed.

To verify, I benchmarked this simple program:
```rust
fn main() {
    let mut v = (0..2_000_000).map(|x| x * x * x * 18913515181).map(|x| vec![x, x ^ 3137831591]).collect::<Vec<_>>();
    v.sort();
}
```

Before this PR, it would take 0.95 sec, and now it takes 0.58 sec.
I also tried changing the `is_less` lambda to use `cmp` and `partial_cmp`. Now all three versions (`lt`, `cmp`, `partial_cmp`) are equally performant for sorting slices - all of them take 0.58 sec on the
benchmark.

Tangentially, as soon as we get `default impl`, it might be a good idea to implement a blanket default impl for `lt`, `gt`, `le`, `ge` in terms of `cmp` whenever possible. Today, those four functions by default are only implemented in terms of `partial_cmp`.

r? @alexcrichton

Diffstat (limited to 'src/rustllvm/RustWrapper.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: