| author | Alex Crichton <alex@alexcrichton.com> | 2014-10-12 17:25:51 -0700 |
|---|---|---|
| committer | Alex Crichton <alex@alexcrichton.com> | 2014-10-13 08:52:15 -0700 |
| commit | f7b54703d07ab89e92157706440e75104c4790aa (patch) | |
| tree | baeba7af8f670eb0b7f80d328ef93f4d8072c5eb /src/rustllvm/RustWrapper.cpp | |
| parent | ff0abf05c9852df629df4a1ebe2e4276a87fc4f9 (diff) | |
bench: Improve the spectralnorm shootout benchmark
This improves the spectralnorm shootout benchmark through a few vectors after looking at the leading C implementation:

* The simd-based f64x2 is now used to parallelize a few computations
* RWLock usage has been removed. A custom `parallel` function was added as a form of stack-based fork-join parallelism. I found that the contention on the locks was high as well as hindering other optimizations.

This does, however, introduce one `unsafe` block into the benchmarks, which previously had none.

In terms of timings, the before and after numbers are:

```
$ time ./shootout-spectralnorm-before
./shootout-spectralnorm-before  2.07s user 0.71s system 324% cpu 0.857 total

$ time ./shootout-spectralnorm-before 5500
./shootout-spectralnorm-before 5500  11.88s user 1.13s system 459% cpu 2.830 total

$ time ./shootout-spectralnorm-after
./shootout-spectralnorm-after  0.58s user 0.01s system 280% cpu 0.210 total

$ time ./shootout-spectralnorm-after 5500
./shootout-spectralnorm-after 5500  3.55s user 0.01s system 455% cpu 0.783 total
```
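For readers unfamiliar with the pattern, the stack-based fork-join idea behind `parallel` can be sketched roughly as follows. This is not the code from this commit: it is a minimal illustration written against present-day Rust, assuming `std::thread::scope` (stable since Rust 1.63), which allows borrowing stack data without an `unsafe` block. The signature of `parallel`, the `n_threads` parameter, and the helpers `a` and `mult_av` are hypothetical names chosen for the example.

```rust
use std::thread;

/// Hypothetical fork-join helper (not the commit's implementation):
/// split `v` into roughly equal chunks, run `f` on each chunk in its own
/// scoped thread, and return only once every thread has finished.
/// `f` receives the chunk's starting index in `v` and the chunk itself.
fn parallel<T, F>(v: &mut [T], n_threads: usize, f: F)
where
    T: Send,
    F: Fn(usize, &mut [T]) + Sync,
{
    if v.is_empty() {
        return;
    }
    let n = n_threads.max(1);
    // Ceiling division, so at most `n` chunks are produced.
    let size = (v.len() + n - 1) / n;
    thread::scope(|s| {
        for (i, chunk) in v.chunks_mut(size).enumerate() {
            let f = &f;
            s.spawn(move || f(i * size, chunk));
        }
        // All spawned threads are joined when the scope ends.
    });
}

/// Entry (i, j) of the infinite matrix used by the spectralnorm benchmark.
fn a(i: usize, j: usize) -> f64 {
    1.0 / (((i + j) * (i + j + 1) / 2 + i + 1) as f64)
}

/// Computes out = A * v, with each chunk of `out` filled by a separate thread.
/// The thread count of 4 is an arbitrary choice for this sketch.
fn mult_av(v: &[f64], out: &mut [f64]) {
    parallel(out, 4, |start, chunk| {
        for (k, slot) in chunk.iter_mut().enumerate() {
            let i = start + k;
            *slot = v.iter().enumerate().map(|(j, x)| a(i, j) * x).sum();
        }
    });
}
```

Because each thread owns a disjoint `&mut` chunk of the output and only reads the shared input, no lock is needed, which is the property the commit message relies on when it removes the RWLock.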
Diffstat (limited to 'src/rustllvm/RustWrapper.cpp')
0 files changed, 0 insertions, 0 deletions
