diff options
| author | Huon Wilson <dbau.pp+github@gmail.com> | 2014-11-04 20:59:00 +1100 |
|---|---|---|
| committer | Huon Wilson <dbau.pp+github@gmail.com> | 2014-11-05 00:02:43 +1100 |
| commit | 071c4110457358f5b91287a63fb782201f5eb7ad (patch) | |
| tree | 6624b61d00253667e709ed7b9e40ed60410e8d18 /src/rustllvm/RustWrapper.cpp | |
| parent | ff50f24feb512075f3fcf9fc2801e4c853ff9316 (diff) | |
| download | rust-071c4110457358f5b91287a63fb782201f5eb7ad.tar.gz rust-071c4110457358f5b91287a63fb782201f5eb7ad.zip | |
Translate SIMD construction as `insertelement`s and a single store.
This almost completely avoids GEPi's and pointer manipulation,
postponing it until the end with one big write of the whole vector. This
leads to a small speed-up in compilation, and makes it easier for LLVM
to work with the values, e.g. with `--opt-level=0`,
pub fn foo() -> f32x4 {
f32x4(0.,0.,0.,0.)
}
was previously compiled to
define <4 x float> @_ZN3foo20h74913e8b13d89666eaaE() unnamed_addr #0 {
entry-block:
%sret_slot = alloca <4 x float>
%0 = getelementptr inbounds <4 x float>* %sret_slot, i32 0, i32 0
store float 0.000000e+00, float* %0
%1 = getelementptr inbounds <4 x float>* %sret_slot, i32 0, i32 1
store float 0.000000e+00, float* %1
%2 = getelementptr inbounds <4 x float>* %sret_slot, i32 0, i32 2
store float 0.000000e+00, float* %2
%3 = getelementptr inbounds <4 x float>* %sret_slot, i32 0, i32 3
store float 0.000000e+00, float* %3
%4 = load <4 x float>* %sret_slot
ret <4 x float> %4
}
but now becomes
define <4 x float> @_ZN3foo20h74913e8b13d89666eaaE() unnamed_addr #0 {
entry-block:
ret <4 x float> zeroinitializer
}
Diffstat (limited to 'src/rustllvm/RustWrapper.cpp')
0 files changed, 0 insertions, 0 deletions
