codegen: tweak/extend shift comments
r? `@scottmcm`
|
|
If we don't do this, some versions of LLVM (at least 17, experimentally)
will double-emit some error messages, which is how I noticed this. Given
that it seems to cost some extra work, let's only request summary bitcode
production if we'll actually write it out; otherwise, skip it.
|
|
Typical uses of ThinLTO don't have any use for this as a standalone
file, but distributed ThinLTO uses this to make the linker phase more
efficient. With clang you'd do something like `clang -flto=thin
-fthin-link-bitcode=foo.indexing.o -c foo.c` and then get both foo.o
(full of bitcode) and foo.indexing.o (just the summary or index part of
the bitcode). That's then usable by a two-stage linking process that's
more friendly to distributed build systems like Bazel, which is why I'm
working on this area.
I talked some to @teresajohnson about naming in this area, as things
seem to be a little confused between various blog posts and build
systems. "bitcode index" and "bitcode summary" tend to be a little too
ambiguous, and she tends to use "thin link bitcode" and "minimized
bitcode" (which matches the descriptions in LLVM). Since the clang
option is thin-link-bitcode, I went with that to try and not add a new
spelling in the world.
Per @dtolnay, you can work around the lack of this by using `lld
--thinlto-index-only` to do the indexing on regular .o files of
bitcode, but that is a bit wasteful on actions when we already have all
the information in rustc and could just write out the matching minimized
bitcode. I didn't test that at all in our infrastructure, because by the
time I learned that I already had this patch largely written.
|
|
These types are currently rejected for `as` casts by the compiler.
Remove this incorrect check and add codegen tests for all conversions
involving these types.
|
|
I added `PlaceValue` in 123775, but kept that one line-by-line simple because it touched so many places.
This goes through to add more helpers & docs, and change some `PlaceRef` to `PlaceValue` where the type didn't need to be included.
No behaviour changes.
|
|
Stop using LLVM struct types for alloca
The alloca type has no semantic meaning; only the size matters (and the alignment, but we specify that explicitly). Using `[N x i8]` is a more direct way to specify that we want `N` bytes, and avoids relying on LLVM's struct layout. It is likely that a future LLVM version will change to an untyped alloca representation.
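As a rough sketch of the shape this enables (names hypothetical, not the real `rustc_codegen_ssa` API): the builder only needs a size and an alignment to hand out a stack slot, never a type.

```rust
// Minimal sketch, assuming hypothetical builder names: allocas are requested
// by size and alignment alone, so the backend can emit the equivalent of
// `alloca [N x i8], align A` without consulting LLVM's struct layout.
#[derive(Clone, Copy)]
struct Size(u64);
#[derive(Clone, Copy)]
struct Align(u64);

trait Builder {
    type Value;
    /// Allocate `size` bytes of stack space with alignment `align`.
    fn alloca(&mut self, size: Size, align: Align) -> Self::Value;
}
```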
Split out from #121577.
r? `@ghost`
|
|
Make `PlaceRef` hold a `PlaceValue` (like `OperandRef` does)
|
|
That's how it was in `store_with_flags` before this PR, so let's do that here too just to be sure we get the right thing.
|
|
I added this back in 111999, but I no longer think it's a good idea:
- It had to get scaled back to only power-of-two sizes to avoid breaking a bunch of targets
- LLVM seems to be getting better at memcpy removal anyway
- Introducing vector instructions has sometimes (115515) seemed to make autovectorization worse
So this removes it from the codegen crates entirely, and instead tries to use <https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/traits/builder/trait.BuilderMethods.html#method.typed_place_copy> rather than a direct `memcpy`, so things will still use load/store for immediates.
|
|
Add `Ord::cmp` for primitives as a `BinOp` in MIR
Update: most of this OP was written months ago. See https://github.com/rust-lang/rust/pull/118310#issuecomment-2016940014 below for where we got to recently that made it ready for review.
---
There are dozens of reasonable ways to implement `Ord::cmp` for integers using comparison, bit-ops, and branches. Those differences are irrelevant at the rust level, however, so we can make things better by adding `BinOp::Cmp` at the MIR level:
1. Exactly how to implement it is left up to the backends, so LLVM can use whatever pattern its optimizer best recognizes and cranelift can use whichever pattern codegens the fastest.
2. By not inlining those details for every use of `cmp`, we drastically reduce the amount of MIR generated for `derive`d `PartialOrd`, while also making it more amenable to MIR-level optimizations.
Having extremely careful `if` ordering to μoptimize resource usage on broadwell (#63767) is great, but it really feels to me like libcore is the wrong place to put that logic. Similarly, using subtraction [tricks](https://graphics.stanford.edu/~seander/bithacks.html#CopyIntegerSign) (#105840) is arguably even nicer, but depends on the optimizer understanding it (https://github.com/llvm/llvm-project/issues/73417) to be practical. Or maybe [bitor is better than add](https://discourse.llvm.org/t/representing-in-ir/67369/2?u=scottmcm)? But maybe only on a future version that [has `or disjoint` support](https://discourse.llvm.org/t/rfc-add-or-disjoint-flag/75036?u=scottmcm)? And just because one of those forms happens to be good for LLVM, there's no guarantee that it'd be the same form that GCC or Cranelift would rather see -- especially given their very different optimizers. Not to mention that if LLVM gets a spaceship intrinsic -- [which it should](https://rust-lang.zulipchat.com/#narrow/stream/131828-t-compiler/topic/Suboptimal.20inlining.20in.20std.20function.20.60binary_search.60/near/404250586) -- we'll need at least a rustc intrinsic to be able to call it.
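As a minimal illustration of that choice space (plain Rust, not rustc code): a backend could lower `BinOp::Cmp` to either of these equivalent patterns, among others.

```rust
// Two equivalent three-way-compare lowerings for signed integers; with
// `BinOp::Cmp`, picking between patterns like these is the backend's job.
fn cmp_branchy(a: i32, b: i32) -> i8 {
    if a < b { -1 } else if a == b { 0 } else { 1 }
}

fn cmp_branchless(a: i32, b: i32) -> i8 {
    (a > b) as i8 - (a < b) as i8
}

fn main() {
    for &(a, b) in &[(1, 2), (2, 2), (3, 2)] {
        assert_eq!(cmp_branchy(a, b), cmp_branchless(a, b));
    }
}
```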
As for simplifying it in Rust, we now regularly inline `{integer}::partial_cmp`, but it's quite a large amount of IR. The best way to see that is with https://github.com/rust-lang/rust/commit/8811efa88b25b5e41d63850e6047e8257c677858#diff-d134c32d028fbe2bf835fef2df9aca9d13332dd82284ff21ee7ebf717bfa4765R113 -- I added a new pre-codegen MIR test for a simple 3-tuple struct, and this PR changes it from 36 locals and 26 basic blocks down to 24 locals and 8 basic blocks. Even better, as soon as the construct-`Some`-then-match-it-in-same-BB noise is cleaned up, this'll expose the `Cmp == 0` branches clearly in MIR, so that an InstCombine (#105808) can simplify that to just a `BinOp::Eq` and thus fix some of our generated code perf issues. (Tracking that through today's `if a < b { Less } else if a == b { Equal } else { Greater }` would be *much* harder.)
---
r? `@ghost`
But first I should check that perf is ok with this
~~...and my true nemesis, tidy.~~
|
|
Unbox and unwrap the contents of `StatementKind::Coverage`
The payload of coverage statements was historically a structure with several fields, so it was boxed to avoid bloating `StatementKind`.
Now that the payload is a single relatively-small enum, we can replace `Box<Coverage>` with just `CoverageKind`.
This patch also adds a size assertion for `StatementKind`, to avoid accidentally bloating it in the future.
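As a self-contained sketch of that guard (rustc itself uses its `static_assert_size!` macro; the stand-in types and byte limit here are illustrative):

```rust
// Illustrative only: a const assertion that fails to compile if the enum grows.
enum CoverageKind { SpanMarker, BlockMarker(u32) } // stand-in for the real payload
enum StatementKind { Coverage(CoverageKind), Nop } // stand-in for the real enum
const _: () = assert!(std::mem::size_of::<StatementKind>() <= 16);
fn main() {}
```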
``@rustbot`` label +A-code-coverage
|
|
We already use `Instance` at declaration sites, when available, to glean
additional information about possible abstractions of the type in use.
This does the same at callsites as well, when possible.
The primary purpose of this change is to allow CFI to alter how it
generates type information for indirect calls through `Virtual`
instances.
|
|
Let codegen decide when to `mem::swap` with immediates
Making `libcore` decide this is silly; the backend has so much better information about when it's a good idea.
Thus this PR introduces a new `typed_swap` intrinsic with a fallback body, and replaces that fallback implementation when swapping immediates or scalar pairs.
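A minimal sketch of what such a fallback body amounts to (simplified; not the exact libcore source):

```rust
use core::ptr;

/// Hedged sketch of a generic fallback for a `typed_swap`-style intrinsic;
/// the backend replaces this for immediates and scalar pairs.
///
/// # Safety
/// `x` and `y` must be valid for reads and writes, and must not overlap.
pub unsafe fn typed_swap_fallback<T>(x: *mut T, y: *mut T) {
    ptr::swap_nonoverlapping(x, y, 1);
}

fn main() {
    let (mut a, mut b) = (1, 2);
    unsafe { typed_swap_fallback(&mut a, &mut b) };
    assert_eq!((a, b), (2, 1));
}
```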
r? oli-obk
Replaces #111744, and means we'll never need more libs PRs like #111803 or #107140
|
|
The payload of coverage statements was historically a structure with several
fields, so it was boxed to avoid bloating `StatementKind`.
Now that the payload is a single relatively-small enum, we can replace
`Box<Coverage>` with just `CoverageKind`.
This patch also adds a size assertion for `StatementKind`, to avoid
accidentally bloating it in the future.
|
|
Backend and target selection is a mess: the target can override the
backend (via `Target::default_codegen_backend`), *and* the backend can
override the target (via `CodegenBackend::target_override`).
The code that handles this is ugly. It calls `build_target_config`
twice, once before getting the backend and once again afterward. It also
must check that both overrides aren't triggering at the same time.
This commit removes the latter override. It's used in rust-gpu but
@eddyb said via Zulip that removing it would be ok. This simplifies the
code greatly, and will allow some nice follow-up refactorings.
|
|
Making `libcore` decide this is silly; the backend has so much better information about when it's a good idea.
So introduce a new `typed_swap` intrinsic with a fallback body, but replace that implementation for immediates and scalar pairs.
|
|
Allow codegen backends to opt-out of parallel codegen
This makes it a bit easier to write cursed codegen backends.
|
|
Add asm goto support to `asm!`
Tracking issue: #119364
This PR implements asm-goto support, using the syntax described in "future possibilities" section of [RFC2873](https://rust-lang.github.io/rfcs/2873-inline-asm.html#asm-goto).
Currently I have only implemented the `label` part, not the `fallthrough` part (i.e. fallthrough is implicit). This doesn't reduce expressiveness, though, since you can use label-break to get arbitrary control flow, or simply set a value and rely on the jump threading optimisation to get the desired control flow. I can add that later if deemed necessary.
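A hedged sketch of what using the `label` operand looks like (nightly feature gate and x86-64 assumed; see the tracking issue for the authoritative syntax):

```rust
#![feature(asm_goto)]
use std::arch::asm;

fn main() {
    unsafe {
        // The asm may jump to the `label` block; after the block runs,
        // execution continues past the `asm!` invocation. Fallthrough
        // (not jumping at all) is implicit.
        asm!(
            "jmp {}",
            label {
                println!("jumped to the label block");
            },
        );
    }
}
```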
r? ``@Amanieu``
cc ``@ojeda``
|
|
Always generate GEP i8 / ptradd for struct offsets
This implements #98615, and goes a bit further to remove `struct_gep` entirely.
Upstream LLVM is in the beginning stages of [migrating to `ptradd`](https://discourse.llvm.org/t/rfc-replacing-getelementptr-with-ptradd/68699). LLVM 19 will [canonicalize](https://github.com/llvm/llvm-project/pull/68882) all constant-offset GEPs to i8, which has roughly the same effect as this change.
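To see what an `i8` GEP expresses at the Rust level: a struct-field projection is just a constant byte offset from the base pointer (sketch; `repr(C)` used so the offset is stable):

```rust
use core::mem::offset_of;

#[repr(C)]
struct S { a: u32, b: u64 }

// The equivalent of a `getelementptr i8` / `ptradd`: base pointer + byte offset.
fn project_b(s: &S) -> *const u64 {
    let p: *const S = s;
    unsafe { p.byte_add(offset_of!(S, b)) }.cast()
}

fn main() {
    let s = S { a: 1, b: 2 };
    assert_eq!(unsafe { *project_b(&s) }, 2);
}
```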
Fixes #121719.
Split out from #121577.
r? `@nikic`
|
|
Make changes necessary to support these types in the compiler.
|
|
Add "algebraic" fast-math intrinsics, based on fast-math ops that cannot return poison
Setting all of LLVM's fast-math flags makes our fast-math intrinsics very dangerous, because some inputs then trigger UB. This set of flags still permits common algebraic transformations, but according to the [LangRef](https://llvm.org/docs/LangRef.html#fastmath), only the flags `nnan` (no NaNs) and `ninf` (no infs) can produce poison.
And this uses the algebraic float ops to fix https://github.com/rust-lang/rust/issues/120720
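For a sense of the intended use, a sketch of the dot-product case from that issue (nightly-only; intrinsic names as added by this work):

```rust
#![feature(core_intrinsics)]
use std::intrinsics::{fadd_algebraic, fmul_algebraic};

// The algebraic ops license reassociation (and thus vectorization) without
// making any input UB; they cannot themselves return poison.
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter()
        .zip(b)
        .fold(0.0, |acc, (&x, &y)| fadd_algebraic(acc, fmul_algebraic(x, y)))
}

fn main() {
    assert_eq!(dot(&[1.0, 2.0], &[3.0, 4.0]), 11.0);
}
```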
cc `@orlp`
|
|
Make `CodegenBackend::join_codegen` infallible.
Because they all are, in practice.
r? ```@bjorn3```
|
|
Because they all are, in practice.
|
|
Change `rustc_codegen_ssa`'s `atomic_cmpxchg` interface to return a pair of values
Doesn't change much, but a little nicer that way.
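For intuition, the same two results are visible in the stable atomics API; the builder interface now hands back that pair directly instead of one aggregate value (illustration, not the backend code):

```rust
use std::sync::atomic::{AtomicU32, Ordering};

fn main() {
    let a = AtomicU32::new(5);
    // A compare-exchange conceptually yields two values: the previous
    // contents and a success flag.
    let (old, ok) = match a.compare_exchange(5, 10, Ordering::SeqCst, Ordering::SeqCst) {
        Ok(v) => (v, true),
        Err(v) => (v, false),
    };
    assert!(ok);
    assert_eq!(old, 5);
    assert_eq!(a.load(Ordering::SeqCst), 10);
}
```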
|
|
Change `rustc_codegen_ssa`'s `atomic_cmpxchg` interface to return a pair of values
|
|
They're not used in `rustc_session`, and `rustc_metadata` is a more
obvious location.
`MetadataLoader` was originally put into `rustc_session` in #41565 to
avoid a dependency on LLVM, but things have changed a lot since then and
that's no longer relevant, e.g. `rustc_codegen_llvm` depends on
`rustc_metadata`.
|
|
Fix misuses of a vs an
Fixes the misuse of "a" vs "an", according to English grammatical
expectations and using https://www.a-or-an.com/
|
|
Signed-off-by: cui fliter <imcusg@gmail.com>
|
|
Co-authored-by: Max Fan <git@max.fan>
Co-authored-by: Nikita Popov <npopov@redhat.com>