path: root/tests/codegen/intrinsics
Age  Commit message  (Author, lines removed/added)
2025-01-17  Update our range `assume`s to the format that LLVM prefers  (Scott McMurray, -55/+67)
2025-01-05  Expand the `select_unpredictable` test for ZSTs  (Trevor Gross, -0/+5)
For ZSTs there is no selection that needs to take place, so assert that no `select` statement is emitted.
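A minimal sketch of what such a ZST codegen test might look like; the function name and CHECK lines are illustrative assumptions, not copied from the actual test file:

```rust
#![feature(core_intrinsics)]
#![crate_type = "lib"]

// CHECK-LABEL: @zst_select
#[no_mangle]
pub fn zst_select(p: bool, a: (), b: ()) {
    // There is nothing to choose between two zero-sized values, so no
    // `select` should appear in the emitted IR.
    // CHECK-NOT: select
    core::intrinsics::select_unpredictable(p, a, b)
}
```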
2025-01-05  Merge the intrinsic and user tests for `select_unpredictable`  (Trevor Gross, -1/+33)
[1] mentions that having a single test with `-Zmerge-functions=disabled` is preferable to having two separate tests. Apply that to the new `select_unpredictable` test here.
[1]: https://github.com/rust-lang/rust/pull/133964#issuecomment-2569693325
2025-01-03  Update carrying_mul_add test to tolerate `nuw`  (Matthew Maurer, -2/+2)
LLVM 20 adds `nuw` to the GEP operations in this code; tolerate them.
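One common way to "tolerate" an optional flag in a FileCheck-based codegen test is a regex group; the line below is a hedged illustration of the pattern, not the actual change:

```rust
// A regex group makes the `nuw` flag optional, so the test passes on LLVM
// versions both with and without it.
// CHECK: getelementptr inbounds{{( nuw)?}} i8, ptr
```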
2024-12-30  Auto merge of #134757 - RalfJung:const_swap, r=scottmcm  (bors, -6/+6)
stabilize const_swap

libs-api FCP passed in https://github.com/rust-lang/rust/issues/83163. However, I only just realized that this actually involves an intrinsic. The intrinsic could be implemented entirely with existing stable const functionality, but we chose to make it a primitive to be able to detect more UB. So nominating for `@rust-lang/lang` to make sure they are aware; I leave it up to them whether they want to FCP this.

While at it, I also renamed the intrinsic to make the "nonoverlapping" constraint more clear.

Fixes #83163
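A minimal sketch of what the stabilization enables on the user side (the example is mine, not from the PR): `mem::swap` can now be called in const contexts, bottoming out in the renamed intrinsic.

```rust
const fn swapped() -> (u8, u8) {
    let (mut a, mut b) = (1, 2);
    // `mem::swap` is now callable in const contexts.
    core::mem::swap(&mut a, &mut b);
    (a, b)
}

// Evaluated at compile time: the values really did swap.
const _: () = {
    let p = swapped();
    assert!(p.0 == 2 && p.1 == 1);
};
```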
2024-12-27  Override `carrying_mul_add` in cg_llvm  (Scott McMurray, -0/+137)
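A hedged sketch of the semantics being specialized here (the portable meaning of the operation, not the cg_llvm lowering): multiply two words, add two more, and split the double-width result into low and high halves.

```rust
// For u64 the whole computation fits in u128, so it can be written directly;
// the point of overriding it in cg_llvm is to get better code than this.
fn carrying_mul_add_u64(a: u64, b: u64, c: u64, d: u64) -> (u64, u64) {
    let wide = a as u128 * b as u128 + c as u128 + d as u128;
    (wide as u64, (wide >> 64) as u64) // (low, high)
}
```

Chaining steps like this is the building block of wide multiplication, which is why a tight lowering matters.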
2024-12-25  rename typed_swap → typed_swap_nonoverlapping  (Ralf Jung, -6/+6)
2024-11-17  Likely unlikely fix  (Jiri Bobek, -12/+90)
2024-10-23  Set `signext` or `zeroext` for integer arguments on RISC-V  (Asuna, -4/+2)
2024-09-04  Don't codegen `expect` in opt-level=0  (clubby789, -1/+1)
2024-08-19  Don't generate functions with the `rustc_intrinsic_must_be_overridden` attribute  (DianQK, -0/+14)
2024-08-12  make the codegen test also cover an ill-behaved arch, and add links  (Ralf Jung, -3/+12)
2024-08-05  nontemporal_store: make sure that the intrinsic is truly just a hint  (Ralf Jung, -2/+17)
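For reference, the intrinsic in question is unstable behind `core_intrinsics`; the wrapper below is a hedged illustration of "just a hint": the store must still behave as a normal, coherent store, with the non-temporal part only affecting caching.

```rust
#![feature(core_intrinsics)]

/// Store `v` while hinting that it will not be read again soon.
/// Illustrative wrapper only; the safety requirements are those of a raw store.
unsafe fn store_streaming(dst: *mut u32, v: u32) {
    core::intrinsics::nontemporal_store(dst, v);
}
```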
2024-07-30  Auto merge of #128250 - Amanieu:select_unpredictable, r=nikic  (bors, -0/+35)
Add `select_unpredictable` to force LLVM to use CMOV

Since https://reviews.llvm.org/D118118, LLVM will no longer turn CMOVs into branches if it comes from a `select` marked with an `unpredictable` metadata attribute. This PR introduces `core::intrinsics::select_unpredictable` which emits such a `select` and uses it in the implementation of `binary_search_by`.
2024-07-29  Reformat `use` declarations.  (Nicholas Nethercote, -3/+2)
The previous commit updated `rustfmt.toml` appropriately. This commit is the outcome of running `x fmt --all` with the new formatting options.
2024-07-28  Force LLVM to use CMOV for binary search  (Amanieu d'Antras, -0/+35)
Since https://reviews.llvm.org/D118118, LLVM will no longer turn CMOVs into branches if it comes from a `select` marked with an `unpredictable` metadata attribute. This PR introduces `core::intrinsics::select_unpredictable` which emits such a `select` and uses it in the implementation of `binary_search_by`.
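A hedged sketch (not the actual libcore code) of how the hint gets used: in a binary-search step the branch direction is essentially random, so a conditional move tends to beat a mispredicted branch.

```rust
#![feature(core_intrinsics)]
use core::intrinsics::select_unpredictable;

// Pick the next search base without a data-dependent branch.
fn next_base(base: usize, half: usize, go_right: bool) -> usize {
    select_unpredictable(go_right, base + half, base)
}
```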
2024-05-31  Run rustfmt on `tests/codegen/`.  (Nicholas Nethercote, -15/+12)
Except for `simd-intrinsic/`, which has a lot of files containing multiple types like `u8x64` which really are better when hand-formatted. There is a surprising amount of two-space indenting in this directory.

Non-trivial changes:
- `rustfmt::skip` needed in `debug-column.rs` to preserve meaning of the test.
- `rustfmt::skip` used in a few places where hand-formatting read more nicely: `enum/enum-match.rs`
- Line number adjustments needed for the expected output of `debug-column.rs` and `coroutine-debug.rs`.
2024-05-28  Add an intrinsic for `ptr::metadata`  (Scott McMurray, -0/+36)
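The library-level surface of this intrinsic is `core::ptr::metadata`, still unstable under `ptr_metadata`; a small sketch of what it returns:

```rust
#![feature(ptr_metadata)]

fn main() {
    let s: &[u8] = &[1, 2, 3];
    // For slice pointers the metadata is the length...
    assert_eq!(core::ptr::metadata(s), 3);
    // ...and for thin pointers it is the unit type.
    let x = 5u32;
    assert_eq!(core::ptr::metadata(&x), ());
}
```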
2024-05-16  Fix ICE in non-operand `aggregate_raw_ptr` intrinsic codegen  (Scott McMurray, -0/+23)
2024-04-24  Fix tests and bless  (Gary Guo, -1/+0)
2024-04-24  Auto merge of #122053 - erikdesjardins:alloca, r=nikic  (bors, -8/+8)
Stop using LLVM struct types for alloca

The alloca type has no semantic meaning; only the size (and alignment, but we specify it explicitly) matters. Using `[N x i8]` is a more direct way to specify that we want `N` bytes, and avoids relying on LLVM's struct layout. It is likely that a future LLVM version will change to an untyped alloca representation.

Split out from #121577.

r? `@ghost`
2024-04-23  Rollup merge of #124003 - WaffleLapkin:dellvmization, r=scottmcm,RalfJung,antoyo  (Matthias Krüger, -0/+118)
Dellvmize some intrinsics (use `u32` instead of `Self` in some integer intrinsics)

This implements https://github.com/rust-lang/compiler-team/issues/693 minus what was implemented in #123226.

Note: I decided to _not_ change `shl`/... builder methods, as it just doesn't seem worth it.

r? `@scottmcm`
2024-04-22  Stabilize generic `NonZero`.  (Markus Reiter, -1/+0)
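A small sketch of the now-stable generic form (my example, not from the PR): a single `NonZero<T>` type constructor replaces the per-width `NonZeroU32`-style types.

```rust
use core::num::NonZero;

// `trailing_zeros` on a NonZero value never hits the all-zero case.
fn lowest_set_bit(n: NonZero<u32>) -> u32 {
    n.trailing_zeros()
}

fn main() {
    let n = NonZero::new(40u32).expect("non-zero");
    assert_eq!(lowest_set_bit(n), 3);
}
```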
2024-04-16  Add codegen tests for changed intrinsics  (Maybe Waffle, -0/+118)
2024-04-11  use [N x i8] for alloca types  (Erik Desjardins, -8/+8)
2024-04-02  Auto merge of #118310 - scottmcm:three-way-compare, r=davidtwco  (bors, -0/+47)
Add `Ord::cmp` for primitives as a `BinOp` in MIR

Update: most of this OP was written months ago. See https://github.com/rust-lang/rust/pull/118310#issuecomment-2016940014 below for where we got to recently that made it ready for review.

---

There are dozens of reasonable ways to implement `Ord::cmp` for integers using comparison, bit-ops, and branches. Those differences are irrelevant at the rust level, however, so we can make things better by adding `BinOp::Cmp` at the MIR level:

1. Exactly how to implement it is left up to the backends, so LLVM can use whatever pattern its optimizer best recognizes and cranelift can use whichever pattern codegens the fastest.
2. By not inlining those details for every use of `cmp`, we drastically reduce the amount of MIR generated for `derive`d `PartialOrd`, while also making it more amenable to MIR-level optimizations.

Having extremely careful `if` ordering to μoptimize resource usage on broadwell (#63767) is great, but it really feels to me like libcore is the wrong place to put that logic. Similarly, using subtraction [tricks](https://graphics.stanford.edu/~seander/bithacks.html#CopyIntegerSign) (#105840) is arguably even nicer, but depends on the optimizer understanding it (https://github.com/llvm/llvm-project/issues/73417) to be practical. Or maybe [bitor is better than add](https://discourse.llvm.org/t/representing-in-ir/67369/2?u=scottmcm)? But maybe only on a future version that [has `or disjoint` support](https://discourse.llvm.org/t/rfc-add-or-disjoint-flag/75036?u=scottmcm)? And just because one of those forms happens to be good for LLVM, there's no guarantee that it'd be the same form that GCC or Cranelift would rather see -- especially given their very different optimizers. Not to mention that if LLVM gets a spaceship intrinsic -- [which it should](https://rust-lang.zulipchat.com/#narrow/stream/131828-t-compiler/topic/Suboptimal.20inlining.20in.20std.20function.20.60binary_search.60/near/404250586) -- we'll need at least a rustc intrinsic to be able to call it.

As for simplifying it in Rust, we now regularly inline `{integer}::partial_cmp`, but it's quite a large amount of IR. The best way to see that is with https://github.com/rust-lang/rust/commit/8811efa88b25b5e41d63850e6047e8257c677858#diff-d134c32d028fbe2bf835fef2df9aca9d13332dd82284ff21ee7ebf717bfa4765R113 -- I added a new pre-codegen MIR test for a simple 3-tuple struct, and this PR changes it from 36 locals and 26 basic blocks down to 24 locals and 8 basic blocks. Even better, as soon as the construct-`Some`-then-match-it-in-same-BB noise is cleaned up, this'll expose the `Cmp == 0` branches clearly in MIR, so that an InstCombine (#105808) can simplify that to just a `BinOp::Eq` and thus fix some of our generated code perf issues. (Tracking that through today's `if a < b { Less } else if a == b { Equal } else { Greater }` would be *much* harder.)

---

r? `@ghost`
But first I should check that perf is ok with this ~~...and my true nemesis, tidy.~~
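A hedged sketch of the kind of code this shrinks (illustrative, not the test added in the PR): a lexicographic three-way compare over a small tuple, where each field comparison can now lower to a single `BinOp::Cmp`.

```rust
use std::cmp::Ordering;

fn cmp3(a: (u16, i8, u32), b: (u16, i8, u32)) -> Ordering {
    a.0.cmp(&b.0)
        .then_with(|| a.1.cmp(&b.1))
        .then_with(|| a.2.cmp(&b.2))
}
```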
2024-03-25  Don't emit load metadata in debug mode  (clubby789, -7/+6)
2024-03-23  Add+Use `mir::BinOp::Cmp`  (Scott McMurray, -0/+47)
2024-03-17  Stop whining, tidy  (Scott McMurray, -0/+1)
2024-03-17  Let codegen decide when to `mem::swap` with immediates  (Scott McMurray, -0/+77)
Making `libcore` decide this is silly; the backend has so much better information about when it's a good idea. So introduce a new `typed_swap` intrinsic with a fallback body, but replace that implementation for immediates and scalar pairs.
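A hedged sketch of what such a fallback body could look like (not the real libcore implementation); the caller guarantees valid, properly aligned, nonoverlapping pointers, and backends are free to replace it with an immediate-based swap.

```rust
use core::ptr;

unsafe fn typed_swap_fallback<T>(x: *mut T, y: *mut T) {
    // SAFETY: per the caller's contract, both pointers are valid for reads
    // and writes and do not overlap, so a read + copy + write round-trip
    // moves each value into the other slot exactly once.
    unsafe {
        let tmp = ptr::read(x);
        ptr::copy_nonoverlapping(y, x, 1);
        ptr::write(y, tmp);
    }
}
```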
2024-03-11  Lower transmutes from int to pointer type as gep on null  (Ben Kimock, -2/+2)
2024-02-25  Use generic `NonZero` in tests.  (Markus Reiter, -3/+3)
2024-02-22  [AUTO_GENERATED] Migrate compiletest to use `ui_test`-style `//@` directives  (许杰友 Jieyou Xu (Joe), -22/+22)
2023-12-15  Separate immediate and in-memory ScalarPair representation  (Nikita Popov, -5/+5)
Currently, we assume that ScalarPair is always represented using a two-element struct, both as an immediate value and when stored in memory.

This currently works fairly well, but runs into problems with https://github.com/rust-lang/rust/pull/116672, where a ScalarPair involving an i128 type can no longer be represented as a two-element struct in memory. For example, the tuple `(i32, i128)` needs to be represented in-memory as `{ i32, [3 x i32], i128 }` to satisfy alignment requirements. Using `{ i32, i128 }` instead will result in the second element being stored at the wrong offset (prior to LLVM 18).

Resolve this issue by no longer requiring that the immediate and in-memory type for ScalarPair are the same. The in-memory type will now look the same as for normal struct types (and will include padding filler and similar), while the immediate type stays a simple two-element struct type. This also means that booleans in immediate ScalarPair are now represented as i1 rather than i8, just like we do everywhere else.

The core change here is to llvm_type (which now treats ScalarPair as a normal struct) and immediate_llvm_type (which returns the two-element struct that llvm_type used to produce). The rest is fixing things up to no longer assume these are the same. In particular, this switches places that try to get pointers to the ScalarPair elements to use byte-geps instead of struct-geps.
2023-08-06  Add a new `compare_bytes` intrinsic instead of calling `memcmp` directly  (Scott McMurray, -0/+34)
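The intrinsic itself is compiler-internal, but its contract is the familiar memcmp one; a hedged, portable sketch of the same semantics (the names are mine):

```rust
use core::cmp::Ordering;

/// memcmp-style three-way comparison of `n` bytes.
/// SAFETY: both pointers must be valid for reading `n` bytes.
unsafe fn compare_bytes_fallback(a: *const u8, b: *const u8, n: usize) -> i32 {
    let a = unsafe { core::slice::from_raw_parts(a, n) };
    let b = unsafe { core::slice::from_raw_parts(b, n) };
    match a.cmp(b) {
        Ordering::Less => -1,
        Ordering::Equal => 0,
        Ordering::Greater => 1,
    }
}
```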
2023-07-27  CHECK only for opaque ptr  (Josh Stone, -2/+2)
2023-07-27  Update the minimum external LLVM to 15  (Josh Stone, -4/+0)
2023-07-08  Always name the return place.  (Camille GILLOT, -61/+57)
2023-05-31  Add a distinct `OperandValue::ZeroSized` variant for ZSTs  (Scott McMurray, -4/+71)
These tend to have special handling in a bunch of places anyway, so the variant helps remember that. And I think it's easier to grok than non-Scalar Aggregates sometimes being `Immediates` (like I got wrong and caused 109992). As a minor bonus, it means we don't need to generate poison LLVM values for them to pass around in `OperandValue::Immediate`s.
2023-04-27  Also use `mir::Offset` for pointer `add`  (Scott McMurray, -0/+34)
2023-04-22  Add `intrinsics::transmute_unchecked`  (Scott McMurray, -33/+9)
This takes a whole 3 lines in `compiler/` since it lowers to `CastKind::Transmute` in MIR *exactly* the same as the existing `intrinsics::transmute` does, it just doesn't have the fancy checking in `hir_typeck`. Added to enable experimenting with the request in <https://github.com/rust-lang/rust/pull/106281#issuecomment-1496648190> and because the portable-simd folks might be interested for dependently-sized array-vector conversions. It also simplifies a couple places in `core`.
2023-04-13  `assume` value ranges in `transmute`  (Scott McMurray, -4/+188)
Fixes #109958
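A hedged sketch of what the test is checking for (the function and CHECK lines are assumptions, not copied from the real test): when the destination type has a restricted validity range, codegen can tell LLVM about it, e.g. via `llvm.assume`.

```rust
// CHECK-LABEL: @u32_to_char
#[no_mangle]
pub unsafe fn u32_to_char(x: u32) -> char {
    // The result is known to lie within `char`'s valid value range.
    // CHECK: call void @llvm.assume
    std::mem::transmute(x)
}
```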
2023-04-09  Handle not all immediates having `abi::Scalar`s  (Scott McMurray, -1/+92)
2023-04-06  Check `CastKind::Transmute` sizes in a better way  (Scott McMurray, -1/+73)
Fixes #110005
2023-04-04  Allow `transmute`s to produce `OperandValue`s instead of always using `alloca`s  (Scott McMurray, -23/+130)
LLVM can usually optimize these away, but especially for things like transmutes of newtypes it's silly to generate the `alloca`+`store`+`load` at all when it's actually a nop at the LLVM level.
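A hedged illustration of the newtype case (the type, function, and CHECK lines are mine): at the LLVM level this transmute is a no-op, so no stack traffic should be needed for it.

```rust
#[repr(transparent)]
pub struct Meters(pub f64);

// CHECK-LABEL: @wrap_meters
#[no_mangle]
pub fn wrap_meters(x: f64) -> Meters {
    // CHECK-NOT: alloca
    unsafe { std::mem::transmute(x) }
}
```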
2023-03-22  Add `CastKind::Transmute` to MIR  (Scott McMurray, -0/+196)
Updates `interpret`, `codegen_ssa`, and `codegen_cranelift` to consume the new cast instead of the intrinsic. Includes `CastTransmute` for custom MIR building, to be able to test the extra UB.
2023-01-17  Add more codegen tests  (Nilstrieb, -4/+5)
2023-01-17  Put `noundef` on all scalars that don't allow uninit  (Nilstrieb, -2/+2)
Previously, it was only put on scalars with range validity invariants like bool, as uninit was obviously invalid for those. Since then, we have normatively declared all uninit primitives to be undefined behavior and can therefore put `noundef` on them.

The remaining concern was the `mem::uninitialized` function, which caused quite a lot of UB in the older parts of the ecosystem. This function now doesn't return uninit values anymore, making users of it safe from this change. The only real sources of UB where people could encounter uninit primitives are `MaybeUninit::uninit().assume_init()`, which has always been clear in the docs about being UB, and heap allocations (like reading from the spare capacity of a vec). This is hopefully rare enough to not break anything.
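A hedged sketch of the attribute in question (the CHECK lines are assumptions, not the real test): integer arguments, whose uninit values are now normatively UB, get `noundef` in the generated signature.

```rust
// CHECK-LABEL: @passthrough
// CHECK-SAME: i32 noundef
#[no_mangle]
pub fn passthrough(x: u32) -> u32 {
    x
}
```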
2023-01-11  Move /src/test to /tests  (Albert Larsan, -0/+328)