rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2024-03-17	make PassMode::Cast consistently copy between Rust/ABI representation	Erik Desjardins	-41/+27
	Previously, we did this slightly incorrectly for return values, and didn't do it at all for arguments.
2024-03-17	make CastTarget::size and CastTarget::llvm_type consistent, remove	Erik Desjardins	-28/+24
	special case that's not present in Clang Making the methods consistent doesn't require much justification. It's required for us to generate correct code. The special case was present near the end of `CastTarget::llvm_type`, and resulted in the final integer component of the ABI type being shrunk to the smallest integer that fits. You can see this in action here (https://godbolt.org/z/Pe73cr91d), where, for a struct with 5 u16 elements, rustc generates `{ i64, i16 }`, while Clang generates `[2 x i64]`. This special case was added a long time ago, when the function was originally written [1]. That commit consolidated logic from many backends, and in some of the code it deleted, sparc64 [2] and powerpc64 [3] had similar special cases. However, looking at Clang today, it doesn't have this special case for sparc64 (https://godbolt.org/z/YaafvYWdf) or powerpc64 (https://godbolt.org/z/5c3YePTje), so this change just removes it. [1]: https://github.com/rust-lang/rust/commit/f0636b61c7f84962a609e831760db9d77f4f5e14#diff-183c4dadf10704bd1f521b71f71d89bf755c9603a93f894d66c03bb1effc6021R231 [2]: https://github.com/rust-lang/rust/commit/f0636b61c7f84962a609e831760db9d77f4f5e14#diff-2d8f87ea6db6d7f0a6fbeb1d5549adc07e93331278d951a1e051a40f92914436L163-L166 [3]: https://github.com/rust-lang/rust/commit/f0636b61c7f84962a609e831760db9d77f4f5e14#diff-88af4a9df9ead503a5c7774a0455d270dea3ba60e9b0ec1ce550b4c53d3bce3bL172-L175
2024-03-11	copy byval argument to alloca if alignment is insufficient	Erik Desjardins	-49/+55

2024-03-05	use [N x i8] for byval/sret types	Erik Desjardins	-4/+16
	This avoids depending on LLVM's struct types to determine the size of the byval/sret slot.
2024-01-30	Remove the `abi_amdgpu_kernel` feature	clubby789	-1/+0

2023-12-15	Separate immediate and in-memory ScalarPair representation	Nikita Popov	-2/+2
	Currently, we assume that ScalarPair is always represented using a two-element struct, both as an immediate value and when stored in memory. This currently works fairly well, but runs into problems with https://github.com/rust-lang/rust/pull/116672, where a ScalarPair involving an i128 type can no longer be represented as a two-element struct in memory. For example, the tuple `(i32, i128)` needs to be represented in-memory as `{ i32, [3 x i32], i128 }` to satisfy alignment requirement. Using `{ i32, i128 }` instead will result in the second element being stored at the wrong offset (prior to LLVM 18). Resolve this issue by no longer requiring that the immediate and in-memory type for ScalarPair are the same. The in-memory type will now look the same as for normal struct types (and will include padding filler and similar), while the immediate type stays a simple two-element struct type. This also means that booleans in immediate ScalarPair are now represented as i1 rather than i8, just like we do everywhere else. The core change here is to llvm_type (which now treats ScalarPair as a normal struct) and immediate_llvm_type (which returns the two-element struct that llvm_type used to produce). The rest is fixing things up to no longer assume these are the same. In particular, this switches places that try to get pointers to the ScalarPair elements to use byte-geps instead of struct-geps.
2023-12-10	remove redundant imports	surechen	-1/+0
	detects redundant imports that can be eliminated. for #117772 : In order to facilitate review and modification, split the checking code and removing redundant imports code into two PR.
2023-11-21	Fix `clippy::needless_borrow` in the compiler	Nilstrieb	-1/+1
	`x clippy compiler -Aclippy::all -Wclippy::needless_borrow --fix`. Then I had to remove a few unnecessary parens and muts that were exposed now.
2023-11-03	move ABI sanity check from LLVM codegen backend to ABI computation logic	Ralf Jung	-40/+3

2023-10-27	Link to correct issue in PassMode::Direct ptx-kernel exception	Kjetil Kjeka	-1/+1

2023-10-27	Documentation and error message improvements related to PassMode::Direct assert	Kjetil Kjeka	-2/+2
	Co-authored-by: Jubilee <46493976+workingjubilee@users.noreply.github.com>
2023-10-26	NVPTX: Allow PassMode::Direct for ptx kernels for now	Kjetil Kjeka	-1/+6

2023-09-15	cannot have Direct for unsized types	Ralf Jung	-0/+5

2023-09-15	clarify PassMode::Indirect as well	Ralf Jung	-15/+20

2023-09-15	explain PassMode::Cast	Ralf Jung	-8/+10

2023-09-08	the wasm ABI behavior is a bug	Ralf Jung	-9/+6

2023-09-07	extend comments around PassMode::Direct	Ralf Jung	-1/+41

2023-08-26	Use `preserve_mostcc` for `extern "rust-cold"`	Scott McMurray	-1/+3
	As experimentation in 115242 has shown looks better than `coldcc`. And don't use a different convention for cold on Windows, because that actually ends up making things worse. cc tracking issue 97544
2023-08-08	feat: `riscv-interrupt-{m,s}` calling conventions	Seth Pellegrino	-2/+7
	Similar to prior support added for the mips430, avr, and x86 targets this change implements the rough equivalent of clang's [`__attribute__((interrupt))`][clang-attr] for riscv targets, enabling e.g. ```rust static mut CNT: usize = 0; pub extern "riscv-interrupt-m" fn isr_m() { unsafe { CNT += 1; } } ``` to produce highly effective assembly like: ```asm pub extern "riscv-interrupt-m" fn isr_m() { 420003a0: 1141 addi sp,sp,-16 unsafe { CNT += 1; 420003a2: c62a sw a0,12(sp) 420003a4: c42e sw a1,8(sp) 420003a6: 3fc80537 lui a0,0x3fc80 420003aa: 63c52583 lw a1,1596(a0) # 3fc8063c <_ZN12esp_riscv_rt3CNT17hcec3e3a214887d53E.0> 420003ae: 0585 addi a1,a1,1 420003b0: 62b52e23 sw a1,1596(a0) } } 420003b4: 4532 lw a0,12(sp) 420003b6: 45a2 lw a1,8(sp) 420003b8: 0141 addi sp,sp,16 420003ba: 30200073 mret ``` (disassembly via `riscv64-unknown-elf-objdump -C -S --disassemble ./esp32c3-hal/target/riscv32imc-unknown-none-elf/release/examples/gpio_interrupt`) This outcome is superior to hand-coded interrupt routines which, lacking visibility into any non-assembly body of the interrupt handler, have to be very conservative and save the [entire CPU state to the stack frame][full-frame-save]. By instead asking LLVM to only save the registers that it uses, we defer the decision to the tool with the best context: it can more accurately account for the cost of spills if it knows that every additional register used is already at the cost of an implicit spill. At the LLVM level, this is apparently [implemented by] marking every register as "[callee-save]," matching the semantics of an interrupt handler nicely (it has to leave the CPU state just as it found it after its `{m\|s}ret`). This approach is not suitable for every interrupt handler, as it makes no attempt to e.g. save the state in a user-accessible stack frame. For a full discussion of those challenges and tradeoffs, please refer to [the interrupt calling conventions RFC][rfc]. Inside rustc, this implementation differs from prior art because LLVM does not expose the "all-saved" function flavor as a calling convention directly, instead preferring to use an attribute that allows for differentiating between "machine-mode" and "superivsor-mode" interrupts. Finally, some effort has been made to guide those who may not yet be aware of the differences between machine-mode and supervisor-mode interrupts as to why no `riscv-interrupt` calling convention is exposed through rustc, and similarly for why `riscv-interrupt-u` makes no appearance (as it would complicate future LLVM upgrades). [clang-attr]: https://clang.llvm.org/docs/AttributeReference.html#interrupt-risc-v [full-frame-save]: https://github.com/esp-rs/esp-riscv-rt/blob/9281af2ecffe13e40992917316f36920c26acaf3/src/lib.rs#L440-L469 [implemented by]: https://github.com/llvm/llvm-project/blob/b7fb2a3fec7c187d58a6d338ab512d9173bca987/llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp#L61-L67 [callee-save]: https://github.com/llvm/llvm-project/blob/973f1fe7a8591c7af148e573491ab68cc15b6ecf/llvm/lib/Target/RISCV/RISCVCallingConv.td#L30-L37 [rfc]: https://github.com/rust-lang/rfcs/pull/3246
2023-07-29	cg_llvm: remove pointee types and pointercast/bitcast-of-ptr	Erik Desjardins	-13/+4

2023-07-05	Move `TyCtxt::mk_x` to `Ty::new_x` where applicable	Boxy	-1/+1

2023-01-17	Remove double spaces after dots in comments	Maybe Waffle	-1/+1

2023-01-04	cleanup: handle -Zmutable-noalias like -Zbox-noalias	Erik Desjardins	-10/+0

2022-11-11	Improve generating Custom entry function	Ayush Singh	-17/+23
	This commit is aimed at making compiler generated entry functions (Basically just C `main` right now) more generic so other targets can do similar things for custom entry. This was initially implemented as part of https://github.com/rust-lang/rust/pull/100316. Currently, this moves the entry function name and Call convention to the target spec. Signed-off-by: Ayush Singh <ayushsingh1325@gmail.com>
2022-10-01	Merge apply_attrs_callsite into call and invoke	bjorn3	-4/+0
	Some codegen backends are not able to apply callsite attrs after the fact.
2022-09-09	Rollup merge of #99207 - 5225225:msan-eager-checks, r=jackh726	Matthias Krüger	-0/+8
	Enable eager checks for memory sanitizer Fixes #99179
2022-08-26	Move `ArgAbi::pad_i32` into `PassMode::Cast`.	Nicholas Nethercote	-19/+20
	Because it's only needed for that variant. This shrinks the types and clarifies the logic.
2022-08-26	Turn `ArgAbi::pad` into a `bool`.	Nicholas Nethercote	-4/+4
	Because it's only ever set to `None` or `Some(Reg::i32())`.
2022-08-26	Simplify arg capacity calculations.	Nicholas Nethercote	-5/+2
	Currently they try to be very precise. But they are wrong, i.e. they don't match what's happening in the loop below. This code isn't hot enough for it to matter that much.
2022-08-26	Change `FnAbi::args` to a boxed slice.	Nicholas Nethercote	-2/+2

2022-08-26	Change `FnAbi::fixed_count` to a `u32`.	Nicholas Nethercote	-1/+2

2022-08-26	Box `CastTarget` within `PassMode`.	Nicholas Nethercote	-25/+21
	Because `PassMode::Cast` is by far the largest variant, but is relatively rare. This requires making `PassMode` not impl `Copy`, and `Clone` is no longer necessary. This causes lots of sigil adjusting, but nothing very notable.
2022-08-14	Emit noundef even for unoptimised code if msan is on	5225225	-0/+8

2022-07-27	Add elementtype attributes for llvm.arm.ldrex/strex intrinsics	Nikita Popov	-0/+16
	These intrinsics (and a few more, but there are the only ones exposed by stdarch) require an elementtype attribute in LLVM 15.
2022-07-13	Rename `debugging_opts` to `unstable_opts`	Joshua Nelson	-1/+1
	This is no longer used only for debugging options (e.g. `-Zoutput-width`, `-Zallow-features`). Rename it to be more clear.
2022-05-30	Add support for emitting functions with `coldcc` in LLVM	Scott McMurray	-0/+1
	The eventual goal is to try using this for things like the internal panicking stuff, to see whether it helps.
2022-04-05	Mark scalar layout unions so that backends that do not support partially ↵	Oli Scherer	-2/+2
	initialized scalars can special case them.
2022-03-03	Pass LLVM string attributes as string slices	Tomasz Miąsko	-2/+1

2022-02-26	just put smallvec lengths in the signature	Erik Desjardins	-5/+2

2022-02-26	Add LLVM attributes in batches instead of individually	Erik Desjardins	-129/+85
	This should improve performance.
2022-02-26	Auto merge of #94127 - erikdesjardins:debugattr, r=nikic	bors	-49/+60
	At opt-level=0, apply only ABI-affecting attributes to functions This should provide a small perf improvement for debug builds, and should more than cancel out the perf regression from adding noundef (https://github.com/rust-lang/rust/pull/93670#issuecomment-1038347581, #94106). r? `@nikic`
2022-02-20	Improve `unused_unsafe` lint	Frank Steffahn	-7/+5
	Main motivation: Fixes some issues with the current behavior. This PR is more-or-less completely re-implementing the unused_unsafe lint; it’s also only done in the MIR-version of the lint, the set of tests for the `-Zthir-unsafeck` version no longer succeeds (and is thus disabled, see `lint-unused-unsafe.rs`). On current nightly, ```rs unsafe fn unsf() {} fn inner_ignored() { unsafe { #[allow(unused_unsafe)] unsafe { unsf() } } } ``` doesn’t create any warnings. This situation is not unrealistic to come by, the inner `unsafe` block could e.g. come from a macro. Actually, this PR even includes removal of one unused `unsafe` in the standard library that was missed in a similar situation. (The inner `unsafe` coming from an external macro hides the warning, too.) The reason behind this problem is how the check currently works: * While generating MIR, it already skips nested unsafe blocks (i.e. unsafe nested in other unsafe) so that the inner one is always the one considered unused * To differentiate the cases of no unsafe operations inside the `unsafe` vs. a surrounding `unsafe` block, there’s some ad-hoc magic walking up the HIR to look for surrounding used `unsafe` blocks. There’s a lot of problems with this approach besides the one presented above. E.g. the MIR-building uses checks for `unsafe_op_in_unsafe_fn` lint to decide early whether or not `unsafe` blocks in an `unsafe fn` are redundant and ought to be removed. ```rs unsafe fn granular_disallow_op_in_unsafe_fn() { unsafe { #[deny(unsafe_op_in_unsafe_fn)] { unsf(); } } } ``` ``` error: call to unsafe function is unsafe and requires unsafe block (error E0133) --> src/main.rs:13:13 \| 13 \| unsf(); \| ^^^^^^ call to unsafe function \| note: the lint level is defined here --> src/main.rs:11:16 \| 11 \| #[deny(unsafe_op_in_unsafe_fn)] \| ^^^^^^^^^^^^^^^^^^^^^^ = note: consult the function's documentation for information on how to avoid undefined behavior warning: unnecessary `unsafe` block --> src/main.rs:10:5 \| 9 \| unsafe fn granular_disallow_op_in_unsafe_fn() { \| --------------------------------------------- because it's nested under this `unsafe` fn 10 \| unsafe { \| ^^^^^^ unnecessary `unsafe` block \| = note: `#[warn(unused_unsafe)]` on by default ``` Here, the intermediate `unsafe` was ignored, even though it contains a unsafe operation that is not allowed to happen in an `unsafe fn` without an additional `unsafe` block. Also closures were problematic and the workaround/algorithms used on current nightly didn’t work properly. (I skipped trying to fully understand what it was supposed to do, because this PR uses a completely different approach.) ```rs fn nested() { unsafe { unsafe { unsf() } } } ``` ``` warning: unnecessary `unsafe` block --> src/main.rs:10:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block 10 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block \| = note: `#[warn(unused_unsafe)]` on by default ``` vs ```rs fn nested() { let _ = \|\| unsafe { let _ = \|\| unsafe { unsf() }; }; } ``` ``` warning: unnecessary `unsafe` block --> src/main.rs:9:16 \| 9 \| let _ = \|\| unsafe { \| ^^^^^^ unnecessary `unsafe` block \| = note: `#[warn(unused_unsafe)]` on by default warning: unnecessary `unsafe` block --> src/main.rs:10:20 \| 10 \| let _ = \|\| unsafe { unsf() }; \| ^^^^^^ unnecessary `unsafe` block ``` note that this warning kind-of suggests that both* unsafe blocks are redundant* -------------------------------------------------------------------------------- I also dislike the fact that it always suggests keeping the outermost `unsafe`. E.g. for ```rs fn granularity() { unsafe { unsafe { unsf() } unsafe { unsf() } unsafe { unsf() } } } ``` I prefer if `rustc` suggests removing the more-course outer-level `unsafe` instead of the fine-grained inner `unsafe` blocks, which it currently does on nightly: ``` warning: unnecessary `unsafe` block --> src/main.rs:10:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block 10 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block \| = note: `#[warn(unused_unsafe)]` on by default warning: unnecessary `unsafe` block --> src/main.rs:11:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block 10 \| unsafe { unsf() } 11 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block warning: unnecessary `unsafe` block --> src/main.rs:12:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block ... 12 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block ``` -------------------------------------------------------------------------------- Needless to say, this PR addresses all these points. For context, as far as my understanding goes, the main advantage of skipping inner unsafe blocks was that a test case like ```rs fn top_level_used() { unsafe { unsf(); unsafe { unsf() } unsafe { unsf() } unsafe { unsf() } } } ``` should generate some warning because there’s redundant nested `unsafe`, however every single `unsafe` block _does_ contain some statement that uses it. Of course this PR doesn’t aim change the warnings on this kind of code example, because the current behavior, warning on all the inner `unsafe` blocks, makes sense in this case. As mentioned, during MIR building all the unsafe blocks are kept now, and usage is attributed to them. The way to still generate a warning like ``` warning: unnecessary `unsafe` block --> src/main.rs:11:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block 10 \| unsf(); 11 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block \| = note: `#[warn(unused_unsafe)]` on by default warning: unnecessary `unsafe` block --> src/main.rs:12:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block ... 12 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block warning: unnecessary `unsafe` block --> src/main.rs:13:9 \| 9 \| unsafe { \| ------ because it's nested under this `unsafe` block ... 13 \| unsafe { unsf() } \| ^^^^^^ unnecessary `unsafe` block ``` in this case is by emitting a `unused_unsafe` warning for all of the `unsafe` blocks that are _within a used unsafe block_. The previous code had a little HIR traversal already anyways to collect a set of all the unsafe blocks (in order to afterwards determine which ones are unused afterwards). This PR uses such a traversal to do additional things including logic like _always_ warn for an `unsafe` block that’s inside of another used unsafe block. The traversal is expanded to include nested closures in the same go, this simplifies a lot of things. The whole logic around `unsafe_op_in_unsafe_fn` is a little complicated, there’s some test cases of corner-cases in this PR. (The implementation involves differentiating between whether a used unsafe block was used exclusively by operations where `allow(unsafe_op_in_unsafe_fn)` was active.) The main goal was to make sure that code should compile successfully if all the `unused_unsafe`-warnings are addressed _simultaneously_ (by removing the respective `unsafe` blocks) no matter how complicated the patterns of `unsafe_op_in_unsafe_fn` being disallowed and allowed throughout the function are. -------------------------------------------------------------------------------- One noteworthy design decision I took here: An `unsafe` block with `allow(unused_unsafe)` is considered used for the purposes of linting about redundant contained unsafe blocks. So while ```rs fn granularity() { unsafe { //~ ERROR: unnecessary `unsafe` block unsafe { unsf() } unsafe { unsf() } unsafe { unsf() } } } ``` warns for the outer `unsafe` block, ```rs fn top_level_ignored() { #[allow(unused_unsafe)] unsafe { #[deny(unused_unsafe)] { unsafe { unsf() } //~ ERROR: unnecessary `unsafe` block unsafe { unsf() } //~ ERROR: unnecessary `unsafe` block unsafe { unsf() } //~ ERROR: unnecessary `unsafe` block } } } ``` warns on the inner ones.
2022-02-19	always add align attributes	Erik Desjardins	-10/+10

2022-02-18	At opt-level=0, apply only ABI-affecting attributes to functions	Erik Desjardins	-41/+52
	This should provide a small perf improvement for debug builds, and should more than cancel out the regression from adding noundef, which was only significant in debug builds.
2022-02-06	apply noundef explicitly in all cases instead of relying on dereferenceable ↵	Erik Desjardins	-4/+0
	implying it
2022-02-05	Apply noundef attribute to &T, &mut T, Box<T>, bool	Erik Desjardins	-1/+5
	This doesn't handle `char` because it's a bit awkward to distinguish it from u32 at this point in codegen. Note that for some types (like `&Struct` and `&mut Struct`), we already apply `dereferenceable`, which implies `noundef`, so the IR does not change.
2021-12-16	Remove `in_band_lifetimes` from `rustc_codegen_llvm`	LegionMammal978	-11/+11
	See #91867 for more information.
2021-12-02	Auto merge of #91003 - psumbera:sparc64-abi, r=nagisa	bors	-7/+15
	fix sparc64 ABI for aggregates with floating point members Fixes #86163
2021-12-01	fix sparc64 ABI for aggregates with floating point members	Petr Sumbera	-7/+15

2021-11-23	Feat: make cg_ssa get_param borrow the builder mutable	rdambrosio	-1/+1