rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2025-05-28	Move supports_parallel from CodegenBackend to ExtraBackendMethods	bjorn3	-7/+7
	It is only relevant when using cg_ssa for driving compilation.
2025-05-28	Remove a couple of uses of interior mutability around statics	bjorn3	-3/+3

2025-05-28	Reduce amount of types that need to be PartialEq	bjorn3	-3/+3

2025-05-28	The personality function is a Function, not a Value	bjorn3	-4/+4

2025-05-28	Remove codegen_unit from MiscCodegenMethods	bjorn3	-2/+0

2025-05-26	Remove usage of FnAbi in codegen_intrinsic_call	bjorn3	-3/+1

2025-05-26	Pass PlaceRef rather than Bx::Value to codegen_intrinsic_call	bjorn3	-1/+2

2025-04-30	Rollup merge of #134232 - bjorn3:naked_asm_improvements, r=wesleywiser	Matthias Krüger	-1/+1
	Share the naked asm impl between cg_ssa and cg_clif This was introduced in https://github.com/rust-lang/rust/pull/128004.
2025-04-29	Update documentation for `fn target_config`	Trevor Gross	-5/+4
	This was missed as part of [1]. [1]: https://github.com/rust-lang/rust/pull/140323
2025-04-27	Implement the internal feature `cfg_target_has_reliable_f16_f128`	Trevor Gross	-3/+10
	Support for `f16` and `f128` is varied across targets, backends, and backend versions. Eventually we would like to reach a point where all backends support these approximately equally, but until then we have to work around some of these nuances of support being observable. Introduce the `cfg_target_has_reliable_f16_f128` internal feature, which provides the following new configuration gates: * `cfg(target_has_reliable_f16)` * `cfg(target_has_reliable_f16_math)` * `cfg(target_has_reliable_f128)` * `cfg(target_has_reliable_f128_math)` `reliable_f16` and `reliable_f128` indicate that basic arithmetic for the type works correctly. The `_math` versions indicate that anything relying on `libm` works correctly, since sometimes this hits a separate class of codegen bugs. These options match configuration set by the build script at [1]. The logic for LLVM support is duplicated as-is from the same script. There are a few possible updates that will come as a follow up. The config introduced here is not planned to ever become stable, it is only intended to replace the build scripts for `std` tests and `compiler-builtins` that don't have any way to configure based on the codegen backend. MCP: https://github.com/rust-lang/compiler-team/issues/866 Closes: https://github.com/rust-lang/compiler-team/issues/866 [1]: https://github.com/rust-lang/rust/blob/555e1d0386f024a8359645c3217f4b3eae9be042/library/std/build.rs#L84-L186
2025-04-14	Pass &mut self to codegen_global_asm	bjorn3	-1/+1

2025-03-24	Auto merge of #133984 - DaniPopes:scmp-ucmp, r=scottmcm	bors	-0/+12
	Lower BinOp::Cmp to llvm.{s,u}cmp.* intrinsics Lowers `mir::BinOp::Cmp` (`three_way_compare` intrinsic) to the corresponding LLVM `llvm.{s,u}cmp.i8.*` intrinsics. These are the intrinsics mentioned in https://github.com/rust-lang/rust/pull/118310, which are now available in LLVM 19. I couldn't find any follow-up PRs/discussions about this, please let me know if I missed something. r? `@scottmcm`
2025-03-11	Auto merge of #137586 - nnethercote:SetImpliedBits, r=bjorn3	bors	-3/+6
	Speed up target feature computation The LLVM backend calls `LLVMRustHasFeature` twice for every feature. In short-running rustc invocations, this accounts for a surprising amount of work. r? `@bjorn3`
2025-03-07	Rollup merge of #137549 - oli-obk:llvm-ffi, r=davidtwco	Matthias Krüger	-11/+8
	Clean up various LLVM FFI things in codegen_llvm cc ```@ZuseZ4``` I touched some autodiff parts The major change of this PR is [bfd88ce](https://github.com/rust-lang/rust/pull/137549/commits/bfd88cead0dd79717f123ad7e9a26ecad88653cb) which makes `CodegenCx` generic just like `GenericBuilder` The other commits mostly took advantage of the new feature of making extern functions safe, but also just used some wrappers that were already there and shrunk unsafe blocks. best reviewed commit-by-commit
2025-03-06	Lower BinOp::Cmp to llvm.{s,u}cmp.* intrinsics	DaniPopes	-0/+12
	Lowers `mir::BinOp::Cmp` (`three_way_compare` intrinsic) to the corresponding LLVM `llvm.{s,u}cmp.i8.*` intrinsics, added in LLVM 19.
2025-03-05	Change signature of `target_features_cfg`.	Nicholas Nethercote	-3/+6
	Currently it is called twice, once with `allow_unstable` set to true and once with it set to false. This results in some duplicated work. Most notably, for the LLVM backend, `LLVMRustHasFeature` is called twice for every feature, and it's moderately slow. For very short running compilations on platforms with many features (e.g. a `check` build of hello-world on x86) this is a significant fraction of runtime. This commit changes `target_features_cfg` so it is only called once, and it now returns a pair of feature sets. This halves the number of `LLVMRustHasFeature` calls.
2025-03-02	Revert "Auto merge of #135335 - oli-obk:push-zxwssomxxtnq, r=saethlin"	Michael Goulet	-1/+0
	This reverts commit a7a6c64a657f68113301c2ffe0745b49a16442d1, reversing changes made to ebbe63891f1fae21734cb97f2f863b08b1d44bf8.
2025-03-01	Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic	bors	-1/+1
	The embedded bitcode should always be prepared for LTO/ThinLTO Fixes #115344. Fixes #117220. There are currently two methods for generating bitcode that used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`. When using with `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain bitcode, then the bitcode is used for LTO. We run the Call Graph Profile Pass twice on the same module. This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`. r? nikic
2025-02-24	Remove an unused lifetime param	Oli Scherer	-2/+2

2025-02-24	Generalize BaseTypeCodegenMethods	Oli Scherer	-6/+3

2025-02-24	Remove an unnecessary lifetime	Oli Scherer	-3/+3

2025-02-24	ssa/mono: deduplicate `type_has_metadata`	David Wood	-14/+1
	The implementation of the `type_has_metadata` function is duplicated in `rustc_codegen_ssa` and `rustc_monomorphize`, so move this to `rustc_middle`.
2025-02-24	Auto merge of #137271 - nikic:gep-nuw-2, r=scottmcm	bors	-0/+8
	Emit getelementptr inbounds nuw for pointer::add() Lower pointer::add (via intrinsic::offset with unsigned offset) to getelementptr inbounds nuw on LLVM versions that support it. This lets LLVM make use of the pre-condition that the offset addition does not wrap in an unsigned sense. Together with inbounds, this also implies that the offset is non-negative. Fixes https://github.com/rust-lang/rust/issues/137217.
2025-02-23	Save pre-link bitcode to `ModuleCodegen`	DianQK	-1/+1

2025-02-19	Rework `OperandRef::extract_field` to stop calling `to_immediate_scalar` on ↵	Scott McMurray	-8/+1
	things which are already immediates That means it stops trying to truncate things that are already `i1`s.
2025-02-19	Emit `trunc nuw` for unchecked shifts and `to_immediate_scalar`	Scott McMurray	-0/+11
	- For shifts this shrinks the IR by no longer needing an `assume` while still providing the UB information - Having this on the `i8`→`i1` truncations will hopefully help with some places that have to load `i8`s or pass those in LLVM structs without range information
2025-02-19	Emit getelementptr inbounds nuw for pointer::add()	Nikita Popov	-0/+8

2025-02-18	Auto merge of #133852 - x17jiri:cold_path, r=saethlin	bors	-0/+14
	improve cold_path() #120370 added a new instrinsic `cold_path()` and used it to fix `likely` and `unlikely` However, in order to limit scope, the information about cold code paths is only used in 2-target switch instructions. This is sufficient for `likely` and `unlikely`, but limits usefulness of `cold_path` for idiomatic rust. For example, code like this: ``` if let Some(x) = y { ... } ``` may generate 3-target switch: ``` switch y.discriminator: 0 => true branch 1 = > false branch _ => unreachable ``` and therefore marking a branch as cold will have no effect. This PR improves `cold_path()` to work with arbitrary switch instructions. Note that for 2-target switches, we can use `llvm.expect`, but for multiple targets we need to manually emit branch weights. I checked Clang and it also emits weights in this situation. The Clang's weight calculation is more complex that this PR, which I believe is mainly because `switch` in `C/C++` can have multiple cases going to the same target.
2025-02-17	improve cold_path()	Jiri Bobek	-0/+14

2025-02-14	Auto merge of #136575 - scottmcm:nsuw-math, r=nikic	bors	-6/+29
	Set both `nuw` and `nsw` in slice size calculation There's an old note in the code to do this, and now that [LLVM-C has an API for it](https://github.com/llvm/llvm-project/blob/f0b8ff12519270adcfef93410abff76ab073476a/llvm/include/llvm-c/Core.h#L4403-L4408), we might as well. And it's been there since what looks like LLVM 17 https://github.com/llvm/llvm-project/commit/de9b6aa341d8951625d62ae3dac8670ebb3eb006 so doesn't even need to be conditional. (There's other places, like `RawVecInner` or `Layout`, that might want to do things like this too, but I'll leave those for a future PR.)
2025-02-13	Set both `nuw` and `nsw` in slice size calculation	Scott McMurray	-6/+29
	There's an old note in the code to do this, and now that LLVM-C has an API for it, we might as well.
2025-02-12	`transmute` should also assume non-null pointers	Scott McMurray	-0/+13
	Previously it only did integer-ABI things, but this way it does data pointers too. That gives more information in general to the backend, and allows slightly simplifying one of the helpers in slice iterators.
2025-02-07	compiler: remove reexports from rustc_target::callconv	Jubilee Young	-2/+2

2025-02-07	compiler: remove rustc_target::abi entirely	Jubilee Young	-1/+1

2025-01-31	Override `disjoint_or` in the LLVM backend	Scott McMurray	-0/+5

2025-01-30	Use ExistentialTraitRef throughout codegen	Michael Goulet	-4/+4

2025-01-24	Rollup merge of #135581 - EnzymeAD:refactor-codgencx, r=oli-obk	Matthias Krüger	-2/+0
	Separate Builder methods from tcx As part of the autodiff upstreaming we noticed, that it would be nice to have various builder methods available without the TypeContext, which prevents the normal CodegenCx to be passed around between threads. We introduce a SimpleCx which just owns the llvm module and llvm context, to encapsulate them. The previous CodegenCx now implements deref and forwards access to the llvm module or context to it's SimpleCx sub-struct. This gives us a bit more flexibility, because now we can pass (or construct) the SimpleCx in locations where we don't have enough information to construct a CodegenCx, or are not able to pass it around due to the tcx lifetimes (and it not implementing send/sync). This also introduces an SBuilder, similar to the SimpleCx. The SBuilder uses a SimpleCx, whereas the existing Builder uses the larger CodegenCx. I will push updates to make implementations generic (where possible) to be implemented once and work for either of the two. I'll also clean up the leftover code. `call` is a bit tricky, because it requires a tcx, I probably need to duplicate it after all. Tracking: - https://github.com/rust-lang/rust/issues/124509
2025-01-24	Make CodegenCx and Builder generic	Manuel Drehwald	-2/+0
	Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>
2025-01-22	Auto merge of #135674 - scottmcm:assume-better, r=estebank	bors	-0/+21
	Update our range `assume`s to the format that LLVM prefers I found out in https://github.com/llvm/llvm-project/issues/123278#issuecomment-2597440158 that the way I started emitting the `assume`s in #109993 was suboptimal, and as seen in that LLVM issue the way we're doing it -- with two `assume`s sometimes -- can at times lead to CVP/SCCP not realize what's happening because one of them turns into a `ne` instead of conveying a range. So this updates how it's emitted from ``` assume( x >= LOW ); assume( x <= HIGH ); ``` or ``` // (for ranges that wrap the range) assume( (x <= LOW) \| (x >= HIGH) ); ``` to ``` assume( (x - LOW) <= (HIGH - LOW) ); ``` so that we don't need multiple `icmp`s nor multiple `assume`s for a single value, and both wrappping and non-wrapping ranges emit the same shape. (And we don't bother emitting the subtraction if `LOW` is zero, since that's trivial for us to check too.)
2025-01-21	Treat undef bytes as equal to any other byte	Oli Scherer	-0/+1

2025-01-17	Update our range `assume`s to the format that LLVM prefers	Scott McMurray	-0/+21

2025-01-01	upstream rustc_codegen_llvm changes for enzyme/autodiff	Manuel Drehwald	-0/+9

2024-12-25	rename typed_swap → typed_swap_nonoverlapping	Ralf Jung	-1/+1

2024-12-18	Re-export more `rustc_span::symbol` things from `rustc_span`.	Nicholas Nethercote	-1/+1
	`rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers in `compiler/` to `rustc_span::`. This is a 200+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one.
2024-12-13	Auto merge of #133099 - RalfJung:forbidden-hardfloat-features, r=workingjubilee	bors	-1/+3
	forbid toggling x87 and fpregs on hard-float targets Part of https://github.com/rust-lang/rust/issues/116344, follow-up to https://github.com/rust-lang/rust/pull/129884: The `x87` target feature on x86 and the `fpregs` target feature on ARM must not be disabled on a hardfloat target, as that would change the float ABI. However, enabling `fpregs` on ARM is [explicitly requested](https://github.com/rust-lang/rust/issues/130988) as it seems to be useful. Therefore, we need to refine the distinction of "forbidden" target features and "allowed" target features: all (un)stable target features can determine on a per-target basis whether they should be allowed to be toggled or not. `fpregs` then checks whether the current target has the `soft-float` feature, and if yes, `fpregs` is permitted -- otherwise, it is not. (Same for `x87` on x86). Also fixes https://github.com/rust-lang/rust/issues/132351. Since `fpregs` and `x87` can be enabled on some builds and disabled on others, it would make sense that one can query it via `cfg`. Therefore, I made them behave in `cfg` like any other unstable target feature. The first commit prepares the infrastructure, but does not change behavior. The second commit then wires up `fpregs` and `x87` with that new infrastructure. r? `@workingjubilee`
2024-12-11	Auto merge of #128004 - folkertdev:naked-fn-asm, r=Amanieu	bors	-0/+7
	codegen `#[naked]` functions using global asm tracking issue: https://github.com/rust-lang/rust/issues/90957 Fixes #124375 This implements the approach suggested in the tracking issue: use the existing global assembly infrastructure to emit the body of `#[naked]` functions. The main advantage is that we now have full control over what gets generated, and are no longer dependent on LLVM not sneakily messing with our output (inlining, adding extra instructions, etc). I discussed this approach with `@Amanieu` and while I think the general direction is correct, there is probably a bunch of stuff that needs to change or move around here. I'll leave some inline comments on things that I'm not sure about. Combined with https://github.com/rust-lang/rust/pull/127853, if both accepted, I think that resolves all steps from the tracking issue. r? `@Amanieu`
2024-12-11	generalize 'forbidden feature' concept so that even (un)stable feature can ↵	Ralf Jung	-1/+3
	be invalid to toggle Also rename some things for extra clarity
2024-12-10	codegen `#[naked]` functions using `global_asm!`	Folkert	-0/+7

2024-12-06	Remove all threading through of ErrorGuaranteed from the driver	bjorn3	-8/+2
	It was inconsistently done (sometimes even within a single function) and most of the rest of the compiler uses fatal errors instead, which need to be caught using catch_with_exit_code anyway. Using fatal errors instead of ErrorGuaranteed everywhere in the driver simplifies things a bit.
2024-11-27	use intra-doc links for CodegenBackend::link	Monadic Cat	-1/+1