rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2025-09-29	Rollup merge of #147127 - antoyo:fix/gcc-linker-plugin, r=bjorn3	Stuart Cook	-1/+12
	Add a leading dash to linker plugin arguments in the gcc codegen Fix rust-lang/rust#130583 r? ``@bjorn3``
2025-09-28	Add a leading dash to linker plugin arguments in the gcc codegen	Antoni Boucher	-1/+12

2025-09-28	Rollup merge of #144197 - KMJ-007:type-tree, r=ZuseZ4	Matthias Krüger	-0/+1
	TypeTree support in autodiff # TypeTrees for Autodiff ## What are TypeTrees? Memory layout descriptors for Enzyme. Tell Enzyme exactly how types are structured in memory so it can compute derivatives efficiently. ## Structure ```rust TypeTree(Vec<Type>) Type { offset: isize, // byte offset (-1 = everywhere) size: usize, // size in bytes kind: Kind, // Float, Integer, Pointer, etc. child: TypeTree // nested structure } ``` ## Example: `fn compute(x: &f32, data: &[f32]) -> f32` Input 0: `x: &f32` ```rust TypeTree(vec![Type { offset: -1, size: 8, kind: Pointer, child: TypeTree(vec![Type { offset: -1, size: 4, kind: Float, child: TypeTree::new() }]) }]) ``` Input 1: `data: &[f32]` ```rust TypeTree(vec![Type { offset: -1, size: 8, kind: Pointer, child: TypeTree(vec![Type { offset: -1, size: 4, kind: Float, // -1 = all elements child: TypeTree::new() }]) }]) ``` Output: `f32` ```rust TypeTree(vec![Type { offset: -1, size: 4, kind: Float, child: TypeTree::new() }]) ``` ## Why Needed? - Enzyme can't deduce complex type layouts from LLVM IR - Prevents slow memory pattern analysis - Enables correct derivative computation for nested structures - Tells Enzyme which bytes are differentiable vs metadata ## What Enzyme Does With This Information: Without TypeTrees (current state): ```llvm ; Enzyme sees generic LLVM IR: define float ``@distance(ptr`` %p1, ptr %p2) { ; Has to guess what these pointers point to ; Slow analysis of all memory operations ; May miss optimization opportunities } ``` With TypeTrees (our implementation): ```llvm define "enzyme_type"="{[]:Float@float}" float ``@distance(`` ptr "enzyme_type"="{[]:Pointer}" %p1, ptr "enzyme_type"="{[]:Pointer}" %p2 ) { ; Enzyme knows exact type layout ; Can generate efficient derivative code directly } ``` # TypeTrees - Offset and -1 Explained ## Type Structure ```rust Type { offset: isize, // WHERE this type starts size: usize, // HOW BIG this type is kind: Kind, // WHAT KIND of data (Float, Int, Pointer) child: TypeTree // WHAT'S INSIDE (for pointers/containers) } ``` ## Offset Values ### Regular Offset (0, 4, 8, etc.) Specific byte position within a structure ```rust struct Point { x: f32, // offset 0, size 4 y: f32, // offset 4, size 4 id: i32, // offset 8, size 4 } ``` TypeTree for `&Point` (internal representation): ```rust TypeTree(vec![ Type { offset: 0, size: 4, kind: Float }, // x at byte 0 Type { offset: 4, size: 4, kind: Float }, // y at byte 4 Type { offset: 8, size: 4, kind: Integer } // id at byte 8 ]) ``` Generates LLVM: ```llvm "enzyme_type"="{[]:Float@float}" ``` ### Offset -1 (Special: "Everywhere") Means "this pattern repeats for ALL elements" #### Example 1: Array `[f32; 100]` ```rust TypeTree(vec![Type { offset: -1, // ALL positions size: 4, // each f32 is 4 bytes kind: Float, // every element is float }]) ``` Instead of listing 100 separate Types with offsets `0,4,8,12...396` #### Example 2: Slice `&[i32]` ```rust // Pointer to slice data TypeTree(vec![Type { offset: -1, size: 8, kind: Pointer, child: TypeTree(vec![Type { offset: -1, // ALL slice elements size: 4, // each i32 is 4 bytes kind: Integer }]) }]) ``` #### Example 3: Mixed Structure ```rust struct Container { header: i64, // offset 0 data: [f32; 1000], // offset 8, but elements use -1 } ``` ```rust TypeTree(vec![ Type { offset: 0, size: 8, kind: Integer }, // header Type { offset: 8, size: 4000, kind: Pointer, child: TypeTree(vec![Type { offset: -1, size: 4, kind: Float // ALL array elements }]) } ]) ```
2025-09-19	generate list of all variants with `target_spec_enum`	Deadbeef	-13/+3
	This helps us avoid the hardcoded lists elsewhere.
2025-09-19	Add TypeTree metadata attachment for autodiff	Karan Janthe	-0/+1
	- Add F128 support to TypeTree Kind enum - Implement TypeTree FFI bindings and conversion functions - Add typetree.rs module for metadata attachment to LLVM functions - Integrate TypeTree generation with autodiff intrinsic pipeline - Support scalar types: f32, f64, integers, f16, f128 - Attach enzyme_type attributes as LLVM string metadata for Enzyme Signed-off-by: Karan Janthe <karanjanthe@gmail.com>
2025-09-06	Remove want_summary argument from prepare_thin	bjorn3	-5/+2
	It is always false nowadays. ThinLTO summary writing is instead done by llvm_optimize.
2025-08-24	Directly raise fatal errors inside the codegen backends	bjorn3	-16/+16
	As opposed to passing it around through Result.
2025-08-19	Rollup merge of #145432 - Zalathar:target-machine, r=wesleywiser	Stuart Cook	-8/+1
	cg_llvm: Small cleanups to `owned_target_machine` This PR contains a few tiny cleanups to the `owned_target_machine` code. Each individual commit should be fairly straightforward.
2025-08-15	Declare module `rustc_codegen_llvm::back` in the normal way	Zalathar	-8/+1
	Declaring these submodules directly in `lib.rs` was needlessly confusing.
2025-08-14	Complete functionality and general cleanup	Marcelo Domínguez	-6/+0

2025-07-26	Remove support for -Zcombine-cgu	bjorn3	-7/+0
	Nobody seems to actually use this, while still adding some extra complexity to the already rather complex codegen coordinator code. It is also not supported by any backend other than the LLVM backend.
2025-07-24	Auto merge of #144062 - bjorn3:lto_refactors2, r=davidtwco	bors	-3/+14
	Various refactors to the LTO handling code (part 2) Continuing from https://github.com/rust-lang/rust/pull/143388 this removes a bit of dead code and moves the LTO symbol export calculation from individual backends to cg_ssa.
2025-07-21	Remove each_linked_rlib_for_lto from CodegenContext	bjorn3	-2/+12

2025-07-21	Move exported_symbols_for_lto out of CodegenContext	bjorn3	-2/+4

2025-07-21	Merge modules and cached_modules for fat LTO	bjorn3	-2/+1
	The modules vec can already contain serialized modules and there is no need to distinguish between cached and non-cached cgus at LTO time.
2025-07-18	gpu host code generation	Manuel Drehwald	-7/+15

2025-07-17	Rollup merge of #143388 - bjorn3:lto_refactors, r=compiler-errors	León Orell Valerian Liehr	-30/+19
	Various refactors to the LTO handling code In particular reducing the sharing of code paths between fat and thin-LTO and making the fat LTO implementation more self-contained. This also moves some autodiff handling out of cg_ssa into cg_llvm given that Enzyme only works with LLVM anyway and an implementation for another backend may do things entirely differently. This will also make it a bit easier to split LTO handling out of the coordinator thread main loop into a separate loop, which should reduce the complexity of the coordinator thread.
2025-07-07	compiler: Parse `p-` specs in datalayout string, allow definition of custom ↵	Edoardo Marangoni	-1/+1
	default data address space
2025-07-03	Merge run_fat_lto, optimize_fat and autodiff into run_and_optimize_fat_lto	bjorn3	-24/+15

2025-07-03	Remove unused config param from WriteBackendMethods::autodiff	bjorn3	-2/+1

2025-07-03	Move dcx creation into WriteBackendMethods::codegen	bjorn3	-2/+1

2025-07-03	Remove LtoModuleCodegen	bjorn3	-3/+3
	Most uses of it either contain a fat or thin lto module. Only WorkItem::LTO could contain both, but splitting that enum variant doesn't complicate things much.
2025-06-15	Rollup merge of #141769 - bjorn3:codegen_metadata_module_rework, ↵	León Orell Valerian Liehr	-10/+9
	r=workingjubilee,saethlin Move metadata object generation for dylibs to the linker code This deduplicates some code between codegen backends and may in the future allow adding extra metadata that is only known at link time. Prerequisite of https://github.com/rust-lang/rust/issues/96708.
2025-06-08	Remove all unused feature gates from the compiler	bjorn3	-1/+0

2025-06-03	Move metadata object generation for dylibs to the linker code	bjorn3	-6/+1
	This deduplicates some code between codegen backends and may in the future allow adding extra metadata that is only known at link time.
2025-06-03	Only borrow EncodedMetadata in codegen_crate	bjorn3	-5/+9
	And move passing it to the linker to the driver code.
2025-05-28	Mark all optimize methods and the codegen method as safe	bjorn3	-6/+6
	There is no safety contract and I don't think any of them can actually cause UB in more ways than passing malicious source code to rustc can. While LtoModuleCodegen::optimize says that the returned ModuleCodegen points into the LTO module, the LTO module has already been dropped by the time this function returns, so if the returned ModuleCodegen indeed points into the LTO module, we would have seen crashes on every LTO compilation, which we don't. As such the comment is outdated.
2025-05-12	update cfg(bootstrap)	Pietro Albini	-1/+0

2025-04-27	Implement the internal feature `cfg_target_has_reliable_f16_f128`	Trevor Gross	-4/+4
	Support for `f16` and `f128` is varied across targets, backends, and backend versions. Eventually we would like to reach a point where all backends support these approximately equally, but until then we have to work around some of these nuances of support being observable. Introduce the `cfg_target_has_reliable_f16_f128` internal feature, which provides the following new configuration gates: * `cfg(target_has_reliable_f16)` * `cfg(target_has_reliable_f16_math)` * `cfg(target_has_reliable_f128)` * `cfg(target_has_reliable_f128_math)` `reliable_f16` and `reliable_f128` indicate that basic arithmetic for the type works correctly. The `_math` versions indicate that anything relying on `libm` works correctly, since sometimes this hits a separate class of codegen bugs. These options match configuration set by the build script at [1]. The logic for LLVM support is duplicated as-is from the same script. There are a few possible updates that will come as a follow up. The config introduced here is not planned to ever become stable, it is only intended to replace the build scripts for `std` tests and `compiler-builtins` that don't have any way to configure based on the codegen backend. MCP: https://github.com/rust-lang/compiler-team/issues/866 Closes: https://github.com/rust-lang/compiler-team/issues/866 [1]: https://github.com/rust-lang/rust/blob/555e1d0386f024a8359645c3217f4b3eae9be042/library/std/build.rs#L84-L186
2025-04-23	Make #![feature(let_chains)] bootstrap conditional in compiler/	est31	-1/+1

2025-03-25	Reduce visibility of most items in `rustc_codegen_llvm`	Daniel Paoliello	-4/+2

2025-03-12	Rollup merge of #138331 - nnethercote:use-RUSTC_LINT_FLAGS-more, ↵	Matthias Krüger	-1/+0
	r=onur-ozkan,jieyouxu Use `RUSTC_LINT_FLAGS` more An alternative to the failed #138084. Fixes #138106. r? ````@jieyouxu````
2025-03-11	Auto merge of #137586 - nnethercote:SetImpliedBits, r=bjorn3	bors	-2/+2
	Speed up target feature computation The LLVM backend calls `LLVMRustHasFeature` twice for every feature. In short-running rustc invocations, this accounts for a surprising amount of work. r? `@bjorn3`
2025-03-11	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates.	Nicholas Nethercote	-1/+0
	It's no longer necessary now that `-Wunreachable_pub` is being passed.
2025-03-10	Revert "Use workspace lints for crates in `compiler/` #138084"	许杰友 Jieyou Xu (Joe)	-0/+1
	Revert <https://github.com/rust-lang/rust/pull/138084> to buy time to consider options that avoids breaking downstream usages of cargo on distributed `rustc-src` artifacts, where such cargo invocations fail due to inability to inherit `lints` from workspace root manifest's `workspace.lints` (this is only valid for the source rust-lang/rust workspace, but not really the distributed `rustc-src` artifacts). This breakage was reported in <https://github.com/rust-lang/rust/issues/138304>. This reverts commit 48caf81484b50dca5a5cebb614899a3df81ca898, reversing changes made to c6662879b27f5161e95f39395e3c9513a7b97028.
2025-03-09	Rollup merge of #138084 - nnethercote:workspace-lints, r=jieyouxu	Matthias Krüger	-1/+0
	Use workspace lints for crates in `compiler/` This is nicer and hopefully less error prone than specifying lints via bootstrap. r? ``@jieyouxu``
2025-03-08	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates.	Nicholas Nethercote	-1/+0
	(Except for `rustc_codegen_cranelift`.) It's no longer necessary now that `unreachable_pub` is in the workspace lints.
2025-03-07	Rollup merge of #137549 - oli-obk:llvm-ffi, r=davidtwco	Matthias Krüger	-2/+5
	Clean up various LLVM FFI things in codegen_llvm cc ```@ZuseZ4``` I touched some autodiff parts The major change of this PR is [bfd88ce](https://github.com/rust-lang/rust/pull/137549/commits/bfd88cead0dd79717f123ad7e9a26ecad88653cb) which makes `CodegenCx` generic just like `GenericBuilder` The other commits mostly took advantage of the new feature of making extern functions safe, but also just used some wrappers that were already there and shrunk unsafe blocks. best reviewed commit-by-commit
2025-03-05	Change signature of `target_features_cfg`.	Nicholas Nethercote	-2/+2
	Currently it is called twice, once with `allow_unstable` set to true and once with it set to false. This results in some duplicated work. Most notably, for the LLVM backend, `LLVMRustHasFeature` is called twice for every feature, and it's moderately slow. For very short running compilations on platforms with many features (e.g. a `check` build of hello-world on x86) this is a significant fraction of runtime. This commit changes `target_features_cfg` so it is only called once, and it now returns a pair of feature sets. This halves the number of `LLVMRustHasFeature` calls.
2025-03-03	Rollup merge of #137741 - cuviper:const_str-raw_entry, r=Mark-Simulacrum	Matthias Krüger	-1/+0
	Stop using `hash_raw_entry` in `CodegenCx::const_str` That unstable feature (#56167) completed fcp-close, so the compiler needs to be migrated away to allow its removal. In this case, `cg_llvm` and `cg_gcc` were using raw entries to optimize their `const_str_cache` lookup and insertion. We can change that to separate `get` and (on miss) `insert` calls, so we still have the fast path avoiding string allocation when the cache hits.
2025-02-27	Stop using `hash_raw_entry` in `CodegenCx::const_str`	Josh Stone	-1/+0
	That unstable feature completed fcp-close, so the compiler needs to be migrated away to allow its removal. In this case, `cg_llvm` and `cg_gcc` were using raw entries to optimize their `const_str_cache` lookup and insertion. We can change that to separate `get` and (on miss) `insert` calls, so we still have the fast path avoiding string allocation when the cache hits.
2025-02-24	Make allocator shim creation mostly use safe code	Oli Scherer	-2/+5

2025-02-23	Save pre-link bitcode to `ModuleCodegen`	DianQK	-1/+1

2025-02-13	Rollup merge of #136858 - safinaskar:parallel-cleanup-2025-02-11-07-54, ↵	Jacob Pratt	-3/+0
	r=SparrowLii Parallel-compiler-related cleanup Parallel-compiler-related cleanup I carefully split changes into commits. Commit messages are self-explanatory. Squashing is not recommended. cc "Parallel Rustc Front-end" https://github.com/rust-lang/rust/issues/113349 r? SparrowLii ``@rustbot`` label: +WG-compiler-parallel
2025-02-11	compiler/rustc_codegen_llvm/src/lib.rs: remove "unsafe impl Send/Sync"	Askar Safin	-3/+0

2025-02-11	Rollup merge of #136721 - dpaoliello:cleanllvm2, r=Zalathar	Jacob Pratt	-1/+1
	cg_llvm: Reduce visibility of some items outside the `llvm` module Next piece of #135502 This reduces the visibility of items (other than those in the `llvm` module) so that dead code analysis will correctly identify unused items.
2025-02-10	rustc_codegen_llvm: Mark items as pub(crate) outside of the llvm module	Daniel Paoliello	-1/+1

2025-02-06	coverage: Defer part of counter-creation until codegen	Zalathar	-0/+1

2025-02-06	Remove the `mod llvm_` hack, which should no longer be necessary	Zalathar	-8/+3

2025-01-24	Make CodegenCx and Builder generic	Manuel Drehwald	-2/+1
	Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>