rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2025-06-25	added PrintTAFn flag for autodiff	Karan Janthe	-1/+5
	Signed-off-by: Karan Janthe <karanjanthe@gmail.com>
2025-05-28	Mark all optimize methods and the codegen method as safe	bjorn3	-3/+3
	There is no safety contract and I don't think any of them can actually cause UB in more ways than passing malicious source code to rustc can. While LtoModuleCodegen::optimize says that the returned ModuleCodegen points into the LTO module, the LTO module has already been dropped by the time this function returns, so if the returned ModuleCodegen indeed points into the LTO module, we would have seen crashes on every LTO compilation, which we don't. As such the comment is outdated.
2025-05-11	Add a safe wrapper for `LLVMAppendModuleInlineAsm`	Zalathar	-2/+2
	This patch also changes the Rust-side declaration to take `const c_uchar` instead of `const c_char`, to avoid the need for `AsCCharPtr`.
2025-05-04	Initial support for dynamically linked crates	Bryanskiy	-1/+2

2025-04-28	remove noinline attribute and add alwaysinline after AD pass	bit-aloo	-1/+27

2025-04-24	Rollup merge of #139700 - EnzymeAD:autodiff-flags, r=oli-obk	Matthias Krüger	-19/+37
	Autodiff flags Interestingly, it seems that some other projects have conflicts with exactly the same LLVM optimization passes as autodiff. At least `LLVMRustOptimize` has exactly the flags that we need to disable problematic opt passes. This PR enables us to compile code where users differentiate two identical functions in the same module. This has been especially common in test cases, but it's not impossible to encounter in the wild. It also enables two new flags for testing/debugging. I consider writing an MCP to upgrade PrintPasses to be a standalone -Z flag, since it is not the same as `-Z print-llvm-passes`, which IMHO gives less useful output. A discussion can be found here: [#t-compiler/llvm > Print llvm passes. @ 💬](https://rust-lang.zulipchat.com/#narrow/channel/187780-t-compiler.2Fllvm/topic/Print.20llvm.20passes.2E/near/511533038) Finally, it improves `PrintModBefore` and `PrintModAfter`. They used to work reliable, but now we just schedule enzyme as part of an existing ModulePassManager (MPM). Since Enzyme is last in the MPM scheduling, PrintModBefore became very inaccurate. It used to print the input module, which we gave to the Enzyme and was great to create llvm-ir reproducer. However, lately the MPM would run the whole `default<O3>` pipeline, which heavily modifies the llvm module, before we pass it to Enzyme. That made it impossible to use the flag to create llvm-ir reproducers for Enzyme bugs. We now schedule a PrintModule pass just before Enzyme, solving this problem. Based on the PrintPass output, it also _seems_ like changing `registerEnzymeAndPassPipeline(PB, true);` to `registerEnzymeAndPassPipeline(PB, false);` has no effect. In theory, the bool should tell Enzyme to schedule some helpful passes in the PassBuilder. However, since it doesn't do anything and I'm not 100% sure anymore on whether we really need it, I'll just disable it for now and postpone investigations. r? ``@oli-obk`` closes #139471 Tracking: - https://github.com/rust-lang/rust/issues/124509
2025-04-12	update documentation	Manuel Drehwald	-0/+5

2025-04-12	fix "could not find source function" error by preventing function merging ↵	Manuel Drehwald	-1/+4
	before AD
2025-04-12	fix LooseTypes flag and PrintMod behaviour, add debug helper	Manuel Drehwald	-18/+28

2025-04-07	Prepend temp files with a string per invocation of rustc	Michael Goulet	-12/+45

2025-04-07	Simplify temp path creation a bit	Michael Goulet	-16/+11

2025-04-06	Remove LLVM 18 inline ASM span fallback	beetrees	-5/+2

2025-04-05	Rollup merge of #137880 - EnzymeAD:autodiff-batching, r=oli-obk	Stuart Cook	-2/+10
	Autodiff batching Enzyme supports batching, which is especially known from the ML side when training neural networks. There we would normally have a training loop, where in each iteration we would pass in some data (e.g. an image), and a target vector. Based on how close we are with our prediction we compute our loss, and then use backpropagation to compute the gradients and update our weights. That's quite inefficient, so what you normally do is passing in a batch of 8/16/.. images and targets, and compute the gradients for those all at once, allowing better optimizations. Enzyme supports batching in two ways, the first one (which I implemented here) just accepts a Batch size, and then each Dual/Duplicated argument has not one, but N shadow arguments. So instead of ```rs for i in 0..100 { df(x[i], y[i], 1234); } ``` You can now do ```rs for i in 0..100.step_by(4) { df(x[i+0],x[i+1],x[i+2],x[i+3], y[i+0], y[i+1], y[i+2], y[i+3], 1234); } ``` which will give the same results, but allows better compiler optimizations. See the testcase for details. There is a second variant, where we can mark certain arguments and instead of having to pass in N shadow arguments, Enzyme assumes that the argument is N times longer. I.e. instead of accepting 4 slices with 12 floats each, we would accept one slice with 48 floats. I'll implement this over the next days. I will also add more tests for both modes. For any one preferring some more interactive explanation, here's a video of Tim's llvm dev talk, where he presents his work. https://www.youtube.com/watch?v=edvaLAL5RqU I'll also add some other docs to the dev guide and user docs in another PR. r? ghost Tracking: - https://github.com/rust-lang/rust/issues/124509 - https://github.com/rust-lang/rust/issues/135283
2025-04-04	add new flag to print the module post-AD, before opts	Manuel Drehwald	-2/+10

2025-04-04	Rollup merge of #138949 - madsmtm:rename-to-darwin, r=WaffleLapkin	Matthias Krüger	-3/+3
	Rename `is_like_osx` to `is_like_darwin` Replace `is_like_osx` with `is_like_darwin`, which more closely describes reality (OS X is the pre-2016 name for macOS, and is by now quite outdated; Darwin is the overall name for the OS underlying Apple's macOS, iOS, etc.). ``@rustbot`` label O-apple r? compiler
2025-03-25	Rename `is_like_osx` to `is_like_darwin`	Mads Marquart	-3/+3

2025-03-25	Reduce visibility of most items in `rustc_codegen_llvm`	Daniel Paoliello	-1/+1

2025-03-01	Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic	bors	-55/+98
	The embedded bitcode should always be prepared for LTO/ThinLTO Fixes #115344. Fixes #117220. There are currently two methods for generating bitcode that used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`. When using with `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain bitcode, then the bitcode is used for LTO. We run the Call Graph Profile Pass twice on the same module. This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`. r? nikic
2025-02-28	Rollup merge of #137017 - bjorn3:ignore_invalid_bitcode, r=oli-obk	许杰友 Jieyou Xu (Joe)	-3/+22
	Don't error when adding a staticlib with bitcode files compiled by newer LLVM cc https://github.com/rust-lang/rust/issues/128955#issuecomment-2657811196
2025-02-26	Restore usage of io::Error	bjorn3	-4/+7

2025-02-24	codegen_llvm: avoid `Deref` impls w/ extern type	David Wood	-11/+12
	`rustc_codegen_llvm` relied on `Deref` impls where `Deref::Target` was or contained an extern type - in my experimental implementation of rust-lang/rfcs#3729, this isn't possible as the `Target` associated type's `?Sized` bound cannot be relaxed backwards compatibly (unless we come up with some way of doing this). In later pull requests with the rust-lang/rfcs#3729 implementation, breakage like this could only occur for nightly users relying on the `extern_types` feature. Upstreaming this to avoid needing to keep carrying this patch locally, and I think it'll necessarily need to change eventually.
2025-02-23	Save pre-link bitcode to `ModuleCodegen`	DianQK	-15/+10

2025-02-23	Add `new_regular` and `new_allocator` to `ModuleCodegen`	DianQK	-10/+3

2025-02-23	The embedded bitcode should always be prepared for LTO/ThinLTO	DianQK	-44/+99

2025-02-21	update autodiff flags	Manuel Drehwald	-23/+62

2025-02-21	clean up autodiff code/comments	Manuel Drehwald	-10/+5

2025-02-20	Merge two operations that were always performed together	Oli Scherer	-1/+1

2025-02-14	Don't error when adding a staticlib with bitcode files compiled by newer LLVM	bjorn3	-6/+22

2025-02-13	Make `-O` mean `-C opt-level=3`	clubby789	-2/+2

2025-02-12	Rollup merge of #135549 - oli-obk:push-tmxtpnrloyqu, r=compiler-errors	Matthias Krüger	-131/+92
	Document some safety constraints and use more safe wrappers Lots of unsafe codegen_llvm code has safe wrappers already, so I used some of them and added some where applicable. I stopped here because this diff is large enough and should probably be reviewed independently of other changes.
2025-02-11	Document some safety constraints and use more safe wrappers	Oli Scherer	-43/+32

2025-02-11	Add a safe wrapper for `WriteBitcodeToFile`	Oli Scherer	-8/+10

2025-02-11	Remove an unsafe closure invariant by inlining the closure wrapper into the ↵	Oli Scherer	-80/+50
	called function
2025-02-11	Rollup merge of #136721 - dpaoliello:cleanllvm2, r=Zalathar	Jacob Pratt	-3/+3
	cg_llvm: Reduce visibility of some items outside the `llvm` module Next piece of #135502 This reduces the visibility of items (other than those in the `llvm` module) so that dead code analysis will correctly identify unused items.
2025-02-10	rustc_codegen_llvm: Mark items as pub(crate) outside of the llvm module	Daniel Paoliello	-3/+3

2025-02-10	move second opt run to lto phase and cleanup code	Manuel Drehwald	-18/+48

2025-02-07	fix non-enzyme builds	Manuel Drehwald	-1/+3

2025-01-22	rustc_codegen_llvm: remove outdated asm-to-obj codegen note	Ken Matsui	-6/+3
	Remove comment about missing integrated assembler handling, which was removed in commit 02840ca.
2025-01-13	Rollup merge of #133752 - klensy:cp, r=davidtwco	Matthias Krüger	-7/+3
	replace copypasted ModuleLlvm::parse replaced code same as in https://github.com/rust-lang/rust/blob/bd36e69d2533ee750e2d805915b8ca88d2825e0f/compiler/rustc_codegen_llvm/src/lib.rs#L426-L445 except before error message was emitted via `write::llvm_err`, which returned other error kind, but it still ok?
2025-01-07	llvm: Ignore error value that is always false	Matthew Maurer	-5/+1
	See llvm/llvm-project#121851 For LLVM 20+, this function (`renameModuleForThinLTO`) has no return value. For prior versions of LLVM, this never failed, but had a signature which allowed an error value people were handling.
2025-01-01	upstream rustc_codegen_llvm changes for enzyme/autodiff	Manuel Drehwald	-7/+55

2024-12-30	add llvm_floatabi field to target spec that controls FloatABIType	Ralf Jung	-4/+12

2024-12-30	rustc_llvm: expose FloatABIType target machine parameter	Ralf Jung	-7/+7

2024-12-29	make -Csoft-float have an effect on all ARM targets	Ralf Jung	-1/+1

2024-12-18	Re-export more `rustc_span::symbol` things from `rustc_span`.	Nicholas Nethercote	-2/+1
	`rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers in `compiler/` to `rustc_span::`. This is a 200+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one.
2024-12-12	Auto merge of #129181 - beetrees:asm-spans, r=pnkfelix,compiler-errors	bors	-6/+17
	Pass end position of span through inline ASM cookie Before this PR, only the start position of the span was passed though the inline ASM cookie to diagnostics. LLVM 19 has full support for 64-bit inline ASM cookies; this PR uses that to pass the end position of the span in the upper 32 bits, meaning inline ASM diagnostics now point at the entire line the error occurred on, not just the first character of it.
2024-12-02	Use c"lit" for CStrings without unwrap	Kornel	-2/+2

2024-12-02	replace copypasted ModuleLlvm::parse	klensy	-7/+3

2024-11-26	Respect verify-llvm-ir option in the backend	Nikita Popov	-0/+4
	We are currently unconditionally verifying the LLVM IR in the backend (twice), ignoring the value of the verify-llvm-ir option.
2024-11-26	Pass end position of span through inline ASM cookie	beetrees	-6/+17