rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2025-07-31	Rollup merge of #144712 - nnethercote:dedup-num-types, r=fmease	Jana Dönszelmann	-0/+2
	Deduplicate `IntTy`/`UintTy`/`FloatTy`. There are identical definitions in `rustc_type_ir` and `rustc_ast`. This commit removes them and places a single definition in `rustc_ast_ir`. This requires adding `rust_span` as a dependency of `rustc_ast_ir`, but means a bunch of silly conversion functions can be removed. r? `@fmease`
2025-07-31	Tidy up `Cargo.toml` files.	Nicholas Nethercote	-0/+2
	- Add some missing `tidy-alphabetical-*` markers. - Remove some unnecessary blank lines.
2025-07-31	Rollup merge of #144232 - xacrimon:explicit-tail-call, r=WaffleLapkin	Stuart Cook	-0/+26
	Implement support for `become` and explicit tail call codegen for the LLVM backend This PR implements codegen of explicit tail calls via `become` in `rustc_codegen_ssa` and support within the LLVM backend. Completes a task on (https://github.com/rust-lang/rust/issues/112788). This PR implements all the necessary bits to make explicit tail calls usable, other backends have received stubs for now and will ICE if you use `become` on them. I suspect there is some bikeshedding to be done on how we should go about implementing this for other backends, but it should be relatively straightforward for GCC after this is merged. During development I also put together a POC bytecode VM based on tail call dispatch to test these changes out and analyze the codegen to make sure it generates expected assembly. That is available [here](https://github.com/xacrimon/tcvm).
2025-07-29	cc dependencies: clarify comment	Ralf Jung	-2/+2

2025-07-28	Rollup merge of #144503 - bjorn3:lto_refactors3, r=petrochenkov	Matthias Krüger	-34/+0
	Various refactors to the codegen coordinator code (part 3) Continuing from https://github.com/rust-lang/rust/pull/144062 this removes an option without any known users, uses the object crate in favor of LLVM for getting the LTO bitcode and improves the coordinator channel handling.
2025-07-26	Implement support for explicit tail calls in the MIR block builders and the ↵	Joel Wejdenstål	-0/+26
	LLVM codegen backend.
2025-07-25	Use the object crate rather than LLVM for extracting bitcode sections	bjorn3	-34/+0

2025-07-23	RustWrapper: Suppress getNextNonDebugInfoInstruction	WANG Rui	-1/+1
	Link: https://github.com/llvm/llvm-project/pull/144383
2025-07-22	Rollup merge of #142097 - ZuseZ4:offload-host1, r=oli-obk	许杰友 Jieyou Xu (Joe)	-0/+37
	gpu offload host code generation r? ghost This will generate most of the host side code to use llvm's offload feature. The first PR will only handle automatic mem-transfers to and from the device. So if a user calls a kernel, we will copy inputs back and forth, but we won't do the actual kernel launch. Before merging, we will use LLVM's Info infrastructure to verify that the memcopies match what openmp offloa generates in C++. `LIBOMPTARGET_INFO=-1 ./my_rust_binary` should print that a memcpy to and later from the device is happening. A follow-up PR will generate the actual device-side kernel which will then do computations on the GPU. A third PR will implement manual host2device and device2host functionality, but the goal is to minimize cases where a user has to overwrite our default handling due to performance issues. I'm trying to get a full MVP out first, so this just recognizes GPU functions based on magic names. The final frontend will obviously move this over to use proper macros, like I'm already doing it for the autodiff work. This work will also be compatible with std::autodiff, so one can differentiate GPU kernels. Tracking: - https://github.com/rust-lang/rust/issues/131513
2025-07-20	Rollup merge of #144116 - nikic:llvm-21-fixes, r=dianqk	Matthias Krüger	-1/+4
	Fixes for LLVM 21 This fixes compatibility issues with LLVM 21 without performing the actual upgrade. Split out from https://github.com/rust-lang/rust/pull/143684. This fixes three issues: * Updates the AMDGPU data layout for address space 8. * Makes emit-arity-indicator.rs a no_core test, so it doesn't fail on non-x86 hosts. * Explicitly sets the exception model for wasm, as this is no longer implied by `-wasm-enable-eh`.
2025-07-19	Rollup merge of #142444 - KMJ-007:autodiff-codegen-test, r=ZuseZ4	Matthias Krüger	-0/+13
	adding run-make test to autodiff r? `@ZuseZ4`
2025-07-18	add various wrappers for gpu code generation	Manuel Drehwald	-0/+37

2025-07-18	Pass wasm exception model to TargetOptions	Nikita Popov	-1/+4
	This is no longer implied by -wasm-enable-eh.
2025-07-11	Avoid building C++ for rustc_llvm with --compile-time-deps	bjorn3	-0/+8
	This saves about 30s.
2025-07-02	awhile -> a while where appropriate	наб	-1/+1

2025-07-02	fix: Fix TypePrintFn flag passing for autodiff codegen	Karan Janthe	-0/+13
	Signed-off-by: Karan Janthe <karanjanthe@gmail.com>
2025-05-31	rustc_llvm: add Windows system libs only when cross-compiling from Windows	Mateusz Mikuła	-2/+2
	This obviously doesn't work when cross-compiling from Linux. Split out from: https://github.com/rust-lang/rust/pull/140772
2025-05-15	Experimental cygwin support in rustc	王宇逸	-0/+1
	Co-authored-by: Ookiineko <chiisaineko@protonmail.com>
2025-05-11	Use `LLVMGetInlineAsm`	Zalathar	-27/+0
	This LLVM-C binding replaces the existing `LLVMRustInlineAsm` function.
2025-05-01	PassWrapper: adapt for ↵	Erick Tryzelaar	-0/+5
	llvm/llvm-project@f137c3d592e96330e450a8fd63ef7e8877fc1908 In LLVM 21 PR https://github.com/llvm/llvm-project/pull/130940 `TargetRegistry::createTargetMachine` was changed to take a `const Triple&` and has deprecated the old `StringRef` method. @rustbot label llvm-main
2025-04-29	Rollup merge of #140400 - durin42:llvm-21-getguid, r=cuviper	Trevor Gross	-4/+9
	PassWrapper: adapt for llvm/llvm-project@d3d856ad8469 LLVM 21 moves to making it more explicit what this function call is doing, but nothing has changed behaviorally, so for now we just adjust to using the new name of the function. `@rustbot` label llvm-main
2025-04-28	Rollup merge of #139308 - Shourya742:2025-03-29-add-autodiff-inline, r=ZuseZ4	Chris Denton	-0/+21
	add autodiff inline closes: #138920 r? ```@ZuseZ4``` try-job: dist-aarch64-linux
2025-04-28	PassWrapper: adapt for llvm/llvm-project@d3d856ad8469	Augie Fackler	-4/+9
	LLVM 21 moves to making it more explicit what this function call is doing, but nothing has changed behaviorally, so for now we just adjust to using the new name of the function. @rustbot label llvm-main
2025-04-28	remove noinline attribute and add alwaysinline after AD pass	bit-aloo	-4/+6

2025-04-26	Rollup merge of #140253 - SergioGasquez:feat/xtensa-asm-printer, r=cuviper	Matthias Krüger	-0/+1
	Add XtensaAsmPrinter See https://github.com/rust-lang/rust/pull/133601. The PR was closed because it required LLVM 19 in CI added with (https://github.com/rust-lang/rust/commit/12167d7064597993355e41d3a8c20654bccaf0be)
2025-04-25	add llvm wrappers and corresponding methods in attribute	bit-aloo	-0/+19

2025-04-24	feat: Add XtensaAsmPrinter	Sergio Gasquez	-0/+1

2025-04-12	fix LooseTypes flag and PrintMod behaviour, add debug helper	Manuel Drehwald	-2/+28

2025-04-05	Update the minimum external LLVM to 19	Josh Stone	-129/+14

2025-04-05	Rollup merge of #137880 - EnzymeAD:autodiff-batching, r=oli-obk	Stuart Cook	-0/+10
	Autodiff batching Enzyme supports batching, which is especially known from the ML side when training neural networks. There we would normally have a training loop, where in each iteration we would pass in some data (e.g. an image), and a target vector. Based on how close we are with our prediction we compute our loss, and then use backpropagation to compute the gradients and update our weights. That's quite inefficient, so what you normally do is passing in a batch of 8/16/.. images and targets, and compute the gradients for those all at once, allowing better optimizations. Enzyme supports batching in two ways, the first one (which I implemented here) just accepts a Batch size, and then each Dual/Duplicated argument has not one, but N shadow arguments. So instead of ```rs for i in 0..100 { df(x[i], y[i], 1234); } ``` You can now do ```rs for i in 0..100.step_by(4) { df(x[i+0],x[i+1],x[i+2],x[i+3], y[i+0], y[i+1], y[i+2], y[i+3], 1234); } ``` which will give the same results, but allows better compiler optimizations. See the testcase for details. There is a second variant, where we can mark certain arguments and instead of having to pass in N shadow arguments, Enzyme assumes that the argument is N times longer. I.e. instead of accepting 4 slices with 12 floats each, we would accept one slice with 48 floats. I'll implement this over the next days. I will also add more tests for both modes. For any one preferring some more interactive explanation, here's a video of Tim's llvm dev talk, where he presents his work. https://www.youtube.com/watch?v=edvaLAL5RqU I'll also add some other docs to the dev guide and user docs in another PR. r? ghost Tracking: - https://github.com/rust-lang/rust/issues/124509 - https://github.com/rust-lang/rust/issues/135283
2025-04-04	add autodiff batching backend	Manuel Drehwald	-0/+10

2025-03-31	PassWrapper: adapt for ↵	Augie Fackler	-4/+9
	llvm/llvm-project@94122d58fc77079a291a3d008914006cb509d9db We also have to remove the LLVM argument in cast-target-abi.rs for LLVM 21. I'm not really sure what the best approach here is since that test already uses revisions. We could also fork the test into a copy for LLVM 19-20 and another for LLVM 21, but what I did for now was drop the lint-abort-on-error flag to LLVM figuring that some coverage was better than none, but I'm happy to change this if that was a bad direction. The above also applies for ffi-out-of-bounds-loads.rs. r? dianqk @rustbot label llvm-main
2025-03-20	coverage: Add LLVM plumbing for expansion regions	Zalathar	-0/+16
	This is currently unused, but paves the way for future work on expansion regions without having to worry about the FFI parts.
2025-03-16	Auto merge of #137011 - LuuuXXX:promote-ohos-with-host-tools, r=Amanieu	bors	-1/+2
	Promote ohos targets to tier2 with host tools. ### What does this PR try to resolve? Try to promote the following [[Tier 2 without Host Tools](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-without-host-tools)](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-without-host-tools) targets to [[Tier 2 with Host Tools](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-with-host-tools)](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-with-host-tools): - `aarch64-unknown-linux-ohos` - `armv7-unknown-linux-ohos` - `x86_64-unknown-linux-ohos` ### More Information? see MCP: https://github.com/rust-lang/compiler-team/issues/811 ### Blockage to be solved? - [x] Submit an MCP - [x] Submit code of promote ohos targets - [x] Resolve related dependencies （`measureme`） The modified code of the measureme has been merged （see https://github.com/rust-lang/measureme/pull/238）. [done] The new version will was released (https://github.com/rust-lang/measureme/pull/240). [done]
2025-03-13	Rollup merge of #138420 - zmodem:cfifunctionindex_fix, r=durin42	Matthias Krüger	-0/+9
	Adapt to LLVM dropping CfiFunctionIndex::begin()/end() After https://github.com/llvm/llvm-project/pull/130382, RustWrapper needs to call CfiFunctionIndex::symbols() instead.
2025-03-12	Adapt to LLVM dropping CfiFunctionIndex::begin()/end()	Hans Wennborg	-0/+9
	After https://github.com/llvm/llvm-project/pull/130382, RustWrapper needs to call CfiFunctionIndex::symbols() instead.
2025-03-11	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates.	Nicholas Nethercote	-1/+0
	It's no longer necessary now that `-Wunreachable_pub` is being passed.
2025-03-10	Revert "Use workspace lints for crates in `compiler/` #138084"	许杰友 Jieyou Xu (Joe)	-3/+1
	Revert <https://github.com/rust-lang/rust/pull/138084> to buy time to consider options that avoids breaking downstream usages of cargo on distributed `rustc-src` artifacts, where such cargo invocations fail due to inability to inherit `lints` from workspace root manifest's `workspace.lints` (this is only valid for the source rust-lang/rust workspace, but not really the distributed `rustc-src` artifacts). This breakage was reported in <https://github.com/rust-lang/rust/issues/138304>. This reverts commit 48caf81484b50dca5a5cebb614899a3df81ca898, reversing changes made to c6662879b27f5161e95f39395e3c9513a7b97028.
2025-03-09	Rollup merge of #138084 - nnethercote:workspace-lints, r=jieyouxu	Matthias Krüger	-1/+3
	Use workspace lints for crates in `compiler/` This is nicer and hopefully less error prone than specifying lints via bootstrap. r? ``@jieyouxu``
2025-03-07	Rollup merge of #138137 - ZequanWu:fix-triple, r=cuviper	Jacob Pratt	-2/+6
	setTargetTriple now accepts Triple rather than string https://github.com/llvm/llvm-project/pull/129868 updated `setTargetTriple`
2025-03-08	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates.	Nicholas Nethercote	-1/+0
	(Except for `rustc_codegen_cranelift`.) It's no longer necessary now that `unreachable_pub` is in the workspace lints.
2025-03-08	Specify rust lints for `compiler/` crates via Cargo.	Nicholas Nethercote	-0/+3
	By naming them in `[workspace.lints.rust]` in the top-level `Cargo.toml`, and then making all `compiler/` crates inherit them with `[lints] workspace = true`. (I omitted `rustc_codegen_{cranelift,gcc}`, because they're a bit different.) The advantages of this over the current approach: - It uses a standard Cargo feature, rather than special handling in bootstrap. So, easier to understand, and less likely to get accidentally broken in the future. - It works for proc macro crates. It's a shame it doesn't work for rustc-specific lints, as the comments explain.
2025-03-06	rename Triple to Target	Zequan Wu	-3/+3

2025-03-06	setTargetTriple now accepts Triple rather than string	Zequan Wu	-0/+4

2025-03-06	[llvm/PassWrapper] use `size_t` when building arg strings	Josh Stone	-5/+5

2025-03-04	promote ohos targets to tier to with host tools	LuuuXXX	-1/+2

2025-03-01	Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic	bors	-16/+50
	The embedded bitcode should always be prepared for LTO/ThinLTO Fixes #115344. Fixes #117220. There are currently two methods for generating bitcode that used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`. When using with `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain bitcode, then the bitcode is used for LTO. We run the Call Graph Profile Pass twice on the same module. This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`. r? nikic
2025-02-28	compiler: bump `cc` to 1.2.16 to fix `x86` Windows jobs on newest Windows SDK	许杰友 Jieyou Xu (Joe)	-1/+1
	See <https://github.com/rust-lang/rust/issues/137733>.
2025-02-24	Auto merge of #137271 - nikic:gep-nuw-2, r=scottmcm	bors	-0/+18
	Emit getelementptr inbounds nuw for pointer::add() Lower pointer::add (via intrinsic::offset with unsigned offset) to getelementptr inbounds nuw on LLVM versions that support it. This lets LLVM make use of the pre-condition that the offset addition does not wrap in an unsigned sense. Together with inbounds, this also implies that the offset is non-negative. Fixes https://github.com/rust-lang/rust/issues/137217.
2025-02-23	The embedded bitcode should always be prepared for LTO/ThinLTO	DianQK	-16/+50