rust - https://github.com/rust-lang/rust

Age	Commit message	Author	Lines
2017-10-07	rustc: Implement ThinLTO	Alex Crichton	-0/+461
	This commit is an implementation of LLVM's ThinLTO for consumption in rustc itself. Today LTO works by merging all relevant LLVM modules into one and then running optimization passes. "Thin" LTO operates differently by sharding the work more finely, which opens up parallelism between the optimization of codegen units. Further down the road ThinLTO also allows incremental LTO, which should enable even faster release builds without compromising on the performance we have today.
	This commit uses a `-Z thinlto` flag to gate whether ThinLTO is enabled. It then also implements two forms of ThinLTO:
	* In one mode we'll only perform ThinLTO over the codegen units produced in a single compilation. That is, we won't load upstream rlibs, but we'll instead just perform ThinLTO amongst all codegen units produced by the compiler for the local crate. This is intended to emulate a desired end point where we have codegen units turned on by default for all crates and ThinLTO allows us to do this without performance loss.
	* In another mode, like full LTO today, we'll optimize all upstream dependencies in "thin" mode. Unlike today, however, this LTO step is fully parallelized so should finish much more quickly.
	There are a good number of comments about what the implementation is doing and where it came from, but the tl;dr is that currently most of the support here is copied from upstream LLVM. This code duplication is done for a number of reasons:
	* Controlling parallelism means we can use the existing jobserver support to avoid overloading machines.
	* We will likely want a slightly different form of incremental caching which integrates with our own incremental strategy, but this is yet to be determined.
	* This buys us some flexibility about when/where we run ThinLTO, as well as having it tailored to fit our needs for the time being.
	* Finally this allows us to reuse some artifacts such as our `TargetMachine` creation, where not all of the options we use today are necessarily supported by upstream LLVM yet.
	My hope is that we can get some experience with this copy/paste in tree and then eventually upstream some work to LLVM itself to avoid the duplication while still ensuring our needs are met. Otherwise I fear that maintaining these bindings may be quite costly over the years with LLVM updates!
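The two ThinLTO modes and the bounded parallelism described in the message above can be pictured with a small sketch. Everything here is hypothetical (`ThinLtoMode`, `Module`, `run_thin_lto`, the `.thin-lto.o` suffix): it is a toy model using plain threads, not rustc's implementation, and the fixed `max_jobs` worker count only stands in for jobserver tokens.

```rust
use std::collections::VecDeque;
use std::sync::{Arc, Mutex};
use std::thread;

/// Which modules take part in the ThinLTO step (hypothetical names).
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum ThinLtoMode {
    /// Only the codegen units produced for the local crate.
    LocalCrateOnly,
    /// Local codegen units plus modules loaded from upstream rlibs.
    WithUpstream,
}

/// Stand-in for an LLVM module; only the name matters in this sketch.
#[derive(Clone, Debug, PartialEq)]
pub struct Module {
    pub name: String,
}

/// Placeholder for the per-module "thin" optimization work.
fn optimize(module: Module) -> String {
    format!("{}.thin-lto.o", module.name)
}

/// Optimizes the selected modules on at most `max_jobs` threads.
/// The worker count is an assumed stand-in for jobserver tokens.
pub fn run_thin_lto(
    local: Vec<Module>,
    upstream: Vec<Module>,
    mode: ThinLtoMode,
    max_jobs: usize,
) -> Vec<String> {
    let mut modules = local;
    if mode == ThinLtoMode::WithUpstream {
        modules.extend(upstream);
    }
    let queue: VecDeque<Module> = modules.into_iter().collect();
    let queue = Arc::new(Mutex::new(queue));

    let workers: Vec<_> = (0..max_jobs.max(1))
        .map(|_| {
            let queue = Arc::clone(&queue);
            thread::spawn(move || {
                let mut done = Vec::new();
                loop {
                    // Hold the lock only long enough to pop one module.
                    let next = queue.lock().unwrap().pop_front();
                    match next {
                        Some(module) => done.push(optimize(module)),
                        None => break,
                    }
                }
                done
            })
        })
        .collect();

    let mut objects: Vec<String> = workers
        .into_iter()
        .flat_map(|worker| worker.join().unwrap())
        .collect();
    objects.sort(); // deterministic order regardless of scheduling
    objects
}
```

Calling `run_thin_lto(local, upstream, ThinLtoMode::LocalCrateOnly, 2)` ignores `upstream` entirely, which mirrors the first mode; `WithUpstream` mirrors the second.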
2017-09-30	rustc: Enable LTO and multiple codegen units	Alex Crichton	-0/+62
	This commit is a refactoring of the LTO backend in Rust to support compilations with multiple codegen units. The immediate result of this PR is to remove the artificial error emitted by rustc about `-C lto -C codegen-units=8`, but longer term this is intended to lay the groundwork for LTO with incremental compilation and ultimately be the underpinning of ThinLTO support.
	The problem here that needed solving is that when rustc is producing multiple codegen units in one compilation LTO needs to merge them all together. Previously only upstream dependencies were merged, and the code implicitly relied on there being only one local codegen unit. Supporting this involved refactoring the optimization backend architecture for rustc, namely splitting the `optimize_and_codegen` function into `optimize` and `codegen`. After an LLVM module has been optimized it may be blocked and queued up for LTO, and only after LTO are modules code generated.
	Non-LTO compilations should look the same as they do today backend-wise: we'll spin up a thread for each codegen unit and optimize/codegen in that thread. LTO compilations will, however, send the LLVM module back to the coordinator thread once optimizations have finished. When all LLVM modules have finished optimizing the coordinator will invoke the LTO backend, producing a further list of LLVM modules. Currently this is always a list of one LLVM module. The coordinator then spawns further work to run LTO and code generation passes over each module.
	In the course of this refactoring a number of other pieces were refactored:
	* Management of the bytecode encoding in rlibs was centralized into one module instead of being scattered across LTO and linking.
	* Some internal refactorings on the link stage of the compiler were done to work directly from `CompiledModule` structures instead of lists of paths.
	* The trans time-graph output was tweaked a little to include a name on each bar and inflate the size of the bars a little.
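The split between `optimize` and `codegen`, with a coordinator collecting optimized modules before LTO, can be sketched roughly as follows. This is a toy model under stated assumptions: `Module`, `run_lto` and `compile` are hypothetical stand-ins, the "merge" is a string join, and a standard-library channel plays the role of the coordinator's message queue.

```rust
use std::sync::mpsc;
use std::thread;

/// Stand-in for an LLVM module (hypothetical; not rustc's real type).
#[derive(Clone, Debug, PartialEq)]
pub struct Module {
    pub name: String,
    pub optimized: bool,
}

/// Per-codegen-unit optimization, run on a worker thread.
fn optimize(mut module: Module) -> Module {
    module.optimized = true;
    module
}

/// Code generation for one module; returns the object file name.
fn codegen(module: &Module) -> String {
    format!("{}.o", module.name)
}

/// Toy LTO backend: consumes every optimized module and returns a new list.
/// As the commit message notes, that list currently holds one module.
fn run_lto(mut modules: Vec<Module>) -> Vec<Module> {
    modules.sort_by(|a, b| a.name.cmp(&b.name));
    let names: Vec<&str> = modules.iter().map(|m| m.name.as_str()).collect();
    vec![Module { name: names.join("+"), optimized: true }]
}

/// Coordinator: one thread per codegen unit. Without LTO each thread
/// optimizes and codegens; with LTO it sends the optimized module back.
pub fn compile(units: Vec<Module>, lto: bool) -> Vec<String> {
    let (tx, rx) = mpsc::channel();
    let mut workers = Vec::new();
    for unit in units {
        let tx = tx.clone();
        workers.push(thread::spawn(move || {
            let optimized = optimize(unit);
            if lto {
                tx.send(optimized).unwrap(); // blocked and queued for LTO
                None
            } else {
                Some(codegen(&optimized))
            }
        }));
    }
    drop(tx); // the channel closes once every worker has finished

    let mut objects: Vec<String> = workers
        .into_iter()
        .filter_map(|worker| worker.join().unwrap())
        .collect();

    if lto {
        // All modules are optimized; invoke the LTO backend, then spawn
        // further work to code generate each module it produced.
        let optimized: Vec<Module> = rx.iter().collect();
        let jobs: Vec<_> = run_lto(optimized)
            .into_iter()
            .map(|module| thread::spawn(move || codegen(&module)))
            .collect();
        objects.extend(jobs.into_iter().map(|job| job.join().unwrap()));
    }
    objects.sort();
    objects
}
```

With `lto == false` two units yield two object files; with `lto == true` the same units yield a single merged one, matching the "list of one LLVM module" described above.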
2017-09-15	Add 'native' to -C target-cpu=help	Matt Ickstadt	-0/+7

2017-08-08	Fix covered-switch-default warnings in PassWrapper	kennytm	-2/+4
	(See #39063 for explanation)
2017-07-31	Gate LLVMRustHasFeature on LLVM_RUSTLLVM	Josh Stone	-1/+1
	Commit c4710203c098b in #43492 made `LLVMRustHasFeature` "more robust" by using `getFeatureTable()`. However, this function is specific to Rust's own LLVM fork, not upstream LLVM-4.0, so we need to use `#if LLVM_RUSTLLVM` to guard this call.
2017-07-28	Make LLVMRustHasFeature more robust	Luca Barbato	-13/+7
	The function should accept feature strings that old LLVM might not support. Simplify the code using the same approach used by LLVMRustPrintTargetFeatures. Stub out the function for non-4.0 LLVM and update the tests accordingly.
2017-07-23	Auto merge of #43387 - TimNN:rustllvm50, r=alexcrichton	bors	-79/+154
	Update Rust LLVM bindings for LLVM 5.0 This is the initial set of changes to update the rust llvm bindings for 5.0. The llvm commits necessitating these changes are linked from the tracking issue, #43370.
2017-07-21	Fix archive member names on 5.0	Alex Crichton	-0/+4

2017-07-21	update attributes API usage	Alex Crichton	-1/+26

2017-07-21	rustllvm: split DebugLoc in UnpackOptimizationDiagnostic	Tim Neumann	-3/+20

2017-07-21	rustllvm: update to SyncScope::ID	Tim Neumann	-0/+13

2017-07-21	rustllvm: adjust usage of createNameSpace	Tim Neumann	-1/+5

2017-07-21	rustllvm: adjust usage of createPointerType	Tim Neumann	-1/+9

2017-07-21	rustllvm: use LLVMMetadataRef	Tim Neumann	-73/+75

2017-07-21	rustllvm: define LLVM_VERSION_LT	Tim Neumann	-0/+2

2017-07-18	Fix LLVM assertion when a weak symbol is defined in global_asm.	Vadzim Dambrouski	-1/+1
	This change will fix the issue from https://github.com/japaric/svd2rust/pull/130
2017-07-12	[LLVM] Avoid losing the !nonnull attribute in SROA	Ariel Ben-Yehuda	-5/+1
	This still does not work on 32-bit archs because of an LLVM limitation, but this is only an optimization, so let's push it on 64-bit only for now. Fixes #37945
2017-07-06	Auto merge of #42727 - alexcrichton:allocators-new, r=eddyb	bors	-0/+4
	rustc: Implement the #[global_allocator] attribute This PR is an implementation of [RFC 1974] which specifies a new method of defining a global allocator for a program. This obsoletes the old `#![allocator]` attribute and also removes support for it. [RFC 1974]: https://github.com/rust-lang/rfcs/pull/1974 The new `#[global_allocator]` attribute solves many issues encountered with the `#![allocator]` attribute such as composition and restrictions on the crate graph itself. The compiler now has much more control over the ABI of the allocator and how it's implemented, allowing much more freedom in terms of how this feature is implemented. cc #27389
2017-07-05	rustc: Implement the #[global_allocator] attribute	Alex Crichton	-0/+4
	This PR is an implementation of [RFC 1974] which specifies a new method of defining a global allocator for a program. This obsoletes the old `#![allocator]` attribute and also removes support for it. [RFC 1974]: https://github.com/rust-lang/rfcs/pull/1974 The new `#[global_allocator]` attribute solves many issues encountered with the `#![allocator]` attribute such as composition and restrictions on the crate graph itself. The compiler now has much more control over the ABI of the allocator and how it's implemented, allowing much more freedom in terms of how this feature is implemented. cc #27389
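For readers unfamiliar with the attribute, here is a minimal sketch of how `#[global_allocator]` is used. It relies on the `std::alloc::GlobalAlloc` trait as stabilized in later Rust releases, which may differ from the interface that landed with this commit; `Counting`, `ALLOCATIONS` and `allocations_during` are hypothetical names introduced only for illustration.

```rust
use std::alloc::{GlobalAlloc, Layout, System};
use std::sync::atomic::{AtomicUsize, Ordering};

/// Hypothetical allocator: forwards to the system allocator and counts
/// how many allocation requests it has seen.
pub struct Counting;

pub static ALLOCATIONS: AtomicUsize = AtomicUsize::new(0);

unsafe impl GlobalAlloc for Counting {
    unsafe fn alloc(&self, layout: Layout) -> *mut u8 {
        ALLOCATIONS.fetch_add(1, Ordering::Relaxed);
        // Safety: the caller upholds the `GlobalAlloc::alloc` contract.
        unsafe { System.alloc(layout) }
    }

    unsafe fn dealloc(&self, ptr: *mut u8, layout: Layout) {
        // Safety: `ptr` was returned by `System.alloc` with this layout.
        unsafe { System.dealloc(ptr, layout) }
    }
}

/// Registers `Counting` as the allocator for the whole program.
#[global_allocator]
static GLOBAL: Counting = Counting;

/// Returns how many allocations were requested while `f` ran
/// (other threads may contribute to the count as well).
pub fn allocations_during<F: FnOnce()>(f: F) -> usize {
    let before = ALLOCATIONS.load(Ordering::Relaxed);
    f();
    ALLOCATIONS.load(Ordering::Relaxed) - before
}
```

Because the allocator is an ordinary `static`, it can wrap another allocator, which is the kind of composition the message says the old `#![allocator]` attribute made difficult.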
2017-07-04	Auto merge of #42993 - stepancheg:editorconfig, r=brson	bors	-0/+6
	Add .editorconfig to src/rustllvm ... which uses 2 space indent instead of common 4 spaces.
2017-07-01	When writing LLVM IR output demangled fn name in comments	Stepan Koltsov	-2/+126
	`--emit=llvm-ir` looks like this now:
	```
	; <alloc::vec::Vec<T> as core::ops::index::IndexMut<core::ops::range::RangeFull>>::index_mut
	; Function Attrs: inlinehint uwtable
	define internal { i8, i64 } @"_ZN106_$LT$alloc..vec..Vec$LT$T$GT$$u20$as$u20$core..ops..index..IndexMut$LT$core..ops..range..RangeFull$GT$$GT$9index_mut17h7f7b576609f30262E"(%"alloc::vec::Vec<u8>" dereferenceable(24)) unnamed_addr #0 {
	start:
	...
	```
	cc https://github.com/integer32llc/rust-playground/issues/15
2017-06-30	Add .editorconfig to src/rustllvm	Stepan Koltsov	-0/+6
	... which uses 2 space indent instead of common 4 spaces.
2017-06-27	Rebase LLVM on top of LLVM 4.0.1	Ariel Ben-Yehuda	-1/+1
	Fixes #42893.
2017-06-19	Update LLVM to pick StackColoring improvement	Ariel Ben-Yehuda	-1/+1
	Fixes #40883.
2017-06-19	Backport fixes to LLVM 4.0 ARM codegen bugs	Ariel Ben-Yehuda	-1/+1
	So ARM had quite a few codegen bugs on LLVM 4.0 which are fixed on LLVM trunk. This backports 5 of them:
	r297871 - ARM: avoid clobbering register in v6 jump-table expansion. - fixes rust-lang/rust#42248
	r294949 - [Thumb-1] TBB generation: spot redefinitions of index
	r295816 - [ARM] Fix constant islands pass.
	r300870 - [Thumb-1] Fix corner cases for compressed jump tables
	r302650 - [IfConversion] Add missing check in IfConversion/canFallThroughTo - unblocks rust-lang/rust#39409
2017-06-16	Auto merge of #42410 - nagisa:llvmup, r=sanxiyn	bors	-1/+1
	Upgrade LLVM Includes https://github.com/rust-lang/llvm/pull/80
2017-06-08	Upgrade LLVM	Simonas Kazlauskas	-1/+1
	Includes https://github.com/rust-lang/llvm/pull/80 Includes https://github.com/rust-lang/llvm/pull/79 Also adds tests and thus fixes #24194
2017-06-04	Merge branch 'profiling' of github.com:whitequark/rust into profiling	Marco Castelluccio	-0/+4

2017-05-28	add NullOp::SizeOf and BinOp::Offset	Ariel Ben-Yehuda	-5/+9

2017-05-13	LLVM: Add support for EABI-compliant libcalls on MSP430.	Vadzim Dambrouski	-1/+1
	This change will allow rust code to have proper support for division and multiplication using libgcc libcalls.
2017-05-06	trigger llvm rebuild	Tim Neumann	-1/+1

2017-05-01	Auto merge of #41560 - alevy:rwpi-ropi, r=eddyb	bors	-23/+43
	Add RWPI/ROPI relocation model support
	This PR adds support for using LLVM 4's ROPI and RWPI relocation models for ARM. ROPI (Read-Only Position Independence) and RWPI (Read-Write Position Independence) are two new relocation models in LLVM for the ARM backend ([LLVM changeset](https://reviews.llvm.org/rL278015)). The motivation is that these are the specific strategies we use in userspace [Tock](https://www.tockos.org) apps, so supporting this is an important step (perhaps the final step, but can't confirm yet) in enabling userspace Rust processes.
	## Explanation
	ROPI makes all code and immutable accesses PC relative, but not assumed to be overridden at runtime (so for example, jumps are always relative). RWPI uses a base register (`r9`) that stores the address of the GOT in memory, so the runtime (e.g. a kernel) only adjusts `r9` to tell running code where the GOT is.
	## Complications adding support in Rust
	While this landed in LLVM master back in August, the header files in `llvm-c` have not been updated yet to reflect it. Rust replicates that header file's version of the `LLVMRelocMode` enum as the Rust enum `llvm::RelocMode` and uses an implicit cast in the ffi to translate from Rust's notion of the relocation model to the LLVM library's notion. My workaround for this currently is to replace the `LLVMRelocMode` argument to `LLVMTargetMachineRef` with an int and use the hardcoded int representation of the `RelocMode` enum. This is A Bad Idea(tm), but I think very nearly the right thing. Would a better alternative be to patch rust-llvm to support these enum variants (also a fairly trivial change)?
2017-05-01	Add profiling support, through the rustc -Z profile flag.	whitequark	-0/+4
	When -Z profile is passed, the GCDAProfiling LLVM pass is added to the pipeline, which uses debug information to instrument the IR. After compiling with -Z profile, the $(OUT_DIR)/$(CRATE_NAME).gcno file is created, containing initial profiling information. After the built program is run, the $(OUT_DIR)/$(CRATE_NAME).gcda file is created, containing branch counters. The created .gcno and .gcda files can be processed using the "llvm-cov gcov" and "lcov" tools. The profiling data LLVM generates does not faithfully follow GCC's format for .gcno and .gcda files, and so it will probably not work with other tools (such as gcov itself) that consume these files.
2017-04-28	Added LLVMRustRelocMode	Amit Aryeh Levy	-34/+43
	Replaces the llvm-c exposed LLVMRelocMode, which does not include all relocation model variants, with an LLVMRustRelocMode modeled after LLVMRustCodeModel.
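The idea of a wrapper-owned enum that is translated variant by variant, rather than cast from an integer that happens to match a header, can be sketched in Rust. All names here are hypothetical: `RustRelocMode` plays the role of the FFI-facing enum, `LlvmReloc` stands in for LLVM's own relocation model type, and mapping `Default` to `None` is an assumption of this sketch rather than a statement about the real wrapper.

```rust
/// FFI-facing relocation mode owned by the wrapper (hypothetical).
/// Explicit discriminants make the numeric contract visible on both sides.
#[repr(C)]
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum RustRelocMode {
    Default = 0,
    Static = 1,
    Pic = 2,
    DynamicNoPic = 3,
    Ropi = 4,
    Rwpi = 5,
    RopiRwpi = 6,
}

/// Stand-in for the relocation model enum inside LLVM (hypothetical).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum LlvmReloc {
    Static,
    Pic,
    DynamicNoPic,
    Ropi,
    Rwpi,
    RopiRwpi,
}

/// Parses a relocation-model name; the spellings are illustrative.
pub fn parse_reloc_model(name: &str) -> Option<RustRelocMode> {
    match name {
        "default" => Some(RustRelocMode::Default),
        "static" => Some(RustRelocMode::Static),
        "pic" => Some(RustRelocMode::Pic),
        "dynamic-no-pic" => Some(RustRelocMode::DynamicNoPic),
        "ropi" => Some(RustRelocMode::Ropi),
        "rwpi" => Some(RustRelocMode::Rwpi),
        "ropi-rwpi" => Some(RustRelocMode::RopiRwpi),
        _ => None,
    }
}

/// Explicit translation: a new variant on either side causes a compile
/// error here instead of a silently wrong integer cast.
pub fn from_rust(mode: RustRelocMode) -> Option<LlvmReloc> {
    match mode {
        RustRelocMode::Default => None, // assumed: let the target decide
        RustRelocMode::Static => Some(LlvmReloc::Static),
        RustRelocMode::Pic => Some(LlvmReloc::Pic),
        RustRelocMode::DynamicNoPic => Some(LlvmReloc::DynamicNoPic),
        RustRelocMode::Ropi => Some(LlvmReloc::Ropi),
        RustRelocMode::Rwpi => Some(LlvmReloc::Rwpi),
        RustRelocMode::RopiRwpi => Some(LlvmReloc::RopiRwpi),
    }
}
```

The exhaustive `match` is the design point: unlike the hardcoded int workaround described in the earlier RWPI/ROPI pull request, the mapping cannot drift without the compiler noticing.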
2017-04-27	Update LLVM to fix incorrect codegen on MSP430.	Vadzim Dambrouski	-1/+1
	The bug was reported by @akovaski here: https://github.com/rust-embedded/rfcs/issues/20#issuecomment-296482148
2017-04-26	Add RWPI/ROPI relocation model support	Amit Aryeh Levy	-4/+15
	Adds support for using LLVM 4's ROPI and RWPI relocation models for ARM
2017-04-26	Cherry pick LLVM hexagon fixes	Michael Wu	-1/+1

2017-04-25	Add Hexagon support	Michael Wu	-1/+8
	This requires an updated LLVM with D31999 and D32000 to build libcore. A basic hello world builds and runs successfully on the hexagon simulator.
2017-04-12	Expose LLVM appendModuleInlineAsm	A.J. Gardner	-0/+4

2017-03-24	update LLVM with fix for PR32379	Ariel Ben-Yehuda	-1/+1
	Fixes #40593.
2017-03-20	Auto merge of #39628 - arielb1:shimmir, r=eddyb	bors	-1/+1
	Translate shims using MIR This removes one large remaining part of old trans.
2017-03-19	update LLVM	Ariel Ben-Yehuda	-1/+1
	pick up a fix to LLVM PR29151.
2017-03-16	add missing global metadata	Tim Neumann	-3/+8

2017-03-16	clang-format	Tim Neumann	-5/+2

2017-03-16	isolate llvm 4.0 code path	Tim Neumann	-12/+6

2017-03-12	rustbuild: Add option for enabling partial LLVM rebuilds	Vadim Petrochenkov	-1/+1

2017-03-10	LLVM: Update submodule to include SRet support patch for MSP430.	Vadzim Dambrouski	-1/+1

2017-03-02	LLVM: Update submodule to include x86-interrupt ABI patches	Philipp Oppermann	-1/+1

2017-02-15	rustc: Link statically to the MSVCRT	Alex Crichton	-1/+1
	This commit changes all MSVC rustc binaries to be compiled with `-C target-feature=+crt-static` to link statically against the MSVCRT instead of dynamically (as it does today). This also necessitates compiling LLVM in a different fashion, ensuring it's compiled with `/MT` instead of `/MD`. cc #37406
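As a small illustration of what the flag controls, Rust code can observe whether it was compiled with `-C target-feature=+crt-static` through conditional compilation. The function name `crt_linkage` is hypothetical and this snippet is not part of the commit; it only shows the `cfg!(target_feature = "crt-static")` check.

```rust
/// Reports which C runtime linkage this crate was compiled for.
/// `crt_linkage` is an illustrative helper, not a rustc or std API.
pub fn crt_linkage() -> &'static str {
    // `cfg!` expands to a compile-time boolean for the active target.
    if cfg!(target_feature = "crt-static") {
        "static"
    } else {
        "dynamic"
    }
}
```

On an MSVC target, building with `-C target-feature=+crt-static` would make this return `"static"`, corresponding to `/MT` rather than `/MD` on the C/C++ side.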
2017-02-13	Auto merge of #39456 - nagisa:mir-switchint-everywhere, r=nikomatsakis	bors	-0/+6
	[MIR] SwitchInt Everywhere
	Something I've been meaning to do for a very long while. This PR essentially gets rid of 3 kinds of conditional branching and only keeps the most general one - `SwitchInt`. The primary benefit is that dealing with MIR no longer involves dealing with 3 different ways to do conditional control flow. On the other hand, constructing a `SwitchInt` currently requires more code than was previously necessary to build an equivalent `If` terminator. Something trivially "fixable" with some constructor methods somewhere (MIR needs stuff like that badly in general).
	Some timings (tl;dr: slightly faster^1 (unexpected), but also uses slightly more memory at peak (expected)):
	^1: Not sure if the speed benefits are because of LLVM liking the generated code better or the compiler itself getting compiled better. Either way, it's a net benefit.
	The CORE and SYNTAX timings were done for compilation without optimisation.
	```
	AFTER:
	Building stage1 std artifacts (x86_64-unknown-linux-gnu -> x86_64-unknown-linux-gnu)
	Finished release [optimized] target(s) in 31.50 secs
	Finished release [optimized] target(s) in 31.42 secs
	Building stage1 compiler artifacts (x86_64-unknown-linux-gnu -> x86_64-unknown-linux-gnu)
	Finished release [optimized] target(s) in 439.56 secs
	Finished release [optimized] target(s) in 435.15 secs
	CORE: 99% (24.81 real, 0.13 kernel, 24.57 user); 358536k resident
	CORE: 99% (24.56 real, 0.15 kernel, 24.36 user); 359168k resident
	SYNTAX: 99% (49.98 real, 0.48 kernel, 49.42 user); 653416k resident
	SYNTAX: 99% (50.07 real, 0.58 kernel, 49.43 user); 653604k resident
	BEFORE:
	Building stage1 std artifacts (x86_64-unknown-linux-gnu -> x86_64-unknown-linux-gnu)
	Finished release [optimized] target(s) in 31.84 secs
	Building stage1 compiler artifacts (x86_64-unknown-linux-gnu -> x86_64-unknown-linux-gnu)
	Finished release [optimized] target(s) in 451.17 secs
	CORE: 99% (24.66 real, 0.20 kernel, 24.38 user); 351096k resident
	CORE: 99% (24.36 real, 0.17 kernel, 24.18 user); 352284k resident
	SYNTAX: 99% (52.24 real, 0.56 kernel, 51.66 user); 645544k resident
	SYNTAX: 99% (51.55 real, 0.48 kernel, 50.99 user); 646428k resident
	```
	cc @nikomatsakis @eddyb
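How an `If` collapses into the more general `SwitchInt` can be shown with a toy model. The types below are hypothetical simplifications, not rustc's MIR definitions: the discriminant is a concrete `u64` rather than an operand so that the terminator can be evaluated directly, and the `if_` helper is the kind of constructor method the message wishes for.

```rust
/// Index of a basic block in a function body (toy representation).
pub type BasicBlock = usize;

/// Simplified terminators; real MIR has more kinds and uses operands.
#[derive(Clone, Debug, PartialEq)]
pub enum Terminator {
    Goto { target: BasicBlock },
    /// `targets` has one more entry than `values`; the last entry is the
    /// "otherwise" block taken when no value matches.
    SwitchInt {
        discr: u64,
        values: Vec<u64>,
        targets: Vec<BasicBlock>,
    },
}

/// Builds the `SwitchInt` equivalent of `if cond { then } else { else }`:
/// value 0 (false) jumps to the else block, anything else to the then block.
pub fn if_(cond: bool, then_block: BasicBlock, else_block: BasicBlock) -> Terminator {
    Terminator::SwitchInt {
        discr: cond as u64,
        values: vec![0],
        targets: vec![else_block, then_block],
    }
}

/// Evaluates a terminator and returns the block control flows to.
pub fn successor(term: &Terminator) -> BasicBlock {
    match term {
        Terminator::Goto { target } => *target,
        Terminator::SwitchInt { discr, values, targets } => {
            match values.iter().position(|value| value == discr) {
                Some(index) => targets[index],
                None => *targets.last().expect("SwitchInt needs an otherwise target"),
            }
        }
    }
}
```

With this shape, any pass that understands `SwitchInt` handles boolean branches for free, which is the simplification the message describes; the cost is the slightly more verbose construction that `if_` hides.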