rust - https://github.com/rust-lang/rust

Age	Commit message	Author	Lines
2023-12-02	Rename `HandlerInner::delayed_span_bugs` as `HandlerInner::span_delayed_bugs`.	Nicholas Nethercote	-1/+1
	For reasons similar to the previous commit.
2023-12-02	Rename `HandlerInner::delay_span_bug` as `HandlerInner::span_delayed_bug`.	Nicholas Nethercote	-2/+2
	Because the corresponding `Level` is `DelayedBug` and `span_delayed_bug` follows the pattern used everywhere else: `span_err`, `span_warning`, etc.
2023-11-29	Auto merge of #118348 - Mark-Simulacrum:feature-code-size, r=compiler-errors	bors	-4/+4
	Cut code size for feature hashing. This locally cuts ~32 kB of .text instructions. This isn't really a clear win in terms of readability, but IMO the code size benefits are worth it (even if they're not necessarily present in the x86_64 hyperoptimized build, I expect them to translate similarly to other platforms). Ultimately there's lots of small-ish low-hanging fruit like this that seems worth tackling to me, and that could translate into larger wins in aggregate.
2023-11-27	QueryContext: rename try_collect_active_jobs -> collect_active_jobs and change its return type from Option<QueryMap> to QueryMap	klensy	-8/+5
	As there is currently always Some(...) inside.
2023-11-26	Cut code size for feature hashing	Mark Rousskov	-4/+4
	This locally cuts ~32 kB of .text instructions.
2023-11-26	Auto merge of #117301 - saethlin:finish-rmeta-encoding, r=WaffleLapkin	bors	-1/+1
	Call FileEncoder::finish in rmeta encoding.
	Fixes https://github.com/rust-lang/rust/issues/117254
	The bug here was that rmeta encoding never called FileEncoder::finish. Now it does. Most of the changes here are needed to support that, since rmeta encoding wants to finish _then_ access the File in the encoder, so finish can't move out.
	I tried adding a `cfg(debug_assertions)` exploding Drop impl to FileEncoder that checked for finish being called before dropping, but fatal errors cause unwinding so this isn't really possible. If we encounter a fatal error with a dirty FileEncoder, the Drop impl ICEs even though the implementation is correct. If we try to paper over that by wrapping FileEncoder in ManuallyDrop then that just erases the fact that Drop automatically checks that we call finish on all paths.
	I also changed the name of DepGraph::encode to DepGraph::finish_encoding, because that's what it does and it makes the fact that it is the path to FileEncoder::finish less confusing.
	r? `@WaffleLapkin`
2023-11-26	Use `rustc_fluent_macro::fluent_messages!` directly.	Nicholas Nethercote	-3/+1
	Currently we always do this:
	```
	use rustc_fluent_macro::fluent_messages;
	...
	fluent_messages! { "./example.ftl" }
	```
	But there is no need, we can just do this everywhere:
	```
	rustc_fluent_macro::fluent_messages! { "./example.ftl" }
	```
	which is shorter.
2023-11-26	Avoid need for `{D,Subd}iagnosticMessage` imports.	Nicholas Nethercote	-1/+0
	The `fluent_messages!` macro produces uses of `crate::{D,Subd}iagnosticMessage`, which means that every crate using the macro must have this import:
	```
	use rustc_errors::{DiagnosticMessage, SubdiagnosticMessage};
	```
	This commit changes the macro to instead use `rustc_errors::{D,Subd}iagnosticMessage`, which avoids the need for the imports.
2023-11-23	Rollup merge of #118169 - SparrowLii:deadlock_issue, r=compiler-errors	Matthias Krüger	-6/+12
	Print the query map when a deadlock is detected while using the parallel front end, so that we can analyze where and why the deadlock occurs.
2023-11-22	Call FileEncoder::finish in rmeta encoding	Ben Kimock	-1/+1

2023-11-23	Nit of deadlock detected	SparrowLii	-1/+1

2023-11-22	also make 'core_intrinsics' internal	Ralf Jung	-1/+1

2023-11-22	Replace `custom_encodable` with `encodable`.	Nicholas Nethercote	-0/+1
	By default, `newtype_index!` types get a default `Encodable`/`Decodable` impl. You can opt out of this with `custom_encodable`. Opting out is the opposite to how Rust normally works with autogenerated (derived) impls. This commit inverts the behaviour, replacing `custom_encodable` with `encodable` which opts into the default `Encodable`/`Decodable` impl. Only 23 of the 59 `newtype_index!` occurrences need `encodable`. Even better, there were eight crates with a dependency on `rustc_serialize` just from unused default `Encodable`/`Decodable` impls. This commit removes that dependency from those eight crates.
2023-11-22	print query map for deadlock when using parallel front end	SparrowLii	-6/+12

2023-11-21	Fix `clippy::needless_borrow` in the compiler	Nilstrieb	-6/+6
	`x clippy compiler -Aclippy::all -Wclippy::needless_borrow --fix`. Then I had to remove a few unnecessary parens and muts that were exposed now.
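For readers unfamiliar with the lint, here is a minimal sketch of the kind of code `clippy::needless_borrow` flags and the shape of the fix. The function names (`shout`, `before`, `after`) are hypothetical and are not taken from the compiler sources.

```rust
/// Hypothetical helper that takes a string slice.
fn shout(s: &str) -> String {
    s.to_uppercase()
}

/// Before the fix: `name` is already a `&str`, so `&name` builds a `&&str`
/// that deref coercion immediately turns back into a `&str`.
fn before(name: &str) -> String {
    shout(&name)
}

/// After the fix: the extra borrow is simply dropped.
fn after(name: &str) -> String {
    shout(name)
}
```

Both functions behave identically; the lint only removes the redundant reference.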
2023-11-21	Add HashStable_NoContext to simplify HashStable implementations in rustc_type_ir	Michael Goulet	-2/+0

2023-11-16	Reduce exposure of things.	Nicholas Nethercote	-30/+27

2023-11-15	Remove unused features.	Nicholas Nethercote	-2/+0

2023-10-28	Rollup merge of #116534 - cjgillot:no-dep-tasks, r=davidtwco	Jubilee	-22/+4
	Remove -Zdep-tasks. This option is not useful any more, we can use `tracing` and `RUSTC_LOG` to debug the dep-graph.
2023-10-26	Stash and cancel cycle errors for auto trait leakage in opaques	Michael Goulet	-1/+13

2023-10-22	fix broken link: update incremental compilation url	gvozdvmozgu	-1/+1

2023-10-13	Format all the let chains in compiler	Michael Goulet	-1/+3

2023-10-08	Remove -Zdep-tasks.	Camille GILLOT	-22/+4

2023-09-27	Auto merge of #116163 - compiler-errors:lazyness, r=oli-obk	bors	-3/+1
	Don't store lazyness in `DefKind::TyAlias`.
	1. Don't store lazyness of a type alias in its `DefKind`, but instead via a query.
	2. This allows us to treat type aliases as lazy if `#[feature(lazy_type_alias)]` OR if the alias contains a TAIT, rather than having checks for both in separate parts of the codebase.
	r? `@oli-obk` cc `@fmease`
2023-09-26	Don't store lazyness in DefKind	Michael Goulet	-3/+1

2023-09-25	Rename `cold_path` to `outline`	John Kåre Alsaker	-2/+2

2023-09-21	Move `DepKind` to `rustc_query_system` and define it as `u16`	John Kåre Alsaker	-331/+364

2023-09-20	Auto merge of #115542 - saethlin:fileencoder-is-bufwriter, r=WaffleLapkin	bors	-3/+5
	Simplify/Optimize FileEncoder.
	FileEncoder is basically a BufWriter, except that it exposes access to the not-yet-written region of the buffer so that some users can write directly to the buffer. This strategy is awesome because it lets us avoid calling memcpy for small copies. The previous strategy, however, was based on the writer accessing a `&mut [MaybeUninit<u8>; N]` and returning a `&[u8]`, an API which currently mandates the use of unsafe code, making that interface in general not that appealing.
	So this PR cleans up the FileEncoder implementation and builds on that general idea of direct buffer access in order to prevent `memcpy` calls in a few key places when encoding the dep graph and rmeta tables. The interface used here is now 100% safe, with the caveat that internally we need to avoid trusting the number of bytes that the provided function claims to have written.
	The original primary objective of this PR was to clean up the FileEncoder implementation so that the fix for the following issues would be easy to implement. The fix for these issues is to correctly update self.buffered even when writes fail, which I think is now easy to verify manually, because all the FileEncoder methods are small.
	Fixes https://github.com/rust-lang/rust/issues/115298
	Fixes https://github.com/rust-lang/rust/issues/114671
	Fixes https://github.com/rust-lang/rust/issues/114045
	Fixes https://github.com/rust-lang/rust/issues/108100
	Fixes https://github.com/rust-lang/rust/issues/106787
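The direct-buffer-access idea described above can be sketched roughly as follows. This is not the actual `FileEncoder` code: `SketchEncoder`, `BUF_SIZE`, `write_with`, `flush_buf` and `finish` are names assumed for illustration, and the sink is any `std::io::Write`. It shows the two properties the description calls out: the byte count reported by the caller's closure is clamped rather than trusted, and `buffered` is reset even when a write fails.

```rust
use std::io::{self, Write};

/// Hypothetical buffer size, kept tiny for illustration.
const BUF_SIZE: usize = 64;

/// Illustrative stand-in for a buffered encoder; not rustc's `FileEncoder`.
struct SketchEncoder<W: Write> {
    sink: W,
    buf: [u8; BUF_SIZE],
    buffered: usize,
}

impl<W: Write> SketchEncoder<W> {
    fn new(sink: W) -> Self {
        Self { sink, buf: [0; BUF_SIZE], buffered: 0 }
    }

    /// Write out the buffered bytes. `buffered` is reset even if the write
    /// fails, so a failed write never leaves stale bytes to be written twice.
    fn flush_buf(&mut self) -> io::Result<()> {
        let result = self.sink.write_all(&self.buf[..self.buffered]);
        self.buffered = 0;
        result
    }

    /// Hand the caller up to `n` bytes of unwritten buffer space. The closure
    /// returns how many bytes it claims to have written; the claim is clamped
    /// to `n` so a wrong answer cannot push `buffered` out of bounds.
    fn write_with(&mut self, n: usize, fill: impl FnOnce(&mut [u8]) -> usize) -> io::Result<()> {
        assert!(n <= BUF_SIZE);
        if self.buffered + n > BUF_SIZE {
            self.flush_buf()?;
        }
        let slot = &mut self.buf[self.buffered..self.buffered + n];
        let written = fill(slot);
        self.buffered += written.min(n);
        Ok(())
    }

    /// Flush whatever is still buffered; takes `&mut self` so the sink stays
    /// accessible afterwards.
    fn finish(&mut self) -> io::Result<()> {
        self.flush_buf()
    }
}
```

A caller would write small values with something like `enc.write_with(4, |slot| { slot[..2].copy_from_slice(&[1, 2]); 2 })`, avoiding a separate copy into the buffer.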
2023-09-20	PR feedback	Ben Kimock	-4/+1

2023-09-12	Use `UnhashMap` for the index	John Kåre Alsaker	-2/+3

2023-09-12	Encode the number of dep kinds encountered in the dep graph	John Kåre Alsaker	-3/+15

2023-09-12	Store a index per dep node kind	John Kåre Alsaker	-7/+14

2023-09-10	Reimplement FileEncoder with a small-write optimization	Ben Kimock	-3/+8

2023-09-11	Auto merge of #115388 - Zoxc:sharded-lock, r=SparrowLii	bors	-16/+10
	Add optimized lock methods for `Sharded` and refactor `Lock`.
	This adds methods to `Sharded` which pick a shard and also lock it. These branch on parallelism just once instead of twice, improving performance.
	Benchmark for `cfg(parallel_compiler)` and 1 thread:
	Benchmark	Before (time)	After (time)	Change
	🟣 clap:check	1.6461s	1.6345s	-0.70%
	🟣 hyper:check	0.2414s	0.2394s	-0.83%
	🟣 regex:check	0.9205s	0.9143s	-0.67%
	🟣 syn:check	1.4981s	1.4869s	-0.75%
	🟣 syntex_syntax:check	5.7629s	5.7256s	-0.65%
	Total	10.0690s	10.0008s	-0.68%
	Summary	1.0000s	0.9928s	-0.72%
	cc `@SparrowLii`
2023-09-10	Auto merge of #115668 - Zoxc:deadlock-msg, r=jackh726	bors	-1/+3
	Make the deadlock panic clearly refer to a deadlock
2023-09-08	Make the deadlock panic clearly refer to a deadlock	John Kåre Alsaker	-1/+3

2023-09-08	Add optimized lock methods for `Sharded`	John Kåre Alsaker	-16/+10

2023-09-07	Use `Freeze` for `SourceFile.lines`	John Kåre Alsaker	-4/+5

2023-09-07	Auto merge of #110050 - saethlin:better-u32-encoding, r=nnethercote	bors	-43/+381
	Use a specialized varint + bitpacking scheme for DepGraph encoding.
	The previous scheme here uses leb128 to encode the edge tables that represent the incr comp dependency graph. The problem with that scheme is that leb128 has overhead for larger values, and generally relies on the distribution of encoded values being heavily skewed towards smaller values. That is definitely not the case for a dep node index: since they are handed out sequentially and the whole range is covered, the distribution is actually biased in the opposite direction. Most dep node indices are large.
	This PR implements a different varint encoding scheme. Instead of applying varint encoding to individual dep node indices (which is extremely branchy) we now apply it per node. While being built, each node now stores its edges in a `SmallVec` with a bit of extra logic to track the maximum edge value. Then we varint encode the whole batch. This is a gamble: we save on space by only claiming 2 bits per node instead of ~3 bits per edge, which is a nice saving, but that needs to balance out against the space overhead that a single large index in a node with a lot of edges causes, since it forces unnecessary bytes into each of that node's edge indices.
	Then, to keep the runtime overhead of this encoding scheme down, we deserialize our indices by loading 4 bytes for each and then masking off the bytes that aren't ours. This is much less code and far fewer branches than leb128, but relies on having some readable bytes past the end of each edge list. We explicitly add such padding to the in-memory data during decoding. We also do this decoding lazily, turning a dense on-disk encoding into a peak memory reduction.
	Then we apply a bit-packing scheme: since https://github.com/rust-lang/rust/pull/115391 we have unused bits on `DepKind`, so we use those unused bits (currently there are 7!) to store the 2 bits that we need for the byte width of the edges in each node, then use the remaining bits to store the length of the edge list, if it fits.
	r? `@nnethercote`
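The per-node width selection and the "load 4 bytes, then mask" decoding described above can be sketched as below. This is an illustration under stated assumptions, not the rustc implementation: `bytes_needed`, `encode_edges` and `decode_edges` are hypothetical names, indices are plain `u32` values stored little-endian, and the packing of the width into spare `DepKind` bits is left out.

```rust
/// Hypothetical helper: how many bytes (1..=4) are needed to store `max`.
fn bytes_needed(max: u32) -> usize {
    match max {
        0..=0xFF => 1,
        0x100..=0xFFFF => 2,
        0x1_0000..=0xFF_FFFF => 3,
        _ => 4,
    }
}

/// Encode one node's edge list using a single width chosen from its largest
/// index. Returns that width; `width - 1` fits in 2 bits of a node header.
fn encode_edges(edges: &[u32], out: &mut Vec<u8>) -> usize {
    let max = edges.iter().copied().max().unwrap_or(0);
    let width = bytes_needed(max);
    for &edge in edges {
        // Keep only the low `width` bytes of the little-endian representation.
        out.extend_from_slice(&edge.to_le_bytes()[..width]);
    }
    width
}

/// Decode `count` indices by loading 4 bytes for each and masking off the
/// bytes that belong to the next index. `data` must have at least 3 readable
/// padding bytes after the last index.
fn decode_edges(data: &[u8], count: usize, width: usize) -> Vec<u32> {
    let mask = if width == 4 { u32::MAX } else { (1u32 << (8 * width)) - 1 };
    (0..count)
        .map(|i| {
            let start = i * width;
            let mut raw = [0u8; 4];
            raw.copy_from_slice(&data[start..start + 4]);
            u32::from_le_bytes(raw) & mask
        })
        .collect()
}
```

The decode loop has no per-value branching, which is the trade-off the description makes against leb128: one large index widens every index in that node, but each load is a fixed-size read plus a mask.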
2023-09-06	Add comments with the same level of detail as the PR description	Ben Kimock	-12/+54

2023-09-04	Use a specialized varint + bitpacking scheme for DepGraph encoding	Ben Kimock	-44/+340

2023-09-03	Use relative positions inside a SourceFile.	Camille GILLOT	-33/+7

2023-09-01	Use `OnceLock` for `SingleCache`	John Kåre Alsaker	-6/+6

2023-08-30	Don't use `wait_for_query` without the Rayon thread pool	John Kåre Alsaker	-13/+13

2023-08-29	Auto merge of #114894 - Zoxc:sharded-cfg-cleanup2, r=cjgillot	bors	-47/+10
	Remove conditional use of `Sharded` from query state. `Sharded` is already a zero cost abstraction, so it shouldn't affect the performance of the single thread compiler if LLVM does its job. r? `@cjgillot`
2023-08-27	Pass ErrorGuaranteed to cycle error	Michael Goulet	-6/+10

2023-08-25	Fix waiting on a query that panicked	John Kåre Alsaker	-1/+12

2023-08-24	Optimize `lock_shards`	John Kåre Alsaker	-9/+5

2023-08-24	Remove conditional use of `Sharded` from query state	John Kåre Alsaker	-43/+10

2023-08-24	Auto merge of #114860 - Zoxc:sharded-layout, r=SparrowLii	bors	-1/+1
	Make `Sharded` an enum and specialize it for the single thread case.
	This changes `Sharded` into an enum that uses a single shard in the single thread case, reducing the size of `Sharded` for greater cache efficiency.
	Performance improvement with 1 thread and `cfg(parallel_compiler)`:
	Benchmark	Before (time)	After (time)	Change
	🟣 clap:check	1.7009s	1.6748s	💚 -1.53%
	🟣 hyper:check	0.2525s	0.2451s	💚 -2.90%
	🟣 regex:check	0.9519s	0.9353s	💚 -1.74%
	🟣 syn:check	1.5504s	1.5280s	💚 -1.45%
	🟣 syntex_syntax:check	5.9536s	5.8873s	💚 -1.11%
	Total	10.4092s	10.2706s	💚 -1.33%
	Summary	1.0000s	0.9825s	💚 -1.75%
	I did see an unexpected 0.23% change for the serial compiler, so this could use a perf run to see if that reproduces.
	cc `@SparrowLii`
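As a rough illustration of the enum shape described in this commit, the sketch below models a sharded container with a single-shard variant and a many-shard variant. It is an assumption-laden stand-in, not rustc's `Sharded`: `ShardedSketch`, `SHARDS` and `lock_shard_by_hash` are hypothetical names, and `std::sync::Mutex` replaces the compiler's own lock type.

```rust
use std::sync::{Mutex, MutexGuard};

/// Hypothetical shard count for the parallel case.
const SHARDS: usize = 32;

/// Illustrative enum: one lock when single-threaded, many when parallel.
enum ShardedSketch<T> {
    Single(Mutex<T>),
    Shards(Box<[Mutex<T>]>),
}

impl<T> ShardedSketch<T> {
    fn new(parallel: bool, mut make: impl FnMut() -> T) -> Self {
        if parallel {
            Self::Shards((0..SHARDS).map(|_| Mutex::new(make())).collect())
        } else {
            Self::Single(Mutex::new(make()))
        }
    }

    /// Pick a shard and lock it in one step, so the single/parallel
    /// distinction is branched on once per access.
    fn lock_shard_by_hash(&self, hash: u64) -> MutexGuard<'_, T> {
        match self {
            Self::Single(lock) => lock.lock().unwrap(),
            Self::Shards(shards) => {
                let index = (hash as usize) % shards.len();
                shards[index].lock().unwrap()
            }
        }
    }
}
```

In the single-shard variant every hash maps to the same lock, so no shard array needs to be allocated or indexed.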