Introduce `ByteSymbol`.

It's like `Symbol` but for byte strings. The interner is now used for both `Symbol` and `ByteSymbol`. E.g. if you intern `"dog"` and `b"dog"` you'll get a `Symbol` and a `ByteSymbol` with the same index and the characters will only be stored once. The motivation for this is to eliminate the `Arc`s in `ast::LitKind`, to make `ast::LitKind` impl `Copy`, and to avoid the need to arena-allocate `ast::LitKind` in HIR. The latter change reduces peak memory by a non-trivial amount on literal-heavy benchmarks such as `deep-vector` and `tuple-stress`. `Encoder`, `Decoder`, `SpanEncoder`, and `SpanDecoder` all get some changes so that they can handle normal strings and byte strings. This change does slow down compilation of programs that use `include_bytes!` on large files, because the contents of those files are now interned (hashed). This makes `include_bytes!` more similar to `include_str!`, though `include_bytes!` contents still aren't escaped, and hashing is still much cheaper than escaping.
author: Nicholas Nethercote <n.nethercote@gmail.com> 2025-06-02 08:59:29 +1000
committer: Nicholas Nethercote <n.nethercote@gmail.com> 2025-06-30 20:42:27 +1000
commit: 478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe (patch)
tree: 4d8f19b4e4e440ed8a22ee809ce2a565707d4c27 /compiler/rustc_ast_pretty
parent: ed2d759783dc9de134bbb3f01085b1e6dbf539f3 (diff)
download: rust-478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe.tar.gz
rust-478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe.zip
1 files changed, 6 insertions, 2 deletions
diff --git a/compiler/rustc_ast_pretty/src/pprust/state/expr.rs b/compiler/rustc_ast_pretty/src/pprust/state/expr.rs
index 7651e8365a2..8a2cb64b2a0 100644
--- a/compiler/rustc_ast_pretty/src/pprust/state/expr.rs
+++ b/compiler/rustc_ast_pretty/src/pprust/state/expr.rs
@@ -469,8 +469,12 @@ impl<'a> State<'a> {
             ast::ExprKind::Lit(token_lit) => {
                 self.print_token_literal(*token_lit, expr.span);
             }
-            ast::ExprKind::IncludedBytes(bytes) => {
-                let lit = token::Lit::new(token::ByteStr, escape_byte_str_symbol(bytes), None);
+            ast::ExprKind::IncludedBytes(byte_sym) => {
+                let lit = token::Lit::new(
+                    token::ByteStr,
+                    escape_byte_str_symbol(byte_sym.as_byte_str()),
+                    None,
+                );
                 self.print_token_literal(lit, expr.span)
             }
             ast::ExprKind::Cast(expr, ty) => {
author	Nicholas Nethercote <n.nethercote@gmail.com>	2025-06-02 08:59:29 +1000
committer	Nicholas Nethercote <n.nethercote@gmail.com>	2025-06-30 20:42:27 +1000
commit	478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe (patch)
tree	4d8f19b4e4e440ed8a22ee809ce2a565707d4c27 /compiler/rustc_ast_pretty
parent	ed2d759783dc9de134bbb3f01085b1e6dbf539f3 (diff)
download	rust-478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe.tar.gz rust-478f8287c0e2c35cda511fd3ac01b7ac78ee7cfe.zip