Detect `NulInCStr` error earlier.

By making it an `EscapeError` instead of a `LitError`. This makes it like the other errors produced when checking string literals contents, e.g. for invalid escape sequences or bare CR chars. NOTE: this means these errors are issued earlier, before expansion, which changes behaviour. It will be possible to move the check back to the later point if desired. If that happens, it's likely that all the string literal contents checks will be delayed together. One nice thing about this: the old approach had some code in `report_lit_error` to calculate the span of the nul char from a range. This code used a hardwired `+2` to account for the `c"` at the start of a C string literal, but this should have changed to a `+3` for raw C string literals to account for the `cr"`, which meant that the caret in `cr"` nul error messages was one short of where it should have been. The new approach doesn't need any of this and avoids the off-by-one error.
author: Nicholas Nethercote <n.nethercote@gmail.com> 2023-12-07 09:53:08 +1100
committer: Nicholas Nethercote <n.nethercote@gmail.com> 2024-01-12 16:19:37 +1100
commit: 9018d2c455df78d3f2900b4ced3ed63962e4f11e (patch)
tree: 1f54c57b0d1c6c95d07cd08b54b4fd5d14ef989e /compiler/rustc_lexer/src
parent: 62d7ed4a6775c4490e493093ca98ef7c215b835b (diff)
download: rust-9018d2c455df78d3f2900b4ced3ed63962e4f11e.tar.gz
rust-9018d2c455df78d3f2900b4ced3ed63962e4f11e.zip
1 files changed, 15 insertions, 2 deletions
diff --git a/compiler/rustc_lexer/src/unescape.rs b/compiler/rustc_lexer/src/unescape.rs
index abec12f52a6..0a632c4d12a 100644
--- a/compiler/rustc_lexer/src/unescape.rs
+++ b/compiler/rustc_lexer/src/unescape.rs
@@ -59,6 +59,9 @@ pub enum EscapeError {
     /// Non-ascii character in byte literal, byte string literal, or raw byte string literal.
     NonAsciiCharInByte,
 
+    // `\0` in a C string literal.
+    NulInCStr,
+
     /// After a line ending with '\', the next line contains whitespace
     /// characters that are not skipped.
     UnskippedWhitespaceWarning,
@@ -122,10 +125,20 @@ where
 {
     match mode {
         CStr => {
-            unescape_non_raw_common(src, mode, callback);
+            unescape_non_raw_common(src, mode, &mut |r, mut result| {
+                if let Ok(CStrUnit::Byte(0) | CStrUnit::Char('\0')) = result {
+                    result = Err(EscapeError::NulInCStr);
+                }
+                callback(r, result)
+            });
         }
         RawCStr => {
-            check_raw_common(src, mode, &mut |r, result| callback(r, result.map(CStrUnit::Char)));
+            check_raw_common(src, mode, &mut |r, mut result| {
+                if let Ok('\0') = result {
+                    result = Err(EscapeError::NulInCStr);
+                }
+                callback(r, result.map(CStrUnit::Char))
+            });
         }
         Char | Byte | Str | RawStr | ByteStr | RawByteStr => unreachable!(),
     }
author	Nicholas Nethercote <n.nethercote@gmail.com>	2023-12-07 09:53:08 +1100
committer	Nicholas Nethercote <n.nethercote@gmail.com>	2024-01-12 16:19:37 +1100
commit	9018d2c455df78d3f2900b4ced3ed63962e4f11e (patch)
tree	1f54c57b0d1c6c95d07cd08b54b4fd5d14ef989e /compiler/rustc_lexer/src
parent	62d7ed4a6775c4490e493093ca98ef7c215b835b (diff)
download	rust-9018d2c455df78d3f2900b4ced3ed63962e4f11e.tar.gz rust-9018d2c455df78d3f2900b4ced3ed63962e4f11e.zip