diff options
| author | Ruud van Asseldonk <dev@veniogames.com> | 2016-11-02 22:49:27 +0100 |
|---|---|---|
| committer | Ruud van Asseldonk <dev@veniogames.com> | 2016-11-30 11:09:29 +0100 |
| commit | cd7fade0a9c1c8762d2fba7c65c1b82e8d369711 (patch) | |
| tree | 946873bd4b027e019a115f2f1359ecd64ac6502e /src/libstd | |
| parent | 8e373b47872872a2ce61c5b02f4dd96d90d046ee (diff) | |
| download | rust-cd7fade0a9c1c8762d2fba7c65c1b82e8d369711.tar.gz rust-cd7fade0a9c1c8762d2fba7c65c1b82e8d369711.zip | |
Add small-copy optimization for io::Cursor
During benchmarking, I found that one of my programs spent between 5 and 10 percent of the time doing memmoves. Ultimately I tracked these down to single-byte slices being copied with a memcopy in io::Cursor::read(). Doing a manual copy if only one byte is requested can speed things up significantly. For my program, this reduced the running time by 20%. Why special-case only a single byte, and not a "small" slice in general? I tried doing this for slices of at most 64 bytes and of at most 8 bytes. In both cases my test program was significantly slower.
Diffstat (limited to 'src/libstd')
| -rw-r--r-- | src/libstd/io/cursor.rs | 18 |
1 files changed, 15 insertions, 3 deletions
diff --git a/src/libstd/io/cursor.rs b/src/libstd/io/cursor.rs index 1b5023380a7..9b50168a954 100644 --- a/src/libstd/io/cursor.rs +++ b/src/libstd/io/cursor.rs @@ -219,9 +219,21 @@ impl<T> io::Seek for Cursor<T> where T: AsRef<[u8]> { #[stable(feature = "rust1", since = "1.0.0")] impl<T> Read for Cursor<T> where T: AsRef<[u8]> { fn read(&mut self, buf: &mut [u8]) -> io::Result<usize> { - let n = Read::read(&mut self.fill_buf()?, buf)?; - self.pos += n as u64; - Ok(n) + // First check if the amount of bytes we want to read is small: the read + // in the else branch will end up calling `<&[u8] as Read>::read()`, + // which will copy the buffer using a memcopy. If we only want to read a + // single byte, then the overhead of the function call is significant. + let num_read = { + let mut inner_buf = self.fill_buf()?; + if buf.len() == 1 && inner_buf.len() > 0 { + buf[0] = inner_buf[0]; + 1 + } else { + Read::read(&mut inner_buf, buf)? + } + }; + self.pos += num_read as u64; + Ok(num_read) } } |
