Auto merge of #16350 - roife:neon-support-for-line-index, r=Veykril - rust

author	bors <bors@rust-lang.org>	2024-01-16 09:12:10 +0000
committer	bors <bors@rust-lang.org>	2024-01-16 09:12:10 +0000
commit	18abb126a384dd295d08e1f365a6996672f65674 (patch)
tree	b49f99c2d9996c633dba0127997f420d7515fa83 /tests/codegen/src-hash-algorithm
parent	5df53c96127095d75c0724f545ccd280dc0fafa9 (diff)
parent	df538288e09a45483a56a9af99fb7fd5333d53d8 (diff)
download	rust-18abb126a384dd295d08e1f365a6996672f65674.tar.gz rust-18abb126a384dd295d08e1f365a6996672f65674.zip

Auto merge of #16350 - roife:neon-support-for-line-index, r=Veykril

internal: Speedup line index calculation via NEON for aarch64

This commit provides SIMD acceleration (via NEON) for the `line-index` library on the aarch64 architecture, which improves performance for Apple Silicon users (and potentially for users of future aarch64-based chips).

The algorithm follows the same process as the original SSE2 implementation. Most SSE2 vector instructions have direct counterparts in NEON. The one exception is `_mm_movemask_epi8`, which has no NEON equivalent. To work around this, I followed the approach described in the article at https://community.arm.com/arm-community-blogs/b/infrastructure-solutions-blog/posts/porting-x86-vector-bitmask-optimizations-to-arm-neon.
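
For illustration, the movemask workaround can be sketched roughly as below. This is not the code from this commit: the function names `newline_nibble_mask` and `newline_offsets` are hypothetical, and searching for `\n` bytes is an assumed stand-in for whatever byte classes `line-index` actually scans for. The idea taken from the linked article is to narrow the 128-bit comparison result with a shift-right-and-narrow (`vshrn_n_u16` by 4), giving a 64-bit value with one nibble per input byte instead of the one bit per byte that `_mm_movemask_epi8` produces. A scalar fallback with the same output is included so the sketch also builds on other architectures.

```rust
/// Builds a 64-bit mask with one nibble (4 bits) per input byte:
/// nibble `i` is 0xF when `chunk[i] == b'\n'`, and 0x0 otherwise.
#[cfg(target_arch = "aarch64")]
fn newline_nibble_mask(chunk: &[u8; 16]) -> u64 {
    use std::arch::aarch64::*;
    // SAFETY: `chunk` is a valid 16-byte buffer, and this sketch assumes
    // NEON is available, as it is on standard aarch64 Rust targets.
    unsafe {
        let bytes = vld1q_u8(chunk.as_ptr());
        // Each lane becomes 0xFF on a match and 0x00 otherwise.
        let eq = vceqq_u8(bytes, vdupq_n_u8(b'\n'));
        // View as 8 x u16, shift right by 4, narrow to 8 x u8:
        // every input byte contributes exactly one nibble.
        let narrowed = vshrn_n_u16::<4>(vreinterpretq_u16_u8(eq));
        vget_lane_u64::<0>(vreinterpret_u64_u8(narrowed))
    }
}

/// Scalar fallback producing the same nibble mask on other targets.
#[cfg(not(target_arch = "aarch64"))]
fn newline_nibble_mask(chunk: &[u8; 16]) -> u64 {
    let mut mask = 0u64;
    for (i, &b) in chunk.iter().enumerate() {
        if b == b'\n' {
            mask |= 0xFu64 << (4 * i);
        }
    }
    mask
}

/// Decodes the nibble mask into byte offsets within the chunk.
fn newline_offsets(chunk: &[u8; 16]) -> Vec<usize> {
    let mut mask = newline_nibble_mask(chunk);
    let mut offsets = Vec::new();
    while mask != 0 {
        // Four mask bits per byte, so divide the bit position by 4.
        let i = (mask.trailing_zeros() / 4) as usize;
        offsets.push(i);
        // Clear the whole nibble before looking for the next match.
        mask &= !(0xFu64 << (4 * i));
    }
    offsets
}
```

With this layout, a chunk whose bytes 3 and 15 are newlines yields a mask with nibbles 3 and 15 set, and `newline_offsets` returns those two indices. The only difference from decoding an SSE2 movemask is the division by 4 when turning a bit position into a byte offset.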

Diffstat (limited to 'tests/codegen/src-hash-algorithm')

0 files changed, 0 insertions, 0 deletions

