Skip to content

BLAKE2b NEON suffers poor performance on ARMv8/Aarch64 with Cortex-A57 #367

@noloader

Description

@noloader

Here's an ugly result for Blake2 testing with Crypto++ and Botan on ARMv8/Aarch64 with Cortex-A57. Cortex-A53 is OK, meaning it does not slow down. A53 runs at about the same speed for both CXX and NEON.

A57, Crypto++ (3 second benchmark):

  • CXX implementation: 5.7 cpb
  • NEON implementation: 12.6 cpb

A57, Botan (speed test, 3000 ms):

  • CXX implementation: 315.197 MiB/sec (945.594 MiB in 3000.008 ms)
  • NEON implementation: 148.028 MiB/sec (444.086 MiB in 3000.014 ms)

The astute reader will realize those numbers should be inverted :(

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions