Description
I've just run the binary_trees
benchmark on an ARMv8
, Cortex-A53 processor, having converted an Android TV box to Linux.
I'd found previously, on a much weaker (but more power efficient) armv7
Cortex A5, the results were equal. On the new machine (using the latest official aarch64
rustc nightly) ./binary_trees 23
produces the following results:
sysalloc
1m28s 5m10s 0m10s
jemalloc
1m35s 5m10s 0m53s
which is palpably worse actually, even though Cortex-A53 is a much stronger core.
I'm beginning to think jemalloc
only makes sense on Intel processors with heaps or L1/L2 cache.
More benchmark ideas welcome, though.
added retroactively:
To reproduce, unpack the attachment and run:
cargo build --release && time target/release/binary_trees 23
inside the binary_trees directory. Uncomment the first 2 lines in main.rs to produce a sysalloc version.