Description
Reverting commit 88e7b8b restores the previous performance.
The regression is reproducible on AMD EPYC 9684X: run time increases from 36.8 seconds to 42.7 seconds.
The following lines of code became more than twice as slow:
deviatoric_stress_tensor(:,:) = stress_tensor(:,:)
deviatoric_stress_tensor(1,1) = stress_tensor(1,1) - hydrostatic_stress
deviatoric_stress_tensor(2,2) = stress_tensor(2,2) - hydrostatic_stress
deviatoric_stress_tensor(3,3) = stress_tensor(3,3) - hydrostatic_stress
!
damaged_dev_stress_tensor(:,:) = deviatoric_stress_tensor(:,:)/(1.0_LONGreal - &
    crack_parameter * damage)
!
! compute the yield function
!
equivalent_stress = sqrt(1.5_LONGreal * sum((damaged_dev_stress_tensor(:,:) &
    - back_stress_tensor(:,:))**2))
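For experimenting with these statements outside the full benchmark, a minimal standalone sketch might look like the following. It is not taken from the original source: the program name, the definition of `LONGreal` as a double-precision kind, the 3x3 array shapes, the input values, and the way `hydrostatic_stress` is computed (mean of the diagonal) are all assumptions made for illustration. Because vectorization decisions depend on the surrounding code, this sketch may not reproduce the slowdown by itself.

```fortran
! Hypothetical standalone kernel; only the statements in the middle
! section mirror the lines quoted in this report.
program slp_kernel_sketch
    implicit none
    ! Assumption: LONGreal is a double-precision kind.
    integer, parameter :: LONGreal = selected_real_kind(15, 90)
    real(kind=LONGreal) :: stress_tensor(3,3)
    real(kind=LONGreal) :: deviatoric_stress_tensor(3,3)
    real(kind=LONGreal) :: damaged_dev_stress_tensor(3,3)
    real(kind=LONGreal) :: back_stress_tensor(3,3)
    real(kind=LONGreal) :: hydrostatic_stress, crack_parameter, damage
    real(kind=LONGreal) :: equivalent_stress
    integer :: i

    ! Arbitrary illustrative inputs (assumptions, not benchmark data).
    stress_tensor = reshape([ (real(i, LONGreal), i = 1, 9) ], [3, 3])
    back_stress_tensor = 0.5_LONGreal
    crack_parameter = 0.1_LONGreal
    damage = 0.2_LONGreal
    ! Assumption: hydrostatic stress is the mean of the diagonal entries.
    hydrostatic_stress = (stress_tensor(1,1) + stress_tensor(2,2) &
                          + stress_tensor(3,3)) / 3.0_LONGreal

    ! Statements mirroring the affected lines.
    deviatoric_stress_tensor(:,:) = stress_tensor(:,:)
    deviatoric_stress_tensor(1,1) = stress_tensor(1,1) - hydrostatic_stress
    deviatoric_stress_tensor(2,2) = stress_tensor(2,2) - hydrostatic_stress
    deviatoric_stress_tensor(3,3) = stress_tensor(3,3) - hydrostatic_stress

    damaged_dev_stress_tensor(:,:) = deviatoric_stress_tensor(:,:) / &
        (1.0_LONGreal - crack_parameter * damage)

    ! Compute the yield function.
    equivalent_stress = sqrt(1.5_LONGreal * &
        sum((damaged_dev_stress_tensor(:,:) - back_stress_tensor(:,:))**2))

    ! Print the result so the computation is not optimized away.
    print *, equivalent_stress
end program slp_kernel_sketch
```

The inputs are hard-coded constants here, so an optimizing compiler may fold much of the computation at compile time. Reading them at run time or passing them into a separate subroutine would likely be needed before inspecting the SLP vectorizer output.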
default.log.gz - before/after SLP with the current Flang compiler
revert.log.gz - before/after SLP with the reverted commit
Reproducer command: opt -mcpu=znver4 -print-after=slp-vectorizer --passes=slp-vectorizer