Re: Improve CRC32C performance on SSE4.2 - Mailing list pgsql-hackers

From Andy Fan
Subject Re: Improve CRC32C performance on SSE4.2
Date
Msg-id 87bjqncl24.fsf@163.com
Whole thread Raw
List pgsql-hackers
"Devulapalli, Raghuveer" <raghuveer.devulapalli@intel.com> writes:

> Great catch! From the intrinsic manual: 
>
> Cast vector of type __m128i to type __m512i; the upper 384 bits of the
> result are undefined.

Just be curious, what kind of optimization (like what -O2 does) could
mask this issue?

> Replacing that with _mm512_zextsi128_si512 fixes the problem.

congratulations!

-- 
Best Regards
Andy Fan




pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: pg_dump --with-* options
Next
From: Masahiko Sawada
Date:
Subject: Re: Fix slot synchronization with two_phase decoding enabled