f82() - A Hash Function with strong theoretical Properties
April 2, 2025
f82() #
f82()
is hash function which hashes (or maps) a unt64_t
8 byte unsigned integer key to a uint16_t
2
byte hash value. It belongs to the class of tabulation hashing functions and is perfect for
implementing lock-less hash tables and other data structures that require an uniform and perfect
distribution.
The Implementation #
The implementation is straightforward, most importantly: The random material needs to be from a high quality random number generator (in our case we retrieved it from the Quantis QRNG device).
|
|
4-Independency? #
Despite its simplicity, tabulation hashing has strong theoretical properties that distinguish it from some other hash functions. In particular, it is 3-independent: every 3-tuple of keys is equally i likely to be mapped to any 3-tuple of hash values. However, it is not 4-independent.
What does this actually mean?
As Pătraşcu & Thorup (2012) observe, tabulation hashing is 3-independent but not 4-independent. For any single key x, T[x0,0] is equally likely to take on any hash value, and the exclusive or of T[x0,0] with the remaining table values does not change this property. For any two keys x and y, x is equally likely to be mapped to any hash value as before, and there is at least one position i where xi ≠ yi; the table value T[yi,i] is used in the calculation of h(y) but not in the calculation of h(x), so even after the value of h(x) has been determined, h(y) is equally likely to be any valid hash value. Similarly, for any three keys x, y, and z, at least one of the three keys has a position i where its value zi differs from the other two, so that even after the values of h(x) and h(y) are determined, h(z) is equally likely to be any valid hash value.
This RPCL example dialog shows this dependency between 0xaaaa
, 0xaabb
, 0xbbaa
and 0xbbbb
(XORing
the 4 results is always 0 - independent of the random material in place):
$ rpclsh
ok ".f82" @ .
returns f82() result in hex ( u64x - result )
ok ".f82-4" @ .
returns XOR of 4 f82() results in hex ( h0 h1 h2 h3 - result )
ok aaaa .f82 .
6BE6
ok aabb .f82 .
E7F
ok bbaa .f82 .
644
ok bbbb .f82 .
63DD
ok aaaa aabb bbaa bbbb .f82-4 .
0
ok
For the mentioned and intended purposes this lack of 4-independency is neglectable.