http://domain.com?x=1&y=2
and http://domain.com?y=2&x=1
are different URLs but in fact both of them will load the exact same page. Therefore, we wanted to assign them an identical Int64 ID. Thus, by canonizing the URLs before mapping them to Int64, we can convert the URLs to uniform representation.https://domain.com
and http://domain.com
are both canonized to domain.com
because we wanted to show the same rating widget on HTTP and HTTPS equivalent pages.http://my-spa.com/#state1
and http://my-spa.com/#state2
, the URL canonization function has to be modified to support that.bindec()
, decbin()
, base_convert()
. We have found out that 64 bit integers and PHP are not playing well. None of the mentioned functions consistently supports 64 bit. After digging around on Google, we were lead to a post about 32 bit limitations in PHP which included the suggestion to use GMP, a really cool library for multiple precision integers. Using this library, we managed to create this one line hash function that generates a 64 bit integer out of arbitrary length string.