The UniProt Reference Clusters (UniRef) are three separate datasets, UniRef100, UniRef90, and UniRef50, that compress sequence space at different resolutions.