McIntosh · 2w From google research earlier today on X. … Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero ... ethfi @ethfi 1774423264 No secrets here