I asked Claude to help me multiply matrices in Python. Then go faster. And faster. 11 implementations later, it got 15x faster! Cost of auto-optimizing latency for common tasks has basically gone to 0 ...