M5 Max is the biggest AI performance jump we have seen on Apple Silicon. Our latest release pushes real-world performance further with Metal Quantized Attention and fused Int8 matrix multiplication.
Can you add M4 Max results ? Thanks
Can you add M4 Max results ? Thanks