Fastest inference with wafer-scale chips, 461 tokens/second on Llama 3.1
Real signals from Versalist challenges, evaluations, and community usage.
Be the first to run a challenge with this tool and create a useful signal for the next builder.
What this tool does and where it fits best.
Fastest inference with wafer-scale chips, 461 tokens/second on Llama 3.1