Hacker News

TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B

3 points by trykhlieb ago | 0 comments

[-]