Hacker News
TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B
3 points by trykhlieb ago
|
0 comments
[-]