>8 token/s DeepSeek R1 671B Q4_K_M with 1~2 Arc A770 on Xeon
https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/llamacpp_portable_zip_gpu_quickstart.md
#HackerNews #DeepSeek #ArcA770 #Xeon #Tokenization #LLM #GitHub