Show HN: FlashQwen – A from-scratch CUDA inference engine for Qwen3

(github.com)

5 points | by langtang1996 a day ago ago

No comments yet.