HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090
github.com/epolewski
discuss
2 years ago
muttled
3 points