HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Request
1.
High performance client for Baseten.co
github.com/basetenlabs
1 comment
a year ago
mich5632
7 points
2.
Show HN: Baseten – Build ML-powered applications
baseten.co
11 comments
4 years ago
philipkiely
112 points
3.
Show HN: ChatLLaMA – A ChatGPT style chatbot for Facebook's LLaMA
chatllama.baseten.co
215 comments
3 years ago
aaronrelph
402 points
4.
Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs
baseten.co
175 comments
10 months ago
philipkiely
247 points
5.
A guide to open-source LLM inference and performance
baseten.co
14 comments
3 years ago
varunshenoy
113 points
6.
DALL-E Mini – Generate images from a text prompt
app.baseten.co
22 comments
4 years ago
tuhins
52 points
7.
How we got Stable Diffusion XL inference to under 2 seconds
baseten.co
5 comments
3 years ago
varunshenoy
51 points
8.
Show HN: Free Stable Diffusion 2.0 hosted interface
app.baseten.co
2 comments
4 years ago
philipkiely
25 points
9.
BaseTen: The fastest way to build ML-powered applications
baseten.co
4 comments
5 years ago
sahillavingia
20 points
10.
Show HN: Fine-tune generative models in 1 line of code
blueprint.baseten.co
discuss
3 years ago
aqader
16 points
11.
Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products
baseten.co
5 comments
2 years ago
mikejulietbravo
9 points
12.
The Math Behind TurboQuant
baseten.co
3 comments
3 months ago
philipkiely
8 points
13.
Hosted Stable Diffusion Demo
app.baseten.co
discuss
4 years ago
philipkiely
7 points
14.
Serving four million Riffusion requests in two days
baseten.co
discuss
4 years ago
philipkiely
5 points
15.
Try it yourself: Speech to text with Whisper
app.baseten.co
discuss
4 years ago
philipkiely
5 points
16.
How BaseTen is using “docs as code”
blog.baseten.co
discuss
4 years ago
philipkiely
5 points
17.
SDXL inference in under 2 seconds
baseten.co
1 comment
3 years ago
tuhins
3 points
18.
How We Built the Fastest Kimi K2.5 on Artificial Analysis
baseten.co
discuss
4 months ago
philipkiely
3 points
19.
Deploying Stable Diffusion in Production Using Truss
baseten.co
discuss
4 years ago
philipkiely
3 points
20.
Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock
baseten.co
1 comment
2 years ago
mikejulietbravo
2 points
21.
Faster Mixtral inference with TensorRT-LLM and quantization
baseten.co
1 comment
2 years ago
tikkun
2 points
22.
Inference Engineering
baseten.com
discuss
4 months ago
simonpure
2 points
23.
Show HN: Inference Engineering
baseten.com
discuss
4 months ago
philipkiely
2 points
24.
How to double tokens per second for Llama 3 with Medusa
baseten.co
discuss
2 years ago
philipkiely
2 points
25.
Show HN: Automatically Build Nvidia TRT-LLM Engines
baseten.co
discuss
2 years ago
mikejulietbravo
2 points
26.
FP8: Efficient model inference with 8-bit floating point numbers
baseten.co
discuss
2 years ago
philipkiely
2 points
27.
Code generation interactive demo (Salesforce Codegen mono 2B)
app.baseten.co
discuss
4 years ago
philipkiely
2 points
28.
Working at an early-stage company as an early-stage engineer
blog.baseten.co
discuss
5 years ago
tuhins
2 points
29.
Inferless Joins Baseten
baseten.co
discuss
4 months ago
agcat
1 point
30.
Continual learning and the post monolith AI era
baseten.co
discuss
4 months ago
jxmorris12
1 points