HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Request
181.
Waveglow Inference in CUDA C++
github.com/Saurabh-29
10 comments
7 years ago
Saurabh_29
72 points
182.
HuggingFace text-generation-inference is reverting to Apache 2.0 License
github.com/huggingface
1 comment
2 years ago
Thicken2320
19 points
183.
HuggingFace Text Generation Library License Changed from Apache 2 to HFOIL
github.com/huggingface
3 comments
3 years ago
bratao
6 points
184.
New open-source model with 8k context runs on CPU, outperforms GPT-3
github.com/abacaj
1 comment
3 years ago
sheepscreek
5 points
185.
Accelerating LLM Serving with Speculative Inference and Token Tree Verification
github.com/flexflow
1 comment
3 years ago
zhihaojia
3 points
186.
Hugging Face reverts the license back to Apache 2.0
github.com/huggingface
discuss
2 years ago
vmatsiiako
3 points
187.
Fast inference for text models using Rust
github.com/huggingface
discuss
3 years ago
l-m-z
3 points
188.
MPT 30B inference code using CPU
github.com/abacaj
discuss
3 years ago
djha-skin
3 points
189.
Text to Speech CUDA Programming
github.com/Saurabh-29
discuss
7 years ago
Saurabh_29
3 points
190.
Bayesian inference and forecast of Covid-19 in Germany by a Max-Planck-Institute
github.com/Priesemann-Group
3 comments
6 years ago
freemint
2 points
191.
Diffbot GraphRAG LLM
github.com/diffbot
1 comment
a year ago
miket
2 points
192.
GPT4ALL Python3 Local LLM Conversation Recorder
github.com/13alvone
1 comment
3 years ago
13alvone
2 points
193.
Show HN: BERT NLP inference in browser using WebAssembly-SIMD
github.com/jobergum
discuss
4 years ago
jkb79
2 points
194.
Private Decentralized Inference on Consumer Hardware [pdf]
github.com/Layr-Labs
1 comment
2 months ago
doener
1 point
195.
Open Source Stable Diffusion with LCM-LoRA
github.com/joshfischer1108
1 comment
3 years ago
joshfischer1108
1 point
196.
Private decentralized inference on consumer hardware [pdf]
github.com/Layr-Labs
discuss
2 months ago
andsoitis
1 point
197.
VGGT PyTorch Inference
github.com/ibaiGorordo
discuss
a year ago
Tycho87
1 point
198.
Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model
github.com/cactus-compute
211 comments
a month ago
HenryNdubuaku
776 points
199.
Launch HN: Hyprnote (YC S25) – An open-source AI meeting notetaker
180 comments
a year ago
yujonglee
270 points
200.
Show HN: Fastify's slow startup is an AJV problem – here's a drop-in fix
discuss
3 months ago
greatvenerable
2 points
201.
Finished a project mixing GNNs, RL, and operations research
github.com/MehdiZouitine
1 comment
a year ago
Md_Zouzou
1 point
202.
Show HN: I built an Image Embedding API inspired by text-embedding-inference
github.com/bernardo-sb
discuss
a year ago
bernardo-sb
1 point
203.
Show HN: ImageEmbeddingInference – like text-embeddings-inference but for images
github.com/bernardo-sb
discuss
a year ago
bernardo-sb
1 point
204.
Show HN: Sightline – Shodan-style search for real-world infra using OSM Data
github.com/ni5arga
1 comment
5 months ago
ni5arga
26 points
205.
Ask HN: Are you saving inference costs on GPUs at your company
1 comment
a year ago
idomi
5 points
206.
Show HN: Revibing nanochat's inference model in C++ with ggml
github.com/k-ye
discuss
5 months ago
makechan
5 points
207.
Show HN: Letting an LLM write robot programs
boesch.dev
discuss
2 months ago
encrux
3 points
208.
Show HN: Portaltext, grand strategy-style recursive tooltips for the web
portaltext.com
discuss
6 days ago
alaskahoffman
2 points
209.
Show HN: MLX-Ruby – Ruby Bindings for Apple's MLX ML Framework
github.com/skryl
1 comment
4 months ago
skryl
1 point
210.
Show HN: ReFlow Studio – An offline tool to dub, translate, and censor videos
github.com/ananta-sj
discuss
5 months ago
linearAmend
1 point