HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
511.
Nvidia releases 8B model with learned 8x KV cache compression
huggingface.co
4 comments
5 months ago
alecco
9 points
512.
Nvidia releases weights for Llama-3.1-Nemotron-70B-Instruct
huggingface.co
3 comments
2 years ago
rvnx
9 points
513.
Stable Diffusion XL Inpainting model released
huggingface.co
2 comments
3 years ago
GaggiX
9 points
514.
Opentensor and Cerebras announce BTLM-3B-8K, a leading 3B param. language model
huggingface.co
2 comments
3 years ago
cs-fan-101
9 points
515.
Spaces ZeroGPU: Dynamic GPU Allocation for Spaces
huggingface.co
1 comment
2 years ago
9woc
9 points
516.
Perspectives for first principles prompt engineering
huggingface.co
1 comment
2 years ago
ororm
9 points
517.
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models
huggingface.co
1 comment
2 years ago
UUjiasuqi
9 points
518.
Argilla released Notux 8x7B - DPO fine-tune of Mixtral 8x7B
huggingface.co
1 comment
2 years ago
alvarobartt
9 points
519.
LLM Arena. Mistral-small best open model. Gemini Pro beaten by 2 open models
huggingface.co
1 comment
3 years ago
Palmik
9 points
520.
Meta-llama (Meta Llama 2)
huggingface.co
1 comment
3 years ago
gavi
9 points
521.
Summary of the Tokenizers
huggingface.co
1 comment
3 years ago
O__________O
9 points
522.
Show HN: Sentiment Analysis on Encrypted Data with Homomorphic Encryption
huggingface.co
1 comment
4 years ago
zacchj
9 points
523.
RunwayML fine tuned Stable Diffusion 1.5 model
huggingface.co
1 comment
4 years ago
senthilnayagam
9 points
524.
DeepSeek: Thinking with Visual Primitives [pdf]
huggingface.co
discuss
2 months ago
krackers
9 points
525.
The Smol Training Playbook: The Secrets to Building World-Class LLMs
huggingface.co
discuss
8 months ago
wanderingmind
9 points
526.
Show HN: A Transformer model that preserves logical equivalence
huggingface.co
discuss
a year ago
snowkylin
9 points
527.
Mistral-Large-Instruct-2411 – advanced dense Large Language Model (LLM) 123B
huggingface.co
discuss
2 years ago
maremmano
9 points
528.
MIT Researchers Unveil New Method to Improve LLM Inference Performance
huggingface.co
discuss
2 years ago
tabudata
9 points
529.
Aryn/deformable-detr-DocLayNet – open-source Layout Model
huggingface.co
discuss
2 years ago
skeptrune
9 points
530.
AIMO (AI Math Olympiad) progress prize winning solution
huggingface.co
discuss
2 years ago
kashifr
9 points
531.
Mistral-7B-v0.3 released on HuggingFace
huggingface.co
discuss
2 years ago
cuuupid
9 points
532.
Microsoft Phi-3 3.8B model with 128k Context
huggingface.co
discuss
2 years ago
nitinreddy88
9 points
533.
The Stack v2: a 3B files in 600 programming languages dataset
huggingface.co
discuss
2 years ago
victormustar
9 points
534.
DeepSeek-Prover-V2-671B
huggingface.co
3 comments
a year ago
dvrp
8 points
535.
NousResearch/Nous-Hermes-2-Llama-2-70B
huggingface.co
2 comments
2 years ago
tosh
8 points
536.
Gradio-Lite: Serverless Gradio Running in the Browser
huggingface.co
2 comments
3 years ago
whitphx
8 points
537.
Show HN: Parley: The RPG where you Negotiate with Bandits
huggingface.co
2 comments
3 years ago
upwardbound
8 points
538.
GLM-5.2
huggingface.co
1 comment
8 days ago
gomizhuce
8 points
539.
Reka Edge – 7B fast, efficient VLM (open-weights)
huggingface.co
1 comment
3 months ago
kwajiehao
8 points
540.
Z-Image Turbo Released – 6B Parameter Text to Image Model
huggingface.co
1 comment
7 months ago
rossriley
8 points