Search: github.com/vlm | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

1.

Replace OCR with Vision Language Models

github.com/vlm-run

a year ago

292 points

2.

Run structured extraction on documents/images locally with Ollama and Pydantic

github.com/vlm-run

a year ago

170 points

3.

A Node.js SDK for calling Vision Language Models

github.com/vlm-run

a year ago

6 points

4.

Unified Vision-Language Agents – Detect, Segment, OCR, Generate and More

github.com/vlm-run

7 months ago

5 points

5.

Show HN: Visually parse an entire YouTube video frame by frame

github.com/vlm-run

a year ago

5 points

6.

Vlms-zero-to-hero: readings from the fundamentals to the cutting edge of VLMS

github.com/SkalskiP

a year ago

2 points

7.

Experimental Optical Encoder for Qwen3-VLM-2B-Instruct

github.com/Volkopat

8 months ago

1 points

8.

Asn1c: The Lionet ASN.1 Compiler

8 months ago

1 points

9.

Show HN: r1_vlm – Open-Source Framework for Visual Reasoning with GRPO

github.com/groundlight

a year ago

5 points

10.

Mlx-VLM: Fast Local VLMs and Omni Models on Apple Silicon with MLX

github.com/Blaizzy

3 months ago

2 points

11.

Show HN: I achieved over 10% improvement on 3D vision PointCLIP

github.com/genji970

a year ago

2 points

12.

Show HN: 2500 vision benchmarks / evals for Vision Language Models

github.com/Overshoot-ai

2 months ago

zakariaelhjouji

1 points

13.

Show HN: Vlm in 3D PC, 16 shot scanobjectnn top1 acc: 99.91

github.com/genji970

a year ago

1 points

14.

Show HN: VLMs Can Respond Twice as Fast Without Losing Quality

github.com/sergey-automation

2 days ago

2 points

15.

Super fast and accurate image classification on edge devices

github.com/Paulescu

9 months ago

1 points

16.

Show HN: Benchmarking VLMs vs. Traditional OCR

a year ago

146 points

17.

Show HN: LoongForge-A high-performance training framework for LLM, VLM, VLA, Wan

github.com/baidu-baige

a month ago

10 points

18.

Show HN: Cursed Browser – a VLM reads the HTML and hallucinates the page

github.com/scosman

a month ago

7 points

19.

Cursed_browser: Web browser with a VLM as rendering engine

github.com/scosman

a month ago

4 points

20.

SketchVLM: Letting VLMs draw on images while explaining their reasoning

github.com/Brandon-Collins7

2 months ago

3 points

21.

Show HN: Unsiloed Chunker – VLM powered semantic chunking for RAG

github.com/Unsiloed-AI

a year ago

3 points

22.

LoongForge-A high-performance training framework for LLM, VLM, DIT, VLA models

github.com/baidu-baige

a month ago

2 points

23.

Show HN: Vision AI Checkup, an Optometrist for VLMs

visioncheckup.com

a year ago

2 points

24.

The simplest, fastest repository for training/finetuning small-sized VLMs

github.com/huggingface

a year ago

2 points

25.

Advanced Quantization Algorithm for LLMs/VLMs

github.com/intel

a year ago

2 points

26.

Show HN: LLM / VLM language agent implementations

github.com/arthurcolle

a year ago

2 points

27.

Show HN: A VLM-powered image search engine built with Ruby on Rails

github.com/neonwatty

a year ago

2 points

28.

Show HN: A/B test your own VLMs for document parsing (Self-hosted Arena)

github.com/Bae-ChangHyun

4 months ago

1 points

29.

Show HN: Offline AI Photo Search (local VLM and semantic search)

github.com/Pankaj4152

7 months ago

1 points

30.

"Captions With Attitude" in the browser from local VLM using llama.cpp in Go

github.com/hybridgroup

7 months ago

1 points