HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
301.
▲
Show HN: Validated Table Extractor–Verify PDF Tables Using Docling+Vision LLMs
github.com/2dogsandanerd
discuss
7 months ago
2dogsanerd
3 points
302.
▲
WasmVision – computer vision using WebAssembly now with MCP and GPU support
github.com/wasmvision
discuss
a year ago
deadprogram
3 points
303.
▲
MiniCPM-O 2.6, GPT-4o Level MLLM for Vision, Speech and Multimodal on Your Phone
github.com/OpenBMB
discuss
a year ago
rvnx
3 points
304.
▲
Show HN: X.infer-Framework agnostic computer vision inference
github.com/dnth
discuss
2 years ago
dnth
3 points
305.
▲
A Minimalist Implementation of Vision Transformers in Tinygrad
github.com/EthanBnntt
discuss
2 years ago
ethanbnntt
3 points
306.
▲
Show HN: Parse Vision, open source tool to visualise OCR output
github.com/orasik
discuss
2 years ago
Oras
3 points
307.
▲
Show HN: Flaim – pre-trained vision backbones for Flax
github.com/BobMcDear
discuss
2 years ago
bornaahz
3 points
308.
▲
Show HN: Comparing various contrastive losses on text and vision embeddings
github.com/viig99
discuss
2 years ago
kartoolOz
3 points
309.
▲
Open-Source Evaluation and Testing Framework for Computer Vision Models
discuss
2 years ago
iamheinrich
3 points
310.
▲
Close-Circuit Telegram Vision Location Tracking with Telegram API Integration
github.com/IvanGlinkin
discuss
2 years ago
andrewstuart
3 points
311.
▲
PinchBar: The TouchBar in the Vision Pro
github.com/zac
discuss
2 years ago
goranmoomin
3 points
312.
▲
Open Source bicyclist warning system with computer vision and mmWave Radar
github.com/burningion
discuss
2 years ago
burningion
3 points
313.
▲
Ensemble: Separate Screens for Different Mac Apps in the Vision Pro
github.com/saagarjha
discuss
2 years ago
sashank_1509
3 points
314.
▲
Keyboard and Trackpad for Apple Vision Pro
gist.github.com
discuss
2 years ago
qzervaas
3 points
315.
▲
Show HN: First ever reliable browser automation by GPT-4-Vision
github.com/vignshwarar
discuss
3 years ago
vignesh_warar
3 points
316.
▲
We tried injecting hallucinogenics into vision models
github.com/encord-team
discuss
3 years ago
ulrikhansen54
3 points
317.
▲
LLaVA: Visual Instruction Tuning: Large Language-and-Vision Assistant
github.com/haotian-liu
discuss
3 years ago
tosh
3 points
318.
▲
Bpycv: Computer Vision and Deep Learning Utils for Blender
github.com/DIYer22
discuss
3 years ago
f_devd
3 points
319.
▲
Turbo: An experimental text editor based on Scintilla and Turbo Vision
github.com/magiblot
discuss
3 years ago
myth_drannon
3 points
320.
▲
Lavis – A Library for Language-Vision Intelligence
github.com/salesforce
discuss
4 years ago
madmax108
3 points
321.
▲
Show HN: Blender Python 3D simulator for computer vision and inverse render AI
github.com/3cology
discuss
4 years ago
legel
3 points
322.
▲
How Do Vision Transformers Work?
github.com/xxxnell
discuss
4 years ago
lnyan
3 points
323.
▲
Mpv adds support for Dolby Vision playback
github.com/mpv-player
discuss
4 years ago
WithinReason
3 points
324.
▲
Show HN: Computer vision Blackjack basic strategy web app (with Tensorflow.js)
github.com/roboflow-ai
discuss
5 years ago
yeldarb
3 points
325.
▲
Computer Vision Project: Fingerprint Minutiae Feature Extraction
github.com/Utkarsh-Deshmukh
discuss
5 years ago
d_utkarsh
3 points
326.
▲
Open Vision API: open-source computer vision API based on open source models
github.com
discuss
5 years ago
pythops
3 points
327.
▲
Caer – A GPU-accelerated Computer Vision library (faster than Torchvision)
github.com/jasmcaus
discuss
5 years ago
jasmcaus
3 points
328.
▲
3D Computer Vision in Julia
github.com/nirmal-suthar
discuss
6 years ago
robomaster12
3 points
329.
▲
Best Practices, code samples, and documentation for Computer Vision
github.com/microsoft
discuss
6 years ago
rubenbe
3 points
330.
▲
Pythia: A modular framework for vision and language
github.com/facebookresearch
discuss
6 years ago
theBashShell
3 points
More