HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
361.
▲
Show HN: Convert any design into HTML code using GPT4 vision
github.com/mostafasadeghi97
1 comment
3 years ago
_mostafa97
2 points
362.
▲
INT-FP-QSim: Simulating LLMs and Vision Transformers
github.com/lightmatter-ai
1 comment
3 years ago
IllustriousSir
2 points
363.
▲
Train vision transformers using Google images
github.com/nateraw
1 comment
5 years ago
italosayan
2 points
364.
▲
Show HN: Slideo – Synchronize Slides with Video Using Computer Vision (OpenCV)
github.com/hediet
1 comment
5 years ago
Gehinnn
2 points
365.
▲
pip3 install videoflow - New library for computer vision on videos
1 comment
7 years ago
jadielam
2 points
366.
▲
JavaScript Computer Vision library.
inspirit.github.com
discuss
13 years ago
divy
2 points
367.
▲
Show HN: SoMatic – Vision-based OS automation framework for AI agents
github.com/Smyan1909
discuss
a month ago
smyansondur
2 points
368.
▲
Show HN: Neuroscope – Real-time “x-ray vision” into LLMs’ minds
github.com/cjroth
discuss
3 months ago
rothific
2 points
369.
▲
Alibaba releases open-source vision model for native layered image editing
github.com/QwenLM
discuss
6 months ago
bakigul
2 points
370.
▲
Yzma – local Vision Language Models/LLMs in Go using llama.cpp without CGo
github.com/hybridgroup
discuss
9 months ago
deadprogram
2 points
371.
▲
Show HN: Magnitude MCP – vision-first browser interaction for Claude Code
github.com/sagekit
discuss
9 months ago
anerli
2 points
372.
▲
Show HN: Demo of AI-enabled voice/vision features on open source hardware [video]
youtube.com
discuss
9 months ago
mmajzoobi
2 points
373.
▲
Show HN: Plug-and-play Python utils for any computer-vision pipeline
github.com/roboflow
discuss
a year ago
birdinleconey
2 points
374.
▲
Show HN: I achieved over 10% improvement on 3D vision PointCLIP
github.com/genji970
discuss
a year ago
genji970
2 points
375.
▲
Smolvlm – Realtime Vision Language Model Demo
github.com/ngxson
discuss
a year ago
informal007
2 points
376.
▲
Search images like text using Vision Language Models
github.com/StarlightSearch
discuss
a year ago
r0rshrk
2 points
377.
▲
OmniTool – Control a Windows 11 VM with OmniParser plus vision model of choice
github.com/microsoft
discuss
a year ago
danboarder
2 points
378.
▲
Sparrow: Open-source data processing with ML, LLM and Vision LLM
github.com/katanaml
discuss
a year ago
madbiz
2 points
379.
▲
Visual Product Search: Combining React Native, Cloud Vision, Algolia, and Remix
discuss
2 years ago
iliashad
2 points
380.
▲
ShowUI: A lightweight vision-language-action model for GUI agents
github.com/showlab
discuss
2 years ago
punkpeye
2 points
381.
▲
BiomedGPT: A Generalist Vision-Language Foundation Model for Biomedical Tasks
github.com/taokz
discuss
2 years ago
giuliomagnifico
2 points
382.
▲
Mini-Omni2: Towards Open-Source GPT-4o with Vision, Speech, Duplex Capabilities
github.com/gpt-omni
discuss
2 years ago
taikon
2 points
383.
▲
Ollama with Experimental Vision Support
github.com/ollama
discuss
2 years ago
rspoerri
2 points
384.
▲
Show HN: Created a notebook to compare the top LMSYS vision models easily
github.com/Portkey-AI
discuss
2 years ago
roh26it
2 points
385.
▲
Recognize faces in photos using local models with Apple Vision
github.com/Nexuist
discuss
2 years ago
nexuist
2 points
386.
▲
Show HN: I made a simple unified LLM client with tool calling and vision support
github.com/piEsposito
discuss
2 years ago
someguy12345678
2 points
387.
▲
Implementation of Google's ScreenAI: Vision-Lang Model for UI and Understanding
github.com/kyegomez
discuss
2 years ago
spxneo
2 points
388.
▲
Apple Vision Pro and ROG Ally: Portable console gaming setup guide
gist.github.com
discuss
2 years ago
osy
2 points
389.
▲
Godot Support for VisionOS
github.com/kevinw
discuss
2 years ago
dagmx
2 points
390.
▲
TrackTales: Zero-shot narrator for mpd using GPT-4-vision
github.com/mlang
discuss
2 years ago
lynx23
2 points
More