Search: github.com/Lision | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

361.

Show HN: Convert any design into HTML code using GPT4 vision

github.com/mostafasadeghi97

3 years ago

2 points

362.

INT-FP-QSim: Simulating LLMs and Vision Transformers

github.com/lightmatter-ai

3 years ago

2 points

363.

Train vision transformers using Google images

github.com/nateraw

5 years ago

2 points

364.

Show HN: Slideo – Synchronize Slides with Video Using Computer Vision (OpenCV)

github.com/hediet

5 years ago

2 points

365.

pip3 install videoflow - New library for computer vision on videos

7 years ago

2 points

366.

JavaScript Computer Vision library.

inspirit.github.com

13 years ago

2 points

367.

Show HN: SoMatic – Vision-based OS automation framework for AI agents

github.com/Smyan1909

a month ago

2 points

368.

Show HN: Neuroscope – Real-time “x-ray vision” into LLMs’ minds

github.com/cjroth

3 months ago

2 points

369.

Alibaba releases open-source vision model for native layered image editing

github.com/QwenLM

6 months ago

2 points

370.

Yzma – local Vision Language Models/LLMs in Go using llama.cpp without CGo

github.com/hybridgroup

9 months ago

2 points

371.

Show HN: Magnitude MCP – vision-first browser interaction for Claude Code

github.com/sagekit

9 months ago

2 points

372.

Show HN: Demo of AI-enabled voice/vision features on open source hardware [video]

9 months ago

2 points

373.

Show HN: Plug-and-play Python utils for any computer-vision pipeline

github.com/roboflow

a year ago

2 points

374.

Show HN: I achieved over 10% improvement on 3D vision PointCLIP

github.com/genji970

a year ago

2 points

375.

Smolvlm – Realtime Vision Language Model Demo

github.com/ngxson

a year ago

2 points

376.

Search images like text using Vision Language Models

github.com/StarlightSearch

a year ago

2 points

377.

OmniTool – Control a Windows 11 VM with OmniParser plus vision model of choice

github.com/microsoft

a year ago

2 points

378.

Sparrow: Open-source data processing with ML, LLM and Vision LLM

github.com/katanaml

a year ago

2 points

379.

Visual Product Search: Combining React Native, Cloud Vision, Algolia, and Remix

2 years ago

2 points

380.

ShowUI: A lightweight vision-language-action model for GUI agents

github.com/showlab

2 years ago

2 points

381.

BiomedGPT: A Generalist Vision-Language Foundation Model for Biomedical Tasks

github.com/taokz

2 years ago

giuliomagnifico

2 points

382.

Mini-Omni2: Towards Open-Source GPT-4o with Vision, Speech, Duplex Capabilities

github.com/gpt-omni

2 years ago

2 points

383.

Ollama with Experimental Vision Support

github.com/ollama

2 years ago

2 points

384.

Show HN: Created a notebook to compare the top LMSYS vision models easily

github.com/Portkey-AI

2 years ago

2 points

385.

Recognize faces in photos using local models with Apple Vision

github.com/Nexuist

2 years ago

2 points

386.

Show HN: I made a simple unified LLM client with tool calling and vision support

github.com/piEsposito

2 years ago

someguy12345678

2 points

387.

Implementation of Google's ScreenAI: Vision-Lang Model for UI and Understanding

github.com/kyegomez

2 years ago

2 points

388.

Apple Vision Pro and ROG Ally: Portable console gaming setup guide

gist.github.com

2 years ago

2 points

389.

Godot Support for VisionOS

github.com/kevinw

2 years ago

2 points

390.

TrackTales: Zero-shot narrator for mpd using GPT-4-vision

github.com/mlang

2 years ago

2 points