Search: github.com/Lision | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

451.

RVAA: Recursive Vision-Action Agent for Long Video Understanding

github.com/mohammed840

5 months ago

1 points

452.

LLM Vision: Visual intelligence for your smart home

github.com/valentinfrlch

6 months ago

1 points

453.

Show HN: Agenteract – Drive mobile apps with LLMs using UI trees (no vision)

6 months ago

1 points

454.

Training YOLO vision models on Kaggle datasets

github.com/mfranzon

8 months ago

1 points

455.

VisionOS Godot Engine support merged

a year ago

1 points

456.

Show HN: Vision AI Label Studio – Open-Source Image Labeling Tool

a year ago

1 points

457.

Show HN: OSS AI Agent for Computer Vision

github.com/picselliahq

a year ago

1 points

458.

[Google Research] Handwriting Conversion with Vision Language Model

github.com/google-research

a year ago

1 points

459.

Show HN: Vision, PDF reading and Python

github.com/ilevd

a year ago

1 points

460.

Computer vision models inference directly on mobile

github.com/software-mansion

a year ago

1 points

461.

DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]

github.com/deepseek-ai

2 years ago

1 points

462.

OpenDAL Going to Set Vision as "One Layer, All Storage"

github.com/apache

2 years ago

1 points

463.

Show HN: Capd – idea to visually analyze active PowerShell with OpenAI Vision

github.com/Lywald

2 years ago

1 points

464.

Roboflow Notebooks: 60+ computer vision modeling notebooks

github.com/roboflow

2 years ago

1 points

465.

Eagle: Vision-Centric High-Resolution Multimodal LLMs with Mixture of Encoders

github.com/NVlabs

2 years ago

1 points

466.

Unibench: Vision-Language Model Evaluation

github.com/facebookresearch

2 years ago

1 points

467.

Try to dump traditional mouse. Click by [Vim] + [screen vision-recognition] way

github.com/garywill

2 years ago

1 points

468.

Show HN: Gesture Composer for VisionOS [video]

2 years ago

1 points

469.

Moondream: Tiny Vision Language Model

github.com/vikhyat

2 years ago

1 points

470.

Show HN: Geniusrise – open-source inference endpoints for text, vision, audio

2 years ago

1 points

471.

Show HN: Building WebApp with Vision Pro Like UI with CSS

github.com/kelvinkoko

2 years ago

1 points

472.

3D Printing Failure Detection with GPT4 Vision

github.com/myrakrusemark

2 years ago

1 points

473.

SeeAct GPT-4V(ision) Is a Generalist Web Agent, If Grounded

github.com/OSU-NLP-Group

2 years ago

1 points

474.

AI Employe: Actions Augmented Browser Automation Using GPT-4 Vision

github.com/vignshwarar

2 years ago

1 points

475.

Show HN: Labelformat now supports all major vision labeling formats

github.com/lightly-ai

3 years ago

1 points

476.

Sound and Vision – Video Streaming to the ESP32

github.com/atomic14

3 years ago

1 points

477.

Large Language-and-Vision Assistant for BioMedicine

github.com/microsoft

3 years ago

yagizdegirmenci

1 points

478.

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

github.com/facebookresearch

3 years ago

1 points

479.

VoxelGPT: Open-source AI assistant for curating computer vision datasets

github.com/voxel51

3 years ago

1 points

480.

A general representation modal across vision, audio, language modalities

github.com/OFA-Sys

3 years ago

1 points