HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
271.
▲
Show HN: A pdf parser based on multimodal LLM
github.com/lazyFrogLOL
1 comment
2 years ago
lazyfroghacker
1 points
272.
▲
CrayEye now supports local/FOSS models – multimodal vision multitool
github.com/alexdredmon
1 comment
2 years ago
anais9
1 points
273.
▲
Infra for building multimodal embeddings, built in Rust for speed and robustness
github.com/StarlightSearch
1 comment
2 years ago
Sonam_AI
1 points
274.
▲
Guiding Instruction-Based Image Editing via Multimodal Large Language Models
github.com/tsujuifu
1 comment
2 years ago
andsoitis
1 points
275.
▲
A framework to enable multimodal models to operate a computer
github.com/OthersideAI
1 comment
3 years ago
pyinstallwoes
1 points
276.
▲
Go MultiModule Workspaces:The Easy Way to Build and Run Code in Multiple Modules
github.com/mobiledatabooks
1 comment
4 years ago
thstart
1 points
277.
▲
Show HN: imgp – multicore batch image resizer and rotator. Go crunch 'em
github.com/jarun
1 comment
9 years ago
apjana
1 points
278.
▲
Bucardo multimaster and master/slave Postgres replication
github.com/bucardo
discuss
11 years ago
gnocchi
1 points
279.
▲
Publication under FOSS licence of a multimodal journey planner
github.com/CanalTP
discuss
12 years ago
tristramg
1 points
280.
▲
Multimedia story telling for the web
github.com/codevise
discuss
12 years ago
trutz
1 points
281.
▲
GoodQ4All – A Local-First Multimodal Memory and Intelligence System
github.com/GoodQ02
discuss
15 days ago
joesdomingo
1 points
282.
▲
Show HN: Omni – airgapped macOS multimodal search over local files
github.com/hanxiao
discuss
16 days ago
artex_xh
1 points
283.
▲
Show HN: Gemini Omni – A curated list of native multimodal guides and showcases
github.com/cnemri
discuss
a month ago
cnemri
1 points
284.
▲
Show HN: Reverse lookup XKCD comics using Gemini multimodal embeddings
github.com/hemanth
discuss
3 months ago
init0
1 points
285.
▲
Show HN: Pixrep – Turn code repositories into PDFs for multimodal LLMs
github.com/TingjiaInFuture
discuss
4 months ago
TingjiaInFuture
1 points
286.
▲
MiRAGE: Open-source framework for multimodal RAG evaluation
discuss
4 months ago
mmhetric
1 points
287.
▲
Puma 3D Printed Multimodality Microscope
github.com/TadPath
discuss
4 months ago
o4c
1 points
288.
▲
Show HN: X-AnyLabeling – An open-source multimodal annotation ecosystem for CV
github.com/CVHub520
discuss
6 months ago
CVHub520
1 points
289.
▲
Show HN: Unisondb A open source streaming multimodal database for Edge Computing
github.com/ankur-anand
discuss
7 months ago
ankuranand
1 points
290.
▲
Multicloud app that includes DePIN (Demo)
github.com/dkloudio
discuss
a year ago
hkdb
1 points
291.
▲
Neuralink Open Sources Data Catalog for Multimodal Data
github.com/neuralinkcorp
discuss
a year ago
skadamat
1 points
292.
▲
Qwen2.5-Omni is an end-to-end multimodal model
github.com/QwenLM
discuss
a year ago
tosh
1 points
293.
▲
Aana SDK, a framework for building AI enabled multimodal applications
github.com/mobiusml
discuss
a year ago
omneity
1 points
294.
▲
Show HN: Kfe – Cross-Platform Search Engine for Local Multimedia Files
github.com/Fl0k3n
discuss
a year ago
flok3n
1 points
295.
▲
Show HN: Magnitude – Natural language E2E testing with multimodal LLM agents
github.com/magnitudedev
discuss
a year ago
thrgreenwald
1 points
296.
▲
Full multimodal Android llm app running without netowrk
github.com/alibaba
discuss
a year ago
juude
1 points
297.
▲
AtomicRing; Fast MultiCast and Single Consumer Lock-Free Queues
github.com/rezabrizi
discuss
a year ago
rezatabrizi
1 points
298.
▲
Mfsync: Encrypted local filesharing using multicast host lookup
github.com/k4lipso
discuss
a year ago
kalipso
1 points
299.
▲
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
github.com/deepseek-ai
discuss
2 years ago
limoce
1 points
300.
▲
Show HN: AnyModal – A Flexible Multimodal Language Model Framework for PyTorch
github.com/ritabratamaiti
discuss
2 years ago
anneta
1 points
More