I just released ParseHawk v0.1.0:
Apache-2.0 licensed 100% local document AI platform that extracts JSON from PDFs, images etc.
It builds on top of NuMind's NuExtract3 but additionally enforces a provided JSON schema with constrained decoding.
It works on Apple Silicon with pre-bundled vllm-metal as well as Linux + NVIDIA with vllm.
Looking forward to your feedback!