Show HN: VQASynth – pipelines to synthesize VQA datasets

Heykuki News

1 point

2 years ago

Inspired by the recent work in SpatialVLM, we reproduce similar data synthesis pipelines using openly available models.

We compare our results to alternative annotation pipelines like RAM-Grounded-SAM.

Our repo uses a simple pipeline in docker compose to produce datasets suitable for fine-tuning multimodal models like LLaVA.

No comments

Threaded

Loading comments...

Show HN: VQASynth – pipelines to synthesize VQA datasets | Heykuki News