TankWork is an open-source desktop agent framework that enables AI to perceive and control your computer through computer vision and system-level interactions. Agents can:
Control your computer directly through voice or text commands Process real-time screen content using computer vision and expert skill routing Interact through natural language voice commands and text input Provide continuous audio-visual feedback and action logging Switch seamlessly between assistant and computer control modes