The Spelltest framework simulates conversations with AI "synthetic users" in a controlled environment to test and refine LLM-based applications. It helps you check that your app converses accurately and relevantly. After each chat, Spelltest assesses the responses and provides qualitative and quantitative feedback on performance. It is suitable for both chat and completion modes.
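To make the idea concrete, here is a minimal, self-contained sketch of the simulate-then-assess loop. It is not Spelltest's API: `simulate_chat`, `score_transcript`, `scripted_user`, and `stub_app` are hypothetical names, both sides of the chat are plain Python stubs rather than LLM calls, and the keyword-based score is only a placeholder for a real quality evaluation.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical types for illustration only; these are not part of Spelltest.
Turn = Tuple[str, str]                  # (role, text)
Reply = Callable[[List[Turn]], str]     # maps chat history to the next message


@dataclass
class Transcript:
    turns: List[Turn] = field(default_factory=list)


def simulate_chat(user: Reply, app: Reply, max_turns: int = 3) -> Transcript:
    """Alternate a synthetic user and the app under test for max_turns rounds."""
    transcript = Transcript()
    for _ in range(max_turns):
        transcript.turns.append(("user", user(transcript.turns)))
        transcript.turns.append(("app", app(transcript.turns)))
    return transcript


def score_transcript(transcript: Transcript, expected_keywords: List[str]) -> float:
    """Fraction of expected keywords found in the app's replies.

    A crude stand-in for a real (for example, LLM-based) assessment.
    """
    if not expected_keywords:
        return 0.0
    app_text = " ".join(
        text.lower() for role, text in transcript.turns if role == "app"
    )
    hits = sum(1 for kw in expected_keywords if kw.lower() in app_text)
    return hits / len(expected_keywords)


# Stubs standing in for the synthetic user and the app under test.
def scripted_user(history: List[Turn]) -> str:
    questions = ["Where can I go in July?", "What about the budget?", "Thanks!"]
    return questions[(len(history) // 2) % len(questions)]


def stub_app(history: List[Turn]) -> str:
    last_user_message = history[-1][1]
    return f"Here is a travel suggestion for: {last_user_message}"


transcript = simulate_chat(scripted_user, stub_app, max_turns=2)
score = score_transcript(transcript, ["travel", "budget", "visa"])
print(f"{len(transcript.turns)} turns, score = {score:.2f}")
```

In a real setup, both stubs would be replaced by model calls and the score by a richer evaluation; the sketch only shows the overall shape of the loop.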
When to use:
- After modifying your prompt.
- When your LLM provider updates its models.
- As a CI step for your repo.
All feedback and collaboration are appreciated!