Show HN: Prompt-Engineering Tool: AI-to-AI Testing for LLM

37 points

3 years ago

Spelltest framework simulates conversations between AI ‘synthetic users' in an environment to test and refine LLM-based applications. It ensures your app converse with utmost accuracy and relevance. Post-chat, Spelltest assesses responses, providing qualitative and quantitative feedback on performance. Suitable for both chat and completion modes.

When to use: - After modifying your prompt. - When your LLM provider updates. - As a CI step for you repo.

All feedback and collaborations appreciated!

2 comments

2 comments