Ever found yourself wondering which LLM would work best for your specific needs? In our latest video, I walk you through TWO powerful methods to evaluate AI models like GPT, Claude, Gemini, and others using Postman!
The first method uses the Collection Runner to benchmark multiple models in a single run, comparing performance metrics like token usage, response time, and content length side by side. Perfect for data-driven developers who need comprehensive testing.
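For a sense of how that works under the hood: a Collection Runner benchmark like this usually hinges on a small post-response script attached to each model's request. Here's a minimal sketch, assuming an OpenAI-style JSON response with a `usage` object; the field paths vary by provider, so treat them as placeholders, not the exact script from the video:

```javascript
// Post-response script (Postman "Scripts" tab) — illustrative sketch only.
// Field paths assume an OpenAI-style response; adjust for Claude/Gemini.
pm.test("Model responded successfully", function () {
    pm.response.to.have.status(200);
});

const body = pm.response.json();

// Response time in milliseconds, reported by Postman for every request
const responseTime = pm.response.responseTime;

// Token usage — the field name differs between providers
const totalTokens = body.usage ? body.usage.total_tokens : "n/a";

// Length of the generated text (OpenAI-style path assumed)
const content =
    body.choices && body.choices[0].message
        ? body.choices[0].message.content
        : "";

// Logs one comparable line per model to the Postman console during the run
console.log(
    `model=${body.model} time=${responseTime}ms tokens=${totalTokens} chars=${content.length}`
);
```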
The second approach uses Postman Flows for real-time, interactive comparisons. This visual method lets you see exactly how different models respond to the same prompt side by side, with an AI evaluator determining which response is best based on your custom criteria.
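If you're wondering what that evaluator step boils down to, here's a hedged sketch of the "LLM as judge" idea: assemble one judging prompt from both candidate responses plus your criteria, then send it to a judge model from a Flow. The function name, criteria, and output format below are illustrative assumptions, not the exact setup from the video:

```javascript
// Illustrative "LLM as judge" prompt builder — names and criteria are
// assumptions for the sketch, not the video's exact configuration.
function buildEvaluatorPrompt(userPrompt, responseA, responseB, criteria) {
    return [
        "You are an impartial judge comparing two AI responses.",
        `Evaluation criteria: ${criteria.join(", ")}.`,
        `Original prompt: ${userPrompt}`,
        `Response A: ${responseA}`,
        `Response B: ${responseB}`,
        'Answer with "A" or "B" and a one-sentence justification.',
    ].join("\n");
}

// Example usage: feed the result to any judge model via an HTTP block in a Flow
const judgePrompt = buildEvaluatorPrompt(
    "Explain rate limiting in one paragraph.",
    "<GPT response here>",
    "<Claude response here>",
    ["accuracy", "clarity", "conciseness"]
);
console.log(judgePrompt);
```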
Both the Collection Runner and Flows examples from this tutorial are available in our public workspace! Have you tried either of these approaches before? What's your go-to method for comparing AI models? Do you prefer quantitative metrics or qualitative response evaluation? And which metrics matter most to you when selecting a model?
Watch the full tutorial here to see both methods in action: