How do I use this scorecard?

Run the scorecard against each vendor in your shortlist. For each question, answer Yes / Partial / No based on what the vendor demonstrably offers (verified in their docs, contract, or pilot — not what their salesperson promised). The live score updates as you answer. Compare scores across vendors to make a defensible decision.

What are deal-breakers?

Questions weighted 5 represent capabilities so critical that 'no' should disqualify the vendor regardless of strong scores elsewhere. Examples: 'Data isolation (our prompts aren't used to train other models)', 'Data export rights in contract', 'Pricing predictable at our scale'. A single weight-5 'no' triggers a walk-away verdict.

Why weighted scoring instead of just yes/no?

Simple yes/no checklists treat every requirement as equally important — they don't. A vendor missing a 'public changelog' (weight 2) is fine; a vendor missing 'data export rights' (weight 5) is dangerous. Weighted scoring forces explicit trade-off thinking and produces a defensible final number.

How long does this take?

10-20 minutes per vendor, faster after the first one. Most teams can't answer every question from public information — that's the point. The questions where you check 'unsure' become the questions you ask in the next sales call.

Can I save / share my scorecard?

Use the 'Email me this scorecard' button below to send the full breakdown to yourself or your team. Each vendor evaluation can be re-run later if details change. We don't store data server-side — your answers live in your browser.

How did you choose the 42 questions?

Based on Forrester and Gartner AI agent procurement frameworks (2024-25) plus 40+ post-mortems of real AI agent migrations: what went wrong, what should have been asked before signing. The categories — integration depth, pricing transparency, security/compliance, vendor risk, reliability, support, performance, governance — map to the top recurring failure modes.

AI Agent Vendor Selection Scorecard

42 weighted questions across integration, pricing, security, vendor risk, reliability, support, performance, and governance. Live scoring with deal-breaker detection. Email the result.

Loading scorecard…

Frequently asked questions

How do I use this scorecard?
Run the scorecard against each vendor in your shortlist. For each question, answer Yes / Partial / No based on what the vendor demonstrably offers (verified in their docs, contract, or pilot — not what their salesperson promised). The live score updates as you answer. Compare scores across vendors to make a defensible decision.
What are deal-breakers?
Questions weighted 5 represent capabilities so critical that 'no' should disqualify the vendor regardless of strong scores elsewhere. Examples: 'Data isolation (our prompts aren't used to train other models)', 'Data export rights in contract', 'Pricing predictable at our scale'. A single weight-5 'no' triggers a walk-away verdict.
Why weighted scoring instead of just yes/no?
Simple yes/no checklists treat every requirement as equally important — they don't. A vendor missing a 'public changelog' (weight 2) is fine; a vendor missing 'data export rights' (weight 5) is dangerous. Weighted scoring forces explicit trade-off thinking and produces a defensible final number.
How long does this take?
10-20 minutes per vendor, faster after the first one. Most teams can't answer every question from public information — that's the point. The questions where you check 'unsure' become the questions you ask in the next sales call.
Can I save / share my scorecard?
Use the 'Email me this scorecard' button below to send the full breakdown to yourself or your team. Each vendor evaluation can be re-run later if details change. We don't store data server-side — your answers live in your browser.
How did you choose the 42 questions?
Based on Forrester and Gartner AI agent procurement frameworks (2024-25) plus 40+ post-mortems of real AI agent migrations: what went wrong, what should have been asked before signing. The categories — integration depth, pricing transparency, security/compliance, vendor risk, reliability, support, performance, governance — map to the top recurring failure modes.

Loading…

Frequently asked questions

How do I use this scorecard?
Run the scorecard against each vendor in your shortlist. For each question, answer Yes / Partial / No based on what the vendor demonstrably offers (verified in their docs, contract, or pilot — not what their salesperson promised). The live score updates as you answer. Compare scores across vendors to make a defensible decision.
What are deal-breakers?
Questions weighted 5 represent capabilities so critical that 'no' should disqualify the vendor regardless of strong scores elsewhere. Examples: 'Data isolation (our prompts aren't used to train other models)', 'Data export rights in contract', 'Pricing predictable at our scale'. A single weight-5 'no' triggers a walk-away verdict.
Why weighted scoring instead of just yes/no?
Simple yes/no checklists treat every requirement as equally important — they don't. A vendor missing a 'public changelog' (weight 2) is fine; a vendor missing 'data export rights' (weight 5) is dangerous. Weighted scoring forces explicit trade-off thinking and produces a defensible final number.
How long does this take?
10-20 minutes per vendor, faster after the first one. Most teams can't answer every question from public information — that's the point. The questions where you check 'unsure' become the questions you ask in the next sales call.
Can I save / share my scorecard?
Use the 'Email me this scorecard' button below to send the full breakdown to yourself or your team. Each vendor evaluation can be re-run later if details change. We don't store data server-side — your answers live in your browser.
How did you choose the 42 questions?
Based on Forrester and Gartner AI agent procurement frameworks (2024-25) plus 40+ post-mortems of real AI agent migrations: what went wrong, what should have been asked before signing. The categories — integration depth, pricing transparency, security/compliance, vendor risk, reliability, support, performance, governance — map to the top recurring failure modes.

AI Agent Vendor Selection Scorecard

Frequently asked questions

How do I use this scorecard?

What are deal-breakers?

Why weighted scoring instead of just yes/no?

How long does this take?

Can I save / share my scorecard?

How did you choose the 42 questions?

AI Agent Vendor Selection Scorecard

Integration0.0/23

Pricing0.0/18

Security0.0/30

Vendor0.0/18

Reliability0.0/18

Support0.0/15

Performance0.0/20

Governance0.0/10

Get the full report

Frequently asked questions

How do I use this scorecard?

What are deal-breakers?

Why weighted scoring instead of just yes/no?

How long does this take?

Can I save / share my scorecard?

How did you choose the 42 questions?

Integration0.0/23

Pricing0.0/18

Security0.0/30

Vendor0.0/18

Reliability0.0/18

Support0.0/15

Performance0.0/20

Governance0.0/10

Get the full report