Question
What goes wrong when you ignore that pilot programs as system experiments?
Quick Answer
Running a pilot that is not a genuine experiment. Common corruptions include: selecting the best team for the pilot (guaranteeing success but preventing learning), providing the pilot team with extra resources not available at scale (inflating results), not measuring unintended consequences (only.
The most common reason fails: Running a pilot that is not a genuine experiment. Common corruptions include: selecting the best team for the pilot (guaranteeing success but preventing learning), providing the pilot team with extra resources not available at scale (inflating results), not measuring unintended consequences (only tracking the intended benefit), running the pilot indefinitely instead of making a go/no-go decision (using the pilot as a substitute for commitment), and declaring the pilot a success before the results are in (using the pilot as political cover rather than evidence). A genuine pilot is designed to learn, not to prove. The most valuable pilot is one that reveals problems — because the problems can be fixed in the pilot before they become organization-wide.
The fix: Design a pilot for a system change you want to make. Define five elements: (1) Scope — what is the bounded context for the pilot? Choose a team, project, or process that is representative of the broader organization but small enough to monitor closely. (2) Duration — how long will the pilot run? Long enough to observe the full cycle of the changed process but short enough to maintain organizational attention. Six to twelve weeks for process changes; three to six months for structural changes. (3) Metrics — what will you measure? Include both the intended outcome (does the change work?) and unintended consequence indicators (what side effects emerge?). (4) Comparison — what will you compare the pilot results against? The team's own pre-pilot performance is the simplest comparison, but a parallel unchanged team provides stronger evidence. (5) Decision criteria — what results would lead you to expand the pilot, modify it, or abandon it? Define these criteria before the pilot begins to prevent after-the-fact rationalization.
The underlying principle is straightforward: Test systemic changes on a small scale before rolling them out broadly. A pilot program is a bounded experiment — a deliberate test of the proposed system change in a contained context where the change can be observed, measured, and refined without risking the entire organization. Pilots serve three functions: they generate evidence (does the change produce the intended outcome?), they reveal unintended consequences (what side effects emerge in practice?), and they build organizational confidence (the change has been tested and it works). System changes deployed without piloting are organizational gambles — large bets on untested designs.
Learn more in these lessons