Why Executive Hires Fail in the First 18 Months

i. The short answerThe short answer

Boards usually file these failures under bad luck, chemistry, or politics. They are rarely any of those things. Most of them are a method error committed months earlier, during assessment, by a process that was never built to see it.

Executive hires fail in the first 18 months because most assessment processes measure one thing and predict another. They measure capability: technical skills, soft skills, and management skills, which interviews and track records surface well. They use that evidence to predict identity outcomes: whether the person's values, motivation, and operating style can live inside a specific organization. Capability and identity are different assessment registers, and evidence from one does not answer questions from the other. When the registers are conflated, the hire passes every interview and still fails, because the question that decided the outcome was never actually assessed.

This is the central argument of our research position paper, The Capability-Identity Distinction in Executive Hiring. What follows is the practitioner's version.

ii. The two questionsThe two questions every assessment answers

Every senior assessment process answers two questions, whether it knows it or not.

The first is: what can this person do? That is capability. It covers domain expertise, communication, stakeholder handling, team leadership, and judgment under pressure. Capability is observable, benchmarkable, and developable. It is also where assessment processes are strongest, because track records, structured interviews, and references are built for exactly this evidence.

The second is: who is this person? That is identity: stable patterns of values, character, decisional preferences, and the real drivers behind their interest in this role now. Identity is not malleable the way capability is. The behavioral genetics and adult-development literature cited in the paper puts the heritability of major personality dimensions at 40 to 60 percent, with rank-order consistency rising through adulthood. Coaching improves capability in months. It does not meaningfully rewrite identity on any timeline a hiring decision cares about.

Both registers matter. Capability decides whether the executive can do the work. Identity decides whether they can do the work here: inside this governance model, this pace, this ownership structure, this politics. A candidate can hold every required skill and still be the wrong person for the world they are about to enter.

iii. Why interviews rewardWhy interviews reward articulation and miss identity

Ask a candidate to describe their leadership style and you will learn something real. But what you learn is how fluently they describe leadership. Fluency of self-description is a communication skill. It sits in the capability register. The candidate who answers most smoothly is often the one who has rehearsed the answer most, which correlates with how actively they have been interviewing, not with depth of leadership identity.

The interview-faking research summarized in the paper makes the practical point: impression management is real, measurable, and sustainable across a single structured interview. It degrades across multiple conversations that probe specific past decisions from different angles and triangulate references against them. Identity evidence does not come from better questions in one meeting. It comes from a different process architecture: more conversations, structured exploration of real decisions, and attention to what the candidate defends, avoids, and treats as obvious.

A single excellent interview therefore measures capability and samples presentation. It does not assess identity, however confident everyone feels afterwards.

iv. What fifty yearsWhat fifty years of derailment research shows

The pattern this article describes is not a private discovery. The derailment literature has documented it since the 1960s, approaching from the failure side what our research approaches from the assessment side.

The earliest evidence is also the starkest. Industrial psychologist Jon Bentz followed managers at Sears, Roebuck for three decades. Every one of them had been hired on superior performance in rigorous cognitive and psychological assessment. Roughly 65 percent eventually stalled or were fired. Bentz traced the failures not to intelligence but to what he called overriding personality defects: patterns that stayed dormant while things went well and surfaced under sustained pressure.

The Center for Creative Leadership then compared executives who arrived with executives who derailed. The finding was uncomfortable: the two groups were astonishingly alike. Same strengths, same ambition, same records of achievement, and significant flaws in both. What separated them was not the presence of flaws but whether a given flaw mattered in that context, at that level. The single most cited derailment reason across the CCL studies was insensitivity to others. Not incompetence.

The modern numbers hold the same shape. Leadership IQ tracked 20,000 hires across 312 organizations: 46 percent failed within 18 months, and technical incompetence explained just 11 percent of those failures. Attitudinal and interpersonal factors explained 89 percent. Hogan's aggregation of twelve derailment studies puts the managerial failure base rate between 30 and 67 percent.

Read through the register frame, the convergence is hard to ignore. Flaws that stay dormant until pressure arrives are identity-register patterns. Two equally capable groups separated only by context means fit is a relation, not a property of the candidate. And an industry that concentrates its assessment effort on capability is assessing hardest on the register that explains the smallest share of failures.

v. The 18-month clockThe 18-month clock

The two registers fail on different schedules, which is why the conventional wisdom says senior failures take a year or more to surface.

Capability failures show up fast. An executive who cannot do the work reveals it within a quarter, and the organization usually has correction tools: coaching, restructured scope, added support. These failures are visible, addressable, and comparatively rare at senior level, because capability is what the process actually screened.

Identity failures compound slowly. A values mismatch with the owner family does not explode in week two. It accumulates through small decisions: what the executive protects, how they handle conflict with the board, which trade-offs they refuse, how they spend political capital. The organization reads the early signals as settling-in friction. By the time the pattern is undeniable, the hire is twelve months in, relationships have hardened, and the correction tools that work for capability do not apply. You cannot coach someone into wanting a mandate they discovered they do not want, or into accepting a governance model they resist.

There is a reason the dormant patterns wake. DDI's survey of 1,700 HR executives and transitioned leaders found that executives rate a senior transition as more stressful than bereavement or divorce. Sustained pressure of that order depletes exactly the self-monitoring that kept the interview performance coherent. What assessment never examined begins to show.

That is the 18-month clock. It is not that failure takes 18 months to happen. The failure was present at the start. It takes that long to become visible, because it lives in the register the assessment never examined. The direct costs are only part of the damage; we have written separately about the hidden cost of a failed executive hire.

vi. When identity assessmentWhen identity assessment is operationally critical

Not every mandate needs deep identity work, and a search firm that claims otherwise is selling process. The paper identifies four conditions under which identity assessment stops being optional:

1. Culture is a working criterion, not a marketing artifact. Family-controlled industrial groups, luxury houses with values continuity, founder-led companies in succession, mission-driven healthcare organizations. In these environments values shape daily operating decisions, and a candidate's fit with them is part of the job. 2. The interpersonal complexity of the role exceeds its technical complexity. Roles whose core demand is coalitions, stakeholder maps, and political navigation depend more on identity than capability. 3. The cost of mismatch is high and the correction window is long. Multi-year mandates make early identity errors expensive in ways capability errors are not. 4. The candidate pool is deep on capability. When several finalists can all do the work, identity is the only differentiator left, whether the process assesses it or not.

For mandates that meet these conditions, identity assessment is one of the central disciplines of the search. For mandates that do not, standard capability assessment is appropriate, and the additional depth should not be billed as if it were needed.

vii. What a register-disciplinedWhat a register-disciplined process looks like

The fix is not a new interview question. It is structural, and the paper sets it out in operational terms.

Capability and identity are assessed in deliberately separated conversations. Capability conversations probe scope, scenarios, decisions, and results. Identity conversations probe values under pressure, decisional patterns across different situations, and what the candidate consistently protects or sacrifices. The same consultant can run both, but should always know which register the conversation is in.

Shortlist evidence is presented in two distinct layers: capability evidence scored against role requirements, and identity evidence described against the company's actual operating reality. The client weighs them independently instead of receiving a single merged fit summary that hides the distinction.

And the depth of identity work is matched to the mandate, using the four conditions above, rather than applied uniformly as theatre.

One caution from the research: structured identity assessment is not the informal "culture fit" screening the sociology literature rightly criticizes. Hiring people the interviewer would enjoy at dinner produces homogeneity, not fit. Identity assessment calibrates against the operating reality of the role, on documented decisional evidence, kept deliberately separate from capability judgments. The difference between the two practices is the subject of our editorial Culture fit and soft skills are not the same.

viii. FAQFAQ

How many executive hires fail?

Across the major studies, the failure base rate for senior hires clusters between 30 and 50 percent. Leadership IQ found 46 percent of 20,000 tracked hires failed within 18 months; Hogan's aggregation of twelve derailment studies spans 30 to 67 percent. The share attributable to technical incapacity is small: 11 percent in the Leadership IQ data.

What is the most common reason executive hires fail?

Failures attributed to "fit", politics, or chemistry are usually identity-register failures: a mismatch between the executive's values, motivation, or operating style and the organization's real environment. They look unpredictable because the assessment process only gathered capability evidence, which cannot answer identity questions.

Can interviews predict whether a senior hire will succeed?

Structured interviews predict capability well: what the person can do. A single interview cannot reliably assess identity, because fluent self-description is itself a skill and can be rehearsed. Identity evidence requires multiple conversations probing real past decisions, plus reference triangulation against them.

How long should it take to know whether an executive hire is working?

Capability problems usually surface within the first quarter and often respond to support. Identity mismatches typically become undeniable between months six and eighteen, and rarely respond to coaching. A hire that "suddenly" fails at month twelve almost always carried the mismatch from day one.

---

For mandates where the identity stakes are real, this discipline is built into how we assess senior hires and into our methodology. If your last senior hire failed for reasons nobody can quite name, brief us on the mandate. We will tell you honestly which register it failed in.

Alessio Montaruli is the founder and Group CEO of KiTalent. The framework in this article is developed in full, with its academic grounding, in The Capability-Identity Distinction in Executive Hiring.

Why Executive Hires Fail in the First 18 Months

i. The short answerThe short answer

ii. The two questionsThe two questions every assessment answers

iii. Why interviews rewardWhy interviews reward articulation and miss identity

iv. What fifty yearsWhat fifty years of derailment research shows

v. The 18-month clockThe 18-month clock

vi. When identity assessmentWhen identity assessment is operationally critical

vii. What a register-disciplinedWhat a register-disciplined process looks like

viii. FAQFAQ

How many executive hires fail?

What is the most common reason executive hires fail?

Can interviews predict whether a senior hire will succeed?

How long should it take to know whether an executive hire is working?

Alessio Montaruli

Founder editorials on the same argument

Culture fit and soft skills are not the same

A profile is not a person

Why AI maps the talent market but cannot read the candidate

From the research shelf

If this market is shaping your next leadership hire, we should talk

Popular searches

Suggested reads