Lawrence Kohlberg’s stages of moral development remain a cornerstone of developmental psychology, offering a structured framework for understanding how individuals reason about right and wrong. Building upon the cognitive developmental work of Jean Piaget, Kohlberg proposed that moral reasoning evolves through six distinct stages grouped into three levels: pre-conventional, conventional, and post-conventional. Plus, while his theory revolutionized the field and sparked decades of research, it has also faced substantial scrutiny. Understanding which of these are major criticisms of Kohlberg's theory is essential for students, educators, and psychologists seeking a nuanced view of moral psychology But it adds up..
The Core Framework: A Brief Recap
Before diving into the critiques, it helps to visualize the architecture Kohlberg built. He presented participants with moral dilemmas—most famously the Heinz dilemma, where a man considers stealing an expensive drug to save his dying wife. Kohlberg was less interested in the choice itself (steal or not steal) than in the reasoning behind it No workaround needed..
- Level 1: Pre-conventional Morality (Stages 1 & 2): Reasoning is based on external consequences. Stage 1 focuses on obedience and punishment avoidance; Stage 2 focuses on self-interest and instrumental exchange ("you scratch my back, I’ll scratch yours").
- Level 2: Conventional Morality (Stages 3 & 4): Reasoning aligns with societal norms. Stage 3 emphasizes interpersonal accord and being a "good person"; Stage 4 emphasizes maintaining social order, law, and duty.
- Level 3: Post-conventional Morality (Stages 5 & 6): Reasoning transcends specific laws. Stage 5 views laws as social contracts subject to change for the greater good; Stage 6 relies on universal ethical principles like justice, equality, and human dignity.
Kohlberg argued this sequence was universal, invariant, and hierarchical. It is precisely these structural claims that have drawn the most fire.
Gender Bias and the "Different Voice" Critique
Perhaps the most famous and enduring criticism comes from Carol Gilligan, a former research assistant of Kohlberg. In her seminal work In a Different Voice (1982), Gilligan argued that Kohlberg’s theory exhibits a profound andocentric bias—it takes the male experience as the universal human standard Easy to understand, harder to ignore..
The Justice vs. Care Orientation
Kohlberg’s highest stages prioritize a justice orientation: abstract principles of fairness, rights, equality, and impartial application of rules. Gilligan’s research suggested that women (and many men) frequently approach moral dilemmas through a care orientation. This perspective prioritizes relationships, responsibility, compassion, and the avoidance of harm within specific contexts rather than abstract universal rules.
Because Kohlberg’s original longitudinal study used an all-male sample (72 boys from Chicago), the scoring criteria were calibrated to recognize justice reasoning as "higher" or "more mature." When women were later tested using these same criteria, they frequently scored at Stage 3 (interpersonal concordance), which Kohlberg classified as conventional—lower than the post-conventional stages dominated by men.
The criticism: The theory mistakes a gender difference in moral voice for a developmental deficiency. By defining moral maturity solely through the lens of justice and autonomy, the theory systematically devalues the moral reasoning style often socialized in women—one focused on connection, narrative context, and care. While subsequent meta-analyses (like those by Walker, 1984) show gender differences in stage scores are small or non-existent when education and occupation are controlled, the theoretical bias in defining the endpoint of development remains a valid philosophical critique.
Cultural Relativism vs. Universalism
Kohlberg claimed his stages were culturally universal—that every human culture progresses through the same sequence in the same order. Critics, particularly cross-cultural psychologists and anthropologists, argue this claim reflects a Western, liberal, individualistic bias.
Individualism as the Pinnacle
Stage 5 (Social Contract) and Stage 6 (Universal Principles) enshrine values central to Western Enlightenment thought: individual rights, democratic due process, and the autonomy of the self. In many collectivist cultures—found across East Asia, Africa, and Latin America—moral maturity is defined not by asserting individual rights against the group, but by fulfilling role-based duties, maintaining harmony, respecting hierarchy, and preserving the collective good And that's really what it comes down to..
The criticism: When researchers apply Kohlberg’s scoring manual to non-Western populations, individuals often score lower, appearing "stuck" at the conventional level. Critics like Richard Shweder and Joan Miller argue this doesn't indicate a lack of development; it indicates a different cultural telos (end goal). The theory mistakes the content of Western morality for the structure of universal morality. It fails to recognize that reasoning based on communal responsibility and spiritual purity can be just as cognitively complex as reasoning based on individual rights.
The Hypothetical Dilemma Methodology
Kohlberg relied almost exclusively on responses to hypothetical moral dilemmas (like Heinz stealing the drug). This methodology faces three distinct methodological criticisms It's one of those things that adds up..
1. Reasoning vs. Behavior Gap
The most pragmatic criticism is the weak correlation between moral reasoning (what people say they would do) and moral behavior (what people actually do). Knowing the "right" answer at Stage 5 or 6 does not predict helping a stranger, resisting peer pressure to cheat, or whistleblowing on corruption. Critics like Augusto Blasi argue that moral identity and commitment are better predictors of action than reasoning stage alone. A person at Stage 3 who deeply identifies as a "helper" may act more morally than a Stage 5 philosopher who views ethics as an intellectual exercise.
2. Artificiality and Lack of Ecological Validity
Hypothetical dilemmas strip away the emotional weight, time pressure, relational history, and ambiguity of real-life situations. In the Heinz dilemma, the participant has no relationship with Heinz, his wife, or the druggist. Real moral decisions are embedded in webs of affection, power dynamics, and incomplete information. Critics argue that reasoning in a sterile interview setting accesses verbal intelligence and hypothetical thinking skills more than it accesses the moral intuition that drives daily life.
3. The "Production" vs. "Recognition" Gap
Later research (notably by James Rest and the Neo-Kohlbergian school) showed that people can often recognize higher-stage reasoning when presented with it (e.g., in the Defining Issues Test), even if they cannot produce it spontaneously in an interview. This suggests Kohlberg’s interview method underestimates the moral competence of participants, particularly younger children or those with lower verbal fluency That's the part that actually makes a difference..
The Regression and Stage 6 Problem
Kolhberg’s structuralism demanded that stages be invariant sequences—you cannot skip a stage, and you cannot regress. That said, longitudinal data eventually forced Kohlberg to confront a messy reality: Stage 6 virtually disappeared.
In his 20-year longitudinal study, Kohlberg found almost no adults reasoning consistently at Stage 6 (Universal Ethical Principles). By the 1980s, he officially dropped Stage 6 as a measurable stage, relegating it to a "theoretical ideal." This created a theoretical crisis: if the endpoint of the invariant sequence is empirically non-existent, the structural claim of a six-stage hierarchy collapses.
Not the most exciting part, but easily the most useful.
To build on this, critics noted stage mixture is the norm, not the exception. On the flip side, individuals reason at multiple stages simultaneously depending on the dilemma's content. A person might use Stage 5 reasoning for political issues (civil liberties) but Stage 3 reasoning for family loyalty Not complicated — just consistent..
…hard stage” thesis. When individuals draw on different stages for different domains, the notion of a single, overarching moral structure loses its explanatory power. This pattern—often termed horizontal décalage—suggests that moral reasoning is more modular than hierarchical: people may apply principled thinking to abstract political questions while relying on interpersonal concerns in family or peer contexts, and vice‑versa. Such flexibility indicates that the cognitive processes underlying moral judgment are sensitive to content, context, and personal relevance, rather than being governed by a rigid, stage‑locked algorithm Most people skip this — try not to..
The disappearance of Stage 6 further destabilizes the invariant‑sequence claim. If the putative pinnacle of moral development is rarely, if ever, observed in empirical samples, the theory’s developmental trajectory lacks an empirically verified endpoint. And kohlberg’s later decision to treat Stage 6 as a “theoretical ideal” acknowledges this shortfall, but it also raises a deeper question: what does it mean to posit a stage that functions only as a philosophical benchmark? Critics argue that the construct becomes unfalsifiable, reducing the theory’s scientific utility.
People argue about this. Here's where I land on it.
In response to these challenges, several research programs have shifted focus from stage classification to the measurement of moral competence and motivation. The Defining Issues Test (DIT‑2), for instance, quantifies the extent to which individuals prefer and put to use postconventional considerations when evaluating dilemmas, without demanding that they produce such reasoning spontaneously. Similarly, the Moral Identity Scale captures the extent to which moral traits are central to self‑concept, a variable that predicts moral behavior more reliably than stage scores alone. Dual‑process models, which distinguish between intuitive, affect‑driven judgments and deliberative, reasoning‑based judgments, also accommodate the observed variability: intuitive processes may dominate in emotionally charged, relational situations, whereas deliberative reasoning emerges more readily in abstract, impersonal contexts.
These alternative frameworks preserve Kohlberg’s insight that moral cognition develops with age and experience, yet they accommodate the messiness of real‑world moral life. They recognize that individuals can simultaneously hold multiple moral schemas, that context shapes which schema is activated, and that personal commitment and identity often bridge the gap between judgment and action Simple, but easy to overlook..
Conclusion
The critique of Kohlberg’s stage model reveals that while his work pioneered the study of moral development as a cognitive progression, its strict stage hierarchy fails to capture the fluid, context‑sensitive nature of everyday moral reasoning. Empirical evidence shows that Stage 6 is rarely attained, that individuals routinely mix stages across issues, and that moral identity and commitment often outweigh pure reasoning stage in predicting behavior. Contemporary approaches—emphasizing moral competence, identity, and dual‑process dynamics—offer a more nuanced and empirically grounded account of how people figure out the moral complexities of life. In moving beyond the rigid stage sequence, researchers retain Kohlberg’s legacy of developmental inquiry while embracing the richness and variability inherent in human morality No workaround needed..