6a

Good Taxonomy Can Address Classification Challenges in Personality Pathology by Providing Informative Priors That Balance Information Compression and Fidelity: Commentary on Categorical Models of Personality Disorders

Nathan T. HallAlison M. Schreiber, and Michael N. Hallquist

Introduction

Weinberg documents the history of categorical models of personality disorders (PDs) and presents a model based largely on the Diagnostic and Statistical Manual of Mental Disorders (DSM; American Psychiatric Association, 2013). He further argues that this model has notable benefits including the utility of disorder-specific concepts, which may aid in case conceptualization and treatment. The author reviews ubiquitous criticisms of the DSM model such as excessive comorbidity (Widiger & Trull, 2007) that have fueled support for dimensional and hybrid accounts of personality pathology. Indeed, despite the remarkable complexity of psychopathology, there is also structure in the patterns of symptom expression1 both within an individual and at the population level (Krueger et al., 2018). Building on this overview, we focus specifically on how taxonomic science can help clinicians and scientists navigate the often overwhelming complexity of conceptualizing key features of an individual.

With the recent proliferation of taxonomies of psychopathology – DSM-5 Section II versus III, HiTOP (Kotov et al., 2017), RDoC (Insel et al., 2010), and other dimensional models such as the SNAP (Simms & Clark, 2006) – we believe that now is an important time to reflect on the goals of classification (cf. Blashfield & Draguns, 1976). We propose that any good2 taxonomy of personality pathology compresses clinical data in order to balance representational simplicity and information fidelity. To explicate this point, we draw an analogy to digital photography, which faces a similar tradeoff between file size and image fidelity.

We note that one challenge to any taxonomy is the risk of reifying the underlying distinctions it makes (Hyman, 2010). In this regard, the DSM’s categorical model assumes that diagnoses are “natural kinds” despite empirical evidence that distinctions among PDs are often blurry (Widiger & Trull, 2007). Crucially, the problem of reification can lead to a “taxonomy by authority” that puts up epistemic blinders that likely impede scientific progress (Markon, 2013). Thus, we suggest treating taxonomic constructs as open concepts (Zachar, Turkheimer, & Shaffner, in press) that are modifiable in light of new information.

How Are Clinical Psychologists Like Digital Photographers?

To set the stage for conceptualizing taxonomies of personality pathology, meet Addison, a digital photographer who was recently hired to produce a web exposé on bunnies for a pet website. Addison just finished a photoshoot and has selected 60 of the best bunny pictures. The challenge is that the design manager insists that each picture should be no more than 200 kilobytes so that the web page loads faster and does not tax server bandwidth. Addison’s original files are 30 megabytes each and contain rich visual detail, but they are 150 times larger than the acceptable size. To strike a balance between file size and image quality, she applies a “lossy” JPEG compression algorithm. In lossy compression, the information in an image file is compressed into fewer bits by searching for statistical dependencies and removing details that do not unduly harm the fidelity of the picture. For example, subtle changes in hue from one pixel to the next could be collapsed into the same hue. By applying more severe compression, Addison can achieve smaller file sizes, but at the expense of image quality. For a visual depiction of such compression effects, see https://michaelhallquist.github.io/PD_information_compression_fidelity/datacompression_rabbit_color_web.jpg. The link to the extended version can be found here: https://michaelhallquist.github.io/PD_information_compression_fidelity/

Next, meet Devon, a first-year graduate student in clinical psychology conducting his first intake assessment. The client presents with a wide array of problems including binge drinking, explosive arguments with romantic partners, suicidality, frequent self-injury, and intermittent feelings of sadness and anxiety. The client also behaves flirtatiously toward Devon and says that she likes to flirt with people, but that this has led to unwanted sexual attention and even assault. Needless to say, Devon feels overwhelmed by the volume and complexity of clinical information and is now faced with writing an intake report to guide treatment planning. Like Addison, Devon is faced with the problem of how to capture the richness of the client’s experience while compressing the complexity into a simpler case formulation such as a diagnosis or personality profile. We propose that a good taxonomy can aid in this endeavor, but that without such guidance, Devon’s confusion is not due to ineptitude but to natural limitations in the representational capacity of all humans. The crux of classification problems in personality pathology is, which “compression algorithm” will capture the most clinical information while not overwhelming Devon with details that could lead to suboptimal decisions influenced by cognitive heuristics?

Information Overload and the Need to Compress

When provided with a large quantity of information, humans can suffer from information overload, often performing worse than simple “actuarial” decision rules (Dawes, Faust, & Meehl, 1989). Instead, decisions are often better when clinicians rely on a few highly important pieces of information (Faust, 2012). In the face of uncertainty and complexity, humans use a number of mental heuristics that simplify decision-making, which can lead to biased or idiosyncratic decisions (Tversky & Kahneman, 1974). In fact, in a complex value-based decision-making task, we found that individuals who selectively maintained a few high-value options while forgetting low-value alternatives exhibited better task performance (Hallquist & Dombrovski, 2019). In the case of psychiatric taxonomy, basic work on the limits of human representational capacity (Ma, Husain, & Bays, 2014) suggests that a good taxonomy should prune away or deemphasize peripheral information while retaining the most informative features. Thus, we propose that information compression and information fidelity are two axiomatic3 principles of any good taxonomy.

The principle of information compression is that a taxonomy should leverage the regularities in psychopathology features to emphasize dominant sources of covariation. Compression schemes may be hierarchical, as in the case of dimensional models of normal and abnormal personality, where broad distinctions such as internalizing and externalizing can be subdivided into finer features. Although compression is an important conceptual principle, it also has formal ties to multivariate approaches such as factor analysis and cluster analysis. In factor models, a large correlation matrix is thought to reflect a smaller number of latent dimensions that explain most of the covariation. In this way, if correlations among 80 features of psychopathology can be captured by eight latent factors, we have compressed the data tenfold, substantially simplifying the problem.

The principle of information fidelity is that a taxonomy should maintain essential features that reasonably approximate the structure of an individual or the population. Conceptually, if a taxonomy has high information fidelity, measuring a patient in terms of its features alone, one should be able to infer more detailed aspects of the clinical presentation. For example, if a patient’s medical chart contains the diagnoses of borderline personality disorder (BPD) and generalized anxiety disorder, could a clinician use this information to predict the patient’s level of antagonism? Returning to factor analysis, if the compression scheme has high information fidelity, we could back-project from scores on the eight factors to estimated responses on all 80 features with reasonable accuracy. Although we necessarily sacrifice detail when we compress the features of psychopathology, a scheme with high information fidelity can still approximate these details.

We believe that attending to the dialectical relationship between information compression and information fidelity (akin to the fit-parsimony tradeoff in statistics) opens a productive space for professionals to consider which taxonomy (“compression algorithm”) accomplishes the most with the fewest features. Importantly, the appropriate level of compression may depend on the scientific or clinical question and the evidence of incremental utility for using a less compressed (i.e., more detailed) over a more compressed (i.e., less detailed) representation.

Judging Taxonomies of Personality Pathology: How Do We Move Forward?

When choosing among psychiatric taxonomies, we are often faced with the challenging problem of comparing systems that are qualitatively different. For example, the DSM-5’s Section II model of PDs compresses 79 symptoms into 10 clinical syndromes, which can be thought of as binary variables, whereas the SNAP compresses 375 items into 12 trait and 3 temperament dimensions. In Devon’s case, applying these taxonomies may give rather different clinical impressions.

Using the DSM-5 Section II categorical model, Devon would rate the presence of 79 symptoms and identify whether any sum of symptoms exceeds the stated diagnostic threshold for each PD diagnosis. This approach would likely lead to a diagnosis of BPD, given the presence of unstable relationships, labile affect, and chronic suicidality. Furthermore, the BPD diagnosis would greatly compress the clinical complexities, framing the patient’s presentation in terms of “borderlinearity.”4 As Weinberg notes, diagnostic prototypes can promote further thinking about more specific distinctions among individuals with the same diagnosis, such as the importance of interpersonal hypersensitivity in BPD. Measured by the SNAP, flirtatious behavior would be represented as heightened exhibitionism, whereas the rest of patient’s presentation would be described by mistrust, aggression, self-harm, and disinhibition.

Even though this example is intentionally simplified for illustration (i.e., the problems listed in the example are already compressed), the point is that the features that enter into case conceptualization and scientific thinking vary across taxonomies. This highlights the tension between a taxonomy that is reasonably comprehensive (less compressed) and one that is simple, potentially at the expense of explanatory power. Dimensional models of personality disorders have grown up in the tradition of personality psychology, which seeks to describe a wide array of individual differences. This is an admirable approach, but there are also considerable cognitive challenges to interpreting multidimensional trait profiles. For example, the Personality Inventory for DSM-5 (PID-5; Krueger, Derringer, Markon, Watson, & Skodol, 2012) contains five domains and 25 facets, representing a rich, but potentially complex, system for describing personality pathology. By contrast, the categorical diagnosis approach of DSM-5 Section II is probably too simplistic and narrow.

How, then, can we find the “Goldilocks” just-right balance between information compression and fidelity? Part of the answer depends on a judgment of quality, which is difficult to define and open to debate. For Devon’s patient, would expanding the feature space beyond “borderlinearity” alone help to mitigate over-compression? Conversely, could the profile from the SNAP be further compressed – for example, by only focusing on extreme elevations such as mistrustfulness and self-harm – to make the complex trait profile more clinically actionable? Regardless, the compression inherent to any taxonomy should preserve the clinical picture, rather than yielding an erroneous or scrambled representation that obscures structure.

An incremental taxonomic science should also pursue an empirical path that compares the quantitative alignment of alternative models to psychopathology data. Using variants of latent variable models, personality pathology researchers can compare the relative evidence for categorical, dimensional, and hybrid taxonomies using information-theoretic fit indices (Markon & Krueger, 2006). These criteria formalize the intuition that more complex models necessarily fit better, but at the expense of parsimony – echoing the tensions between fidelity and compression articulated above. Importantly, quantitatively comparing taxonomic model evidence depends on the features (i.e., variables) to be compressed being identical between models. That is, to meaningfully compare classification systems, the inputs need to be the same even if the representations differ (e.g., 5 traits versus 10 categories).

In addition, to overcome debatable distinctions about quality, one clear target for advancing taxonomies of personality pathology is to compare their clinical and predictive utility. A taxonomy that provides effective compression may reduce information overload, mitigating the impact of cognitive heuristics and freeing up cognitive resources to make more nuanced judgments. For example, can Devon conceptualize the client’s binge drinking, argumentativeness, and suicidality as reflecting a core problem with disinhibition? If so, this could inform treatment strategies that address the shared liability. Furthermore, such a simplification could provide Devon mental space to think about how high SNAP exhibitionism may support a cycle in which romantic infidelity, in conjunction with disinhibition, leads to explosive arguments.

Stepping out to the nomothetic level, studies of predictive utility can also advance taxonomic science. If we can agree on a set of clinical and psychological outcomes that are important to predict (e.g., suicidality), then we can determine which signals are useful to retain and which can be safely compressed. For example, Eaton and colleagues (2013) found that internalizing pathology (a broad latent dimension) outperformed any DSM PD in predicting future internalizing pathology, suicide attempts, and other health-related outcomes.

The Goal of Taxonomy Is to Provide Informative Priors

We propose that a good taxonomy should arm Devon with critical prior information. This is an idea borrowed from Bayesian decision theory, in which decision-makers bring prior information to bear on current decisions, thus potentially reducing uncertainty and focusing attention on key variables. Priors in psychiatric description should provide an empirically based roadmap to make predictions when faced with uncertain information, rather than promoting reification or encouraging overreliance on clinical experience.

In treatment settings, clinicians often operate with noisy information such as brief psychiatric interviews. We propose that a good taxonomy should provide prior information about what features are important to focus on when working with limited data. Indeed, personality pathology is often overlooked in clinical assessments because key features are not systematically assessed or emphasized (e.g., Ruggero, Zimmerman, Chelminski, & Young, 2010). Furthermore, population norms for personality pathology assessments can provide crucial information about the rarity of a trait elevation in a given patient. Such norms would help Devon to incorporate base rates into clinical judgment, a classic example of how prior information can lead to more accurate decisions (Meehl & Rosen, 1955). Importantly, priors necessarily bias predictions, but taxonomic science should strive to provide priors that bias clinicians toward good decisions (Hertwig & Grüne-Yanoff, 2017).

By emphasizing information compression and information fidelity as two “axiomatic” principles of a good taxonomy, we hope to promote further discussion among advocates of different models of personality pathology. If disparate taxonomies can be judged on similar criteria and compared using quantitative methods that address the parsimony-fit tradeoff, this will motivate incremental progress in the classification of personality pathology. Ultimately, a taxonomy that provides empirically supported informative priors can maximize the system’s utility in both clinical science and practice.

References

American Psychiatric Association. (2013). Diagnostic and Statistical Manual of Mental Disorders (5th ed.). Arlington, VA: American Psychiatric Publishing.

Blashfield, R. K., & Draguns, J. G. (1976). Toward a taxonomy of psychopathology: The purpose of psychiatric classification. British Journal of Psychiatry129(6), 574–583.

Dawes, R. M., Faust, D., & Meehl, P. E. (1989). Clinical versus actuarial judgment. Science243(4899), 1668–1674.

Eaton, N. R., Krueger, R. F., Keyes, K. M., Wall, M., Hasin, D. S., Markon, K. E., … Grant, B. F. (2013). The structure and predictive validity of the internalizing disorders. Journal of Abnormal Psychology122(1), 86–92.

Faust, D. (2012). Decision research can increase the accuracy of clinical judgment and thereby improve patient care. In S. O. Lilienfeld & W. T. O’Donohue (Eds.), The Great Ideas of Clinical Science (pp. 49–76). New York: Routledge.

Hallquist, M. N., & Dombrovski, A. Y. (2019). Selective maintenance of value information helps resolve the exploration/exploitation dilemma. Cognition183, 226–243.

Hertwig, R., & Grüne-Yanoff, T. (2017). Nudging and boosting: Steering or empowering good decisions. Perspectives on Psychological Science12(6), 973–986.

Hyman, S. E. (2010). The diagnosis of mental disorders: The problem of reification. Annual Review of Clinical Psychology6, 155–179.

Insel, T., Cuthbert, B., Garvey, M., Heinssen, R., Pine, D. S., Quinn, K., … Wang, P. (2010). Research Domain Criteria (RDoC): Toward a new classification framework for research on mental disorders. American Journal of Psychiatry167(7), 748–751.

Kotov, R., Krueger, R. F., Watson, D., Achenbach, T. M., Althoff, R. R., Bagby, R. M., … Zimmerman, M. (2017). The Hierarchical Taxonomy of Psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology126(4), 454–477.

Krueger, R. F., Derringer, J., Markon, K. E., Watson, D., & Skodol, A. E. (2012). Initial construction of a maladaptive personality trait model and inventory for DSM-5. Psychological Medicine42(9), 1879–1890.

Krueger, R. F., Kotov, R., Watson, D., Forbes, M. K., Eaton, N. R., Ruggero, C. J., … Zimmermann, J. (2018). Progress in achieving quantitative classification of psychopathology. World Psychiatry17(3), 282–293.

Ma, W. J., Husain, M., & Bays, P. M. (2014). Changing concepts of working memory. Nature Neuroscience17(3), 347–356.

Markon, K. E. (2013). Epistemological pluralism and scientific development: An argument against authoritative nosologies. Journal of Personality Disorders27(5), 554–579.

Markon, K. E., & Krueger, R. F. (2006). Information-theoretic latent distribution modeling: Distinguishing discrete and continuous latent variable models. Psychological Methods11(3), 228–243.

Meehl, P. E., & Rosen, A. (1955). Antecedent probability and the efficiency of psychometric signs, patterns, or cutting scores. Psychological Bulletin52(3), 194–216.

Ruggero, C. J., Zimmerman, M., Chelminski, I., & Young, D. (2010). Borderline personality disorder and the misdiagnosis of bipolar disorder. Journal of Psychiatric Research44(6), 405–408.

Simms, L. J., & Clark, L. A. (2006). The Schedule for Nonadaptive and Adaptive Personality (SNAP): A dimensional measure of traits relevant to personality and personality pathology. In S. Strack (Ed.), Differentiating Normal and Abnormal Personality (2nd ed., pp. 431–450). New York: Springer.

Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases. Science185(4157), 1124–1131.

Widiger, T. A., & Trull, T. J. (2007). Plate tectonics in the classification of personality disorder: Shifting to a dimensional model. American Psychologist62(2), 71–83.

Zachar, P., Turkheimer, E., & Shaffner, K. (in press). Defining and redefining phenotypes: Operational definitions as open concepts. In A. G. C. Wright & M. N. Hallquist (Eds.), Handbook of Research Methods in Clinical Psychology. Cambridge University Press.

1We note that there remains an active debate about whether symptoms or underlying mechanisms (biological, cognitive, genetic, etc.) should be the primary focus of taxonomic efforts. The central thesis of this commentary is agnostic on this point, but it is likely that both symptoms and mechanisms will be important to moving forward.

2We use the term “good” throughout the commentary to signify positive attributes of a taxonomic system, while acknowledging that any taxonomy provides an imperfect roadmap where information is necessarily lost.

3We do not use the terms “axiom” or “axiomatic” in their strict mathematical sense. Instead, we use them to emphasize necessary conditions of a good taxonomy, which we hope can help adjudicate among alternative models.

4An evocative phrase borrowed from Aidan G. C. Wright.

If you find an error or have any questions, please email us at admin@erenow.org. Thank you!