Depression screening: a practical strategy

,

Applied Evidence

Depression screening: a practical strategy

J Fam Pract. 2003 February;52(2):127-134

PDF Download

References

1. Katon W, Schulberg H. Epidemiology of depression in primary care. Gen Hosp Psychiatry 1992;14:237-47.

2. Magruder-Habib K, Zung WW, Feussner JR. Improving physicians’ recognition and treatment of depression in general medical care. Results from a randomized clinical trial. Med Care 1990;28:239-50.

3. Coyne JC, Schwenk TL, Fechner-Bates S. Nondetection of depression by primary care physicians reconsidered. Gen Hosp Psychiatry 1995;17:3-12.

4. Williams JW, Mulrow CD, Kroenke K, et al. Case-finding for depression in primary care: a randomized trial. Am J Med 1999;106:36-43.

5. Greenberg PE, Stiglin LE, Finkelstein SN, Berndt ER. The economic burden of depression in 1990. J Clin Psychiatry 1993;54:405-18.

6. Katon W, Von Korff M, Lin E, et al. Distressed high utilizers of medical care. DSM-III-R diagnoses and treatment needs. Gen Hosp Psychiatry 1990;12:355-62.

7. Von Korff M, Ormel J, Katon W, Lin EH. Disability and depression among high utilizers of health care. A longitudinal analysis. Arch Gen Psychiatry 1992;49:91-100.

8. Wells KB, Stewart A, Hays RD, et al. The functioning and well-being of depressed patients. Results from the Medical Outcomes Study. JAMA 1989;262:914-9.

9. Ford DE, Mead LA, Chang PP, Cooper-Patrick L, Wang NY, Klag MJ. Depression is a risk factor for coronary artery disease in men: the precursors study. Arch Intern Med 1998;158:1422-6.

10. Pignone MP, Gaynes BN, Rushton JL, et al. Screening for depression in adults: a summary of the evidence for the U.S. Preventive Services Task Force. Ann Intern Med 2002;136:765-76.

11. US. Preventive Services Task Force. Screening for depression: recommendations and rationale. Ann Intern Med 2002;136:760-4.

12. Spitzer RL, Kroenke K, Williams JB. Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. Primary Care Evaluation of Mental Disorders. Patient Health Questionnaire. JAMA 1999;282:1737-44.

13. Valenstein M, Vijan S, Zeber JE, Boehm K, Buttar A. The cost-utility of screening for depression in primary care. Ann Intern Med 2001;134:345-60.

14. Jaen CR, Stange KC, Nutting PA. Competing demands of primary care: a model for the delivery of clinical preventive services. J Fam Pract 1994;38:166-71.

15. Klinkman MS. Competing demands in psychosocial care. A model for the identification and treatment of depressive disorders in primary care. Gen Hosp Psychiatry 1997;19:98-111.

16. American Psychiatric Association, American Psychiatric Association, Task Force on DSM-IV. Diagnostic and statistical manual of mental disorders: DSM-IV-TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.

17. Schwenk TL, Coyne JC, Fechner-Bates S. Differences between detected and undetected patients in primary care and depressed psychiatric patients. Gen Hosp Psychiatry 1996;18:407-15.

18. Williams JW, Jr, Noel PH, Cordes JA, Ramirez G, Pignone M. Is this patient clinically depressed? JAMA 2002;287:1160-70.

19. Spitzer RL, Williams JB, Gibbon M, First MB. The Structured Clinical Interview for DSM-III-R (SCID). I: History, rationale, and description. Arch Gen Psychiatry 1992;49:624-9.

20. Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatr Scand 1983;67:361-70.

21. Silverstone PH. Poor efficacy of the Hospital Anxiety and Depression Scale in the diagnosis of major depressive disorder in both medical and psychiatric patients. J Psychosom Res 1994;38:441-50.

22. Wells KB, Sherbourne C, Schoenbaum M, et al. Impact of disseminating quality improvement programs for depression in managed primary care: a randomized controlled trial. JAMA 2000;283:212-20.

23. Stewart AL, Hays RD, Ware JE, Jr. The MOS short-form general health survey. Reliability and validity in a patient population. Med Care 1988;26:724-35.

24. Rost K, Nutting P, Smith J, Coyne JC, Cooper-Patrick L, Rubenstein L. The role of competing demands in the treatment provided primary care patients with major depression. Arch Fam Med 2000;9:150-4.

25. Leon AC, Portera L, Olfson M, et al. False positive results: a challenge for psychiatric screening in primary care. Am J Psychiatry 1997;154:1462-4.

26. Nease DE, Jr, Klinkman MA, Volk RJ. Improved detection of depression in primary care through severity detection. J Fam Pract 2002;51:1065-70.

27. Spitzer RL, Williams J, Kroenke K, Linzer M, deGruy FV, Hann SR, et al. Utility of a new procedure for diagnosing mental disorders in primary care: the PRIME-MD 1000 study. JAMA 1994;272:1749-56.

28. Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 2001;16:606-13.

29. Coyne JC, Fechner-Bates S, Schwenk TL. Prevalence, nature, and comorbidity of depressive disorders in primary care. Gen Hosp Psychiatry 1994;16:267-76.

30. Wells K, Burnam M, Rogers W, Hays R, Camp P. The course of depression in adult outpatients: results from the Medical Outcomes Study. Arch Gen Psychiatry 1992;49:788-94.

31. Lambert MJ, Hatch DR, Kingston MD, Edwards BC. Zung, Beck, and Hamilton Rating Scales as measures of treatment outcome: a meta-analytic comparison. J Consult Clin Psychol 1986;54:54-9.

32. Beck A, Ward C, Mendelson M, Mock J, Erbaugh J. An inventory for measuring depression. Arch Gen Psychiatry 1961;4:561-71.

33. Zung WW, Richards CB, Short MJ. Self-rating depression scale in an outpatient clinic. Further validation of the SDS. Arch Gen Psychiatry 1965;13:508-15.

34. Radloff LS. The CES-D Scale: A self-report depression scale for research in the general population. Applied Psychological Measurement 1977;1:385-401.

35. Sheikh JI, Yesavage JA. Geriatric Depression Scale (GDS): Recent evidence and development of a shorter version. In: Clinical Gerontology: A Guide to Assessment and Intervention. New York: Haworth Press; 1986;165-73.

Pros and cons of assessment scales

The advantages of using a scale are due to the manner in which patients experience depressive symptoms, along a continuum of mild to severe. A scale is able to represent these gradations in severity and may be helpful in guiding the need for treatment and treatment adjustments.

Unfortunately, this ability to measure the dimensional nature of depression is also a weakness, as a threshold must be identified above which the patient is classified as warranting further investigation. Ideally, these thresholds should be established in a representative primary care sample and predict functional status as well as likelihood of meeting DSM-IV diagnostic criteria. The ability of a scale to accurately identify patients in need of attention depends directly on the threshold.

Pros and cons of symptom counts

Instruments based on depression criteria are a relatively new innovation, appearing since the establishment of DSM-IV criteria that define reference symptoms, a minimum number of which must be present to diagnose depression. Depression criteria–based instruments have the advantage of not being dependent on a threshold of symptom severity.

However, in primary care settings this can also be a weakness because the presence of depression criteria alone may not be a reliable indicator of depression-related impairment.¹⁷ Instruments that can be used in both a diagnostic criteria and scale modes have a particular advantage in that the weaknesses of each are offset.

Characteristics of selected screening instruments

We searched MEDLINE and the Cochrane databases for reviews of depression screening, with particular attention to reviews of primary care-based trials. Forty-one papers emerged, 3 of which were systematic reviews. For this paper, we focused on the review published by Williams and colleagues,¹⁸ which summarizes primary care data on the depression screening instruments most widely used. They examined 379 studies that compared the primary care performance of these instruments with a reference standard diagnostic interview, such as the Structured Clinical Interview for DSM-IV (SCID).¹⁹ Twenty-eight studies met their criteria and were included in the systematic review.

In Table 2 we have adapted the information from Williams’s review and added a calculation of PPV based on a 10% prevalence estimate for depression in primary care populations. We chose to exclude information on the Single Question (SQ) screen because of its very low PPV and the Hopkins Symptom Checklist (HSCL) because of its length (25 questions). In addition, we chose to add the Hospital Anxiety and Depression Scale (HADS), using operating characteristic information from 2 studies,^20,21 because of its purported advantages in medically ill populations.

Beyond the SQ, it is useful to comment on “2-question screening” as suggested by the USPSTF. We are unable to find justification for this in the paper by Pingone and colleagues, which served as background for the recommendations.¹⁰ Although Pingone et al did cite the report of Wells and colleagues as using a 2-item screener, their study used not only 2 questions on mood and anhedonia but also other criteria in screening their population.²² Therefore, it is not appropriate as a source for 2-item screening performance characteristics.

Comparison of the operating characteristics of the selected instruments reveals that most yield PPV values in the 20% to 30% range, with the exception of the HADS, the PHQ, and the PHQ-9, which yield PPV values of 41.3%, 50%, and 55%, respectively.

The PHQ-9 (included in the (Appendix) offers a further advantage over the HADS and other instruments listed in that within a 9-item instrument both the presence of diagnostic criteria and severity may be assessed. Kroenke and colleagues have examined the use of the PHQ-9 as a severity instrument and found it to be a reliable and valid measure of depression severity when compared with the Medical Outcomes Study Short Form (SF-20).²³

We purposely have not examined negative predictive values (NPV) for the listed instruments. NPV is useful when screening using biomedical markers where a negative result allows extrapolation into the future due to a known, predictable time course for development of the screened-for condition. For example, a negative screening colonoscopy has value not just because of its current predictive value, but because we know something about how long it may take to develop precancerous polyps in a negative screened patient. However, this is not the case with depression. A patient that fails to meet criteria for depression today could fully meet criteria in 2 weeks and be quite depressed. Therefore we have chosen to focus on PPV in comparing depression screening instruments.

Selection and use of a screening instrument