Study Design: Prospective observation Study. Objectives: To compare the test-retest reliability and longitudinal validity (sensitivity to change) of 2 single-item numeric pain rating scales (NPRSs) with a 4-item pain intensity measure (P4). Background: Pain is a frequent outcome measure for patients seen in physical therapy; however, the error associated with efficient pain measures, such as the single-item NPRS, is greater thin for self-report measures of functional status. Initial evaluation of the P4 suggests that it is more reliable and sensitive to change than the NPRS. Methods and Measures: Two single-item NPRSs and the P4 were administered on 3 occasions-initial visit (n 220), within 72 hours of baseline (n = 213), and 12 days following baseline assessment (n 183)-to patients with musculoskeletal problems receiving physical therapy. Reliability was assessed using a type 2,11 intraclass correlation coefficient. Longitudinal validity was assessed by correlating the measures' change scores with a retrospective rating of change that included patients' and clinicians' perspectives. Results: The test-retest reliability and longitudinal validity of the P4 were significantly greater (P-1<.05) than both single-item NPRSs. Minimal delectable change of the P4 at the 90% confidence level was estimated to be a change of 22%, of the scale range (9 points) compared to 27.30% (3 points) and 31.8% (3.5 points) for the 2-day NPRS and 24-hour NPRS, respectively. Conclusions: The findings of this study suggest the P4 is more adept at assessing change in pain intensity than popular versions of single-item NPRSs.