From the Copenhagen Neuromuscular Center, Department of Neurology, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
*These authors contributed equally to this work.
Objective: The 2- and 6-minute walk tests are used to evaluate walking capacity, but reliability has been sparsely investigated in patients with neuromuscular diseases. The aim of this study was to investigate the relative and absolute reliability of the 2- and 6-minute walk tests in patients with neuromuscular diseases.
Design: Each patient performed a 2- and a 6-minute walk test on 2 test days separated by 1–2 weeks.
Subjects: A total of 93 adult patients (mean age 53 years, age range 22–83 years) with 12 different neuromuscular diseases were included.
Results: The mean walking distance increased by 4.3 and 11.2 m (p < 0.001) in repeated 2- and 6-minute walk tests, respectively. Intraclass correlation coefficient in the 2- and 6-minute walk tests was 0.99 (p < 0.001). Standard error of measurement was 4.9 m in the 2-minute walk test and 14.0 m in the 6-minute walk test. Minimal detectable difference was 13.7 m in the 2-minute walk test and 38.8 m in the 6-minute walk test.
Conclusion: These findings show good relative reliability of the 2- and 6-minute walk tests in patients with neuromuscular diseases. However, absolute reliability demonstrated variability in neuromuscular diseases. This should be considered when interpreting a change in walking distance.
Key words: neuromuscular diseases; reproducibility of results; walk test.
Accepted Feb 14, 2017; Epub ahead of print Mar 29, 2017
J Rehabil Med 2017; 49: 362–366
Correspondence address: Kirsten Lykke Knak, Rigshospitalet, Department of Neurology, Copenhagen Neuromuscular Center, 6921NM, Juliane Maries Vej 28, DK-2100 Copenhagen, Denmark. E-mail: kirsten.lykke.knak@regionh.dk
The 2-minute walk test (2MWT) and 6-minute walk test (6MWT) are used to evaluate walking capacity in patients with neuromuscular diseases (NMD) in the clinic and in clinical trials. However, reliability of the 2MWT has been documented only in Duchenne muscular dystrophy (1) and reliability of the 6MWT has been documented only in Duchenne muscular dystrophy (1, 2), myotonic dystrophy (3), and spinal and bulbar muscular atrophy (4). Relative reliability characterizes the trustworthiness of a test in a specific diagnosis. Absolute reliability provides actual values for measurement error that should be exceeded in order to represent a real difference in a patient’s clinical condition.
The aim of the present study was to investigate the relative and absolute reliability of the 2MWT and 6MWT in patients with NMD. The walk tests in some of the patients have been published previously in Andersen et al. (5).
A total of 93 patients were recruited from March 2014 to January 2015. Patients followed at the Copenhagen Neuromuscular Center were invited to participate in the study. Inclusion criteria were: biopsy or genetically confirmed NMD; age ≥ 18 years; and capacity to walk further than 60 m. Exclusion criteria were: conditions with a significant impact on walking capacity.
This is a methodological study. The patients were tested at our Neuromuscular Center on 2 test days at the same time of the day, 1–2 weeks apart. On each test day, a 2MWT and a 6MWT were performed, with 30 min rest between tests. Patients were block randomized (block size 10) using sealed envelopes on the first test day to perform either the 2MWT or the 6MWT first. The same order was used for re-test. The 6MWT was performed according to the American Thoracic Society (ATS) guidelines (6) and the 2MWT was a modified version of the ATS guidelines for the 6MWT. In the 2MWT, patients were informed that the test was a 2MWT, and the encouragement used at 3 min in ATS guidelines was used at 1 min in the 2MWT. Two investigators performed the 2MWT and 6MWT, but each patient was instructed by the same investigator at test and re-test.
Patient characteristics and walking distance in the present study investigating relative and absolute reliability have been reported in a published study assessing validity of the 2MWT compared with the 6MWT (5). The Regional Committee on Health Research Ethics in Denmark approved the study (H-4-2014-FSP) and informed verbal consent was obtained.
Analyses have been conducted in SPSS version 22, except Student’s paired t-tests, which were performed in Microsoft Excel 2010. As subgroup behave differently, we have conducted both analyses for all patients as a group and for each NMD. Data for all patients as a group and for each subgroup were normally distributed tested by Shapiro-Wilk test and Quantile-Quantile Plot in SPSS. Systematic differences in walking distance between test and re-test were tested by the parametric 2-sided Student’s paired t-test. Patient characteristics and change in walking distance between tests are presented as mean and standard deviation (SD). Relative reliability was determined by intraclass correlation (ICC) coefficient (model: 3, 2-way mixed, type: absolute agreement, confidence interval: 95%). Relative reliability indicates the degree of reliability and ranges from 0.00 = no reliability to 1.00 = perfect reliability (7). The higher reliability, the less measurement error in the assessment. Absolute reliability was investigated by standard error of measurement (SEM), minimal detectable difference (MDD) (repeatability) for 95% confidence interval and by Bland-Altman plots, which also tested for heteroscedasticity (if variability is unequal across magnitudes of the mean).
SEM = SD*√1-ICC (7) MDD95 = 1.96*SEM * √2 (7)
SEM% and MDD% were calculated using the formulas:
SEM% = SEM/mean*100 (8) MDD95% = MDD/mean*100 (8)
The mean for SEM and MDD95 is for the test and re-test. SEM and MDD95 estimate the smallest change that is beyond measurement error in a group and in a single subject, respectively (3). Bland-Altman plot visualizes agreement between repeated tests. The higher reliability, the closer the dots are to zero. Statistical significance was set at p ≤ 0.05. Results are shown for the NMD patients all together and for each NMD. Post hoc sample size calculation with alpha level = 0.05, 2-tailed test and correlation coefficient = 0.90 required a sample of 7 patients (7).
Patient characteristics were as follows: 12 different NMD (Fig. 1, Table I), 32 females, 61 males, age 53 years (SD 17), height 1.74 m (SD 0.10), weight 76.2 kg (SD 16.3), body mass index 25.2 kg/m2 (SD 5). Mean muscle strength, using the Medical Research Council scale, for ankle dorsal flexion was 4 (SD 1.6), ankle plantar flexion 4.4 (SD 1.1), hip flexion 4.4 (SD 1), and hip extension 4.5 (SD 0.7). Twenty-eight patients used assistive devices in the 2MWT and 6MWT. At re-test of the 2MWT, 60 patients walked longer, 27 walked shorter, and 6 walked the same distance. Similarly, in the 6MWT, 67 patients walked longer at re-test, 24 walked shorter, and 2 walked the same distance.
Fig. 1. Walking distance in the 2MWT and 6MWT in individual patients with neuromuscular diseases. 2MWT: 2-minute walk test; 6MWT: 6-minute walk test; DM1: myotonic dystrophy type 1; LGMD: limb-girdle muscular dystrophy; SBMA: spinobulbar muscular atrophy/Kennedy disease; CMT: Charcot-Marie-Tooth disease/hereditary motor and sensory neuropathy; FSHD1: facioscapulohumeral muscular dystrophy type 1; MM: mitochondrial myopathy; IBM: sporadic inclusion body myositis; BMD: Becker muscular dystrophy. The walking distance (m) is on the y-axis and test days on the x-axis.
Table I. Walking distance in the 2-minute walk test and 6-minute walk test in each neuromuscular diseases
For all patients as a group the mean walking distance in repeated 2MWT increased by 4.3 m (SD 9.1) (2.9 %) from 144.9 (SD 51.8) to 149.2 m (SD 52.2) (p < 0.001) and in the 6MWT by 11.2 m (SD 26.0) (2.7 %) from 412.0 (SD 155.6) to 423.2 m (SD 158.1) (p < 0.001). Results for subgroups are shown in Fig. 1.
Relative reliability was high for all patients as a group in 2MWT and 6MWT (ICC = 0.99, p < 0.001, 95% CI 0.98–1.00). Similar results were seen in all subgroups (ICC > 0.90, p < 0.004). For all diseases as a group SEM and MDD95 were 4.9 m (3.4%) and 13.7 m (9.3%), respectively, in the 2MWT, and 14.0 m (3.4%) and 38.8 m (9.3%) in the 6MWT. In subgroups, SEM varied from 0.5–9.1 m in the 2MWT and 7.1–37.0 m in the 6MWT, and MDD95 varied between 1.5–25.2 m in the 2MWT and between 19.7–102.5 m in the 6MWT. Results for subgroups are shown in Table II.
Table II. Reliability of the 2MWT and 6MWT in each neuromuscular diseases
Agreement between repeated 2MWT and 6MWT is shown in Bland-Altman plots (Fig. 2, A + B) with a 95% CI showing a variation of –13.9 to + 22.5 m for the 2MWT and –40.8 to + 63.3 m for the 6MWT. The difference in walking distance from test to re-test was not dependent on walking distance on the first test day, meaning there was no heteroscedasticity.
Fig. 2. Absolute reliability of the 2MWT and 6MWT in patients with neuromuscular diseases. 2MWT: 2-minute walk test, 6MWT: 6-minute walk test. The different NMD are symbolized by different colours. Each dot represents a single patient. Various (6): spinal muscular atrophy (3), myotonia congenital Thomsen disease (1), congenital myopathy (1) and polymyositis (1). A + B: The Bland-Altman plots show the change in walking distance between repeated tests for each patient compared with mean ± 1.96 standard deviation (SD) for all NMD as a group. The walking distance on test day 1 indicates whether the variability among tests depends on the walking distance on the first test day. Therefore, data on the x-axis is the walking distance from the first test day and not the mean walking distance from test day 1 and test day 2. The x-axis shows the walking distance (m) on the first test day. The y-axis shows the change in walking distance (m) from the first test day to the second test day. The closer the dots are to zero the more reliable the test is. The 3 horizontal lines in each graph illustrate the mean difference from test to re-test and the 95% confidence intervals, with the upper line showing 1.96*SD plus the mean and the lower line showing 1.96*SD minus mean.
The main finding of the present study is that the 2MWT and 6MWT are as reliable in NMD as in other diseases. Relative reliability was excellent, with an ICC of 0.99 in both the 2MWT and 6MWT. Similar results were seen in all subgroups (ICC > 0.90). These findings are consistent with previous studies of the 2MWT and 6MWT in different neuromuscular (3, 4, 9, 10) and non-neuromuscular (11) conditions (ICC 0.61–1.00). A general guideline suggests that ICC > 0.75 = good reliability, and ICC > 0.90 is recommended for clinical decisions (7). According to these guidelines, the 2MWT and 6MWT are reliable measures for clinical decisions. However, the acceptable level of ICC depends on the specific circumstances. A lower level of ICC can be accepted if 2MWT or 6MWT are secondary endpoints as opposed to a primary outcome.
Despite the good relative reliability of the 2MWT and 6MWT, there was a statistically significant increase in walking distance from test to re-test of 3% in both tests for NMD as a group (4.3 and 11.2 m in the 2MWT and 6MWT, respectively), which could be due to a learning effect. Variation was also present in subgroups, but the variation was not statistically significant in most of them. The results might be explained by small samples in subgroups and the large variability in walking capacity within subgroups. Statistically significant variability in walking distance in repeated 6MWT has been reported in previous studies of NMD with an increase of 3–5% (17.0–24.0 m) (3, 12). However, variability in post-polio syndrome (PPS) showed a non-statistically significant increase in walked distance of less than 2% in 6MWT (9) and 2MWT (10).
Absolute reliability provides information about absolute difference between repeated tests. In line with previous findings (3, 13, 14), SEM in our study was 4.9 m (3.4%) in the 2MWT and 14.0 m (3.4%) in the 6MWT for all patients as a group, but varied in subgroups. MDD95 was 13.7 m (9.3%) in the 2MWT and 38.8 m (9.3%) in the 6MWT for all patients as a group, but large subgroup variability was found. The MDD95 for the 2MWT and 6MWT is consistent with previous studies (3, 9, 13, 14), except from Stolwijk-Swüste et al (10), which reported a smallest detectable change of 22.9 m (16.8%) for the 2MWT in PPS. The inconsistency might be due to different NMD and patient characteristics (e.g. age and functional level) across studies. The larger SEM and MDD95 in the subgroups might be explained by: (i) different NMD; (ii) different circumstances in repeated tests in some patients (e.g. poor sleep); (iii) difference in sample size as a larger sample reduces SEM; and (iv) difference in walking capacity across subgroups as there was a tendency that subgroups with the longest walking distance (myotonic dystrophy type 1 and facioscapulohumeral muscular dystrophy) showed the smallest measurement error, while, in contrast, subgroups with the shortest walking distance (sporadic inclusion body myositis and spinobulbar muscular atrophy) showed the largest measurement error. Thus, measurement error seems to be dependent on the functional level; measurement error increases with decreasing walking capacity. Clinicians need to consider the walking capacity when interpreting a change between assessments. SEM% and MDD95% vary between 2MWT and 6MWT in subgroups. Not surprisingly, those showing the highest SEM% in 2MWT also have the highest MDD95% in the 2MWT. The NMD affected in the peripheral nervous system (spinobulbar muscular atrophy and Charcot-Marie-Tooth disease) show the highest SEM% and MDD95% in the 2MWT. In contrast, most of the muscular dystrophies (myotonic dystrophy type 1, facioscapulohumeral muscular dystrophy and Becker muscular dystrophy) show the highest SEM% and MDD95% in the 6MWT. These findings have important implications for the interpretation of a change in walking distance in repeated 2MWT and 6MWT at follow-up or in clinical trials. The variability among test results, especially in outliers, could potentially be corrected by measuring mean heart rate during walking, as suggested recently (12).
The present study was limited by only a very few protocol violations, such as testing performed on different times of day in a few patients. However, the patients with protocol violations showed similar results as the other patients and were therefore not excluded from the study.
In conclusion, our findings show good relative reliability of the 2MWT and 6MWT in patients with NMD as a group and in subgroups. However, absolute reliability demonstrated variability in NMD as a group and a considerable amount of variability across subgroups. Decreased level of walking capacity showed increased level of absolute measurement error. Thus, clinicians must be aware of the issues associated with absolute reliability when interpreting results of 2MWT and 6MWT in individual patients in the clinic.
KLK and LKA received travel grants from Lundbeck Foundation, Rigshospitalets’ Anniversary Foundation and PTC Therapeutics. JV received research and travel grants from Genzyme Corporation, and serves on the global scientific advisory board for Pompe disease for Genzyme Corporation. NW reports no disclosures.