Subthalamic nucleus deep brain stimulation (STN-DBS) is a safe and effective treatment of advanced Parkinson’s disease (PD), improving quality of life (QoL),1 motor and non-motor symptoms (NMS).2 However, considerable interindividual variability has been observed for postoperative QoL outcome, as approximately 43%–49% of patients do not experience clinically relevant QoL improvement after STN-DBS surgery.3–5 Several preoperative clinical predictors of greater postoperative QoL improvement have been reported: more severe non-motor total burden and motor impairment,6 and greater response of motor symptoms in the levodopa challenge test.7 However, these studies report general relationships between postoperative QoL outcome and predictor variables on a group level, but do not stratify QoL outcomes based on specific cut-offs of preoperative clinical parameters, which could be used for individual patients. Therefore, the findings of these studies are of limited use in clinical practice.
To investigate the probability of postoperative QoL improvement of individual subjects, we conducted a stratification analysis based on preoperative NMS total burden, severity of motor progression and motor response in the levodopa challenge test. Conceptually, a stratification scheme is considered clinically useful if it provides a high discriminatory ability of clinically meaningful postoperative QoL change and statistically feasible if it results in small within-stratum variation of QoL.8
As previous studies reported that non-motor aspects of PD predict postoperative QoL outcome better than motor aspects,5 9 we tested the hypothesis that preoperative non-motor total burden allows better stratification of postoperative QoL outcome than severity of motor progression and motor response in the levodopa challenge test.
Materials and methodsDesign and ethical approvalThis prospective, observational, international multicentre study in patients with advanced PD was conducted in four DBS centres (Cologne, Marburg, London and Manchester; GermanClinicalTrialsRegister: #6735).10–12 The study was carried out following the Declaration of Helsinki. All patients gave written and informed consent before participation.
PatientsPD was diagnosed according to the Movement Disorder Society (MDS) criteria.13 DBS treatment was performed in accordance to the MDS guidelines.14 DBS was not performed if clinically relevant cognitive and mood disorders were observed in multidisciplinary assessments including neuropsychological and neuropsychiatric specialist assessments.15 16
Clinical assessmentClinical assessments were conducted in the on-medication state before and in the on-medication and on-stimulation state at 6-month follow-up after DBS surgery:
Main outcome parameter:
The PDQuestionnaire (PDQ) is recommended by the Movement Disorders Society Scales Committee for QoL assessments in patients with PD.17 18 We used the 8-item version (PDQ-8), which contains the domains mobility, activities of daily living (ADL), emotional well-being, social support, cognition, communication, bodily discomfort and stigma.17 The abbreviated PDQ-8 correlates highly with the PDQ-39 and is commonly used in DBS studies.19–22 The results are reported as Summary Index (PDQ-8 SI) ranging from 0 (no impairment) to 100 (maximum impairment).
Outcomes used for stratification:
The Non-Motor Symptom Scale (NMSS) surveyed NMS in nine domains: cardiovascular, sleep/fatigue, mood/apathy, perceptual problems/hallucinations, attention/memory, gastrointestinal, urinary, sexual function and miscellaneous. The NMSS is commonly used in DBS studies, was assessed in the on-state and ranges from 0 (no impairment) to 360 points (maximum impairment).23–26 Based on published cut-offs, we stratified non-motor total burden as ‘mild to moderate’ (≤40), ‘severe’ (41–70) or ‘very severe’ (≥71).23 27
The Hoehn and Yahr (H&Y) Scale assessed the severity of motor symptom progression ranging from 1 to 528: 1 (unilateral involvement only usually with minimal or no functional disability), 2 (bilateral or midline involvement without impairment of balance), 3 (bilateral disease: mild to moderate disability with impaired postural reflexes; physically independent), 4 (severely disabling disease; still able to walk or stand unassisted) and 5 (confinement to bed or wheelchair unless aided). Based on published cut-offs for H&Y, we stratified the severity of motor progression as ‘minimal to mild’ (≤2), ‘moderate’ (3) or ‘severe’ (≥4).29
Motor response in the preoperative levodopa challenge test was quantified using the Unified PD Rating Scale-III (UPDRS-III), which ranges from 0 (no motor impairment) to 108 (maximum motor impairment). Off-medication state was assessed after long-acting dopamine agonists were halted for >72 and levodopa for >12 hours. Best on-medication state was assessed 45–60 min after administration of 250 mg levodopa.30 31 Based on published cut-offs, we stratified levodopa responses as ‘low’ (<33%), ‘partial’ (33%–50%), or ‘full’ (>50%).32
The resulting groups were termed (1) non-motor, (2) motor and (3) levodopa response strata, respectively.
Secondary outcome parameters:
ADL and motor complications were assessed with the Scales for Outcomes in PD-motor scale (SCOPA-M), which is an abbreviated form of the UPDRS and was used because of its time-efficiency.33 The two scales highly correlate in respective sections.34 The SCOPA-M ADL section ranges from 0 (no impairment) to 21 (maximum impairment) and the motor complications section ranges from 0 (no impairment) to 12 (maximum impairment).
Antiparkinsonian medication requirements were quantified as levodopa equivalent daily dose (LEDD) according to the Jost method.35
Statistical analysisShapiro-Wilk tests assessed the normality of distribution. We tested outcome changes from baseline to 6-month follow-up using Wilcoxon signed-rank tests and the Benjamini-Hochberg correction for multiple comparisons due to the use of multiple tests. We present adjusted p values at the 0.05 significance threshold throughout the manuscript. We calculate relative change from baseline to follow-up ((mean testbaseline–mean testfollow-up)/mean testbaseline×100) and quantified effect size with Cohen’s d.
We analysed the proportion of patients with a clinically meaningful QoL improvement using a distribution-based threshold (>1/2 SD of PDQ-8 SIbaseline).36–38 Previous studies have employed the same method to derive a distribution-based cut-off value for QoL improvement.5 39–41 Subsequently, we analysed postoperative PDQ-8 SI outcome changes of aforementioned non-motor (NMSS), motor (H&Y) and levodopa response strata.
We tested between-strata differences of baseline parameters and change scores using Kruskal-Wallis tests. We calculated change scores for clinical outcome parameters (change score=testbaseline–test6-month follow-up) and analysed longitudinal within-stratum outcome changes with Wilcoxon signed-rank tests. Benjamini-Hochberg correction accounted for multiple comparisons (multiple scales and three stratifications).
Furthermore, to assess the statistical feasibility of our stratification analysis, that is, the ability to identify homogenous groups of patients, we estimated PDQ-8 SI within-stratum variation based on its baseline SD. Moreover, to assess the clinical utility of our stratification analysis, we assessed discriminatory ability of clinically meaningful QoL outcomes by comparing cumulative distribution function curves, which provided information on the complete distribution of QoL changes in each stratum.
All statistical analyses were conducted with SPSS V.29.0.0.0.
ResultsWe included 175 patients (64 women) undergoing STN-DBS (see figure 1). In our cohort, the distribution-based threshold for clinically meaningful PDQ-8 SI improvement was 8.1 points. In four patients, baseline PDQ-8 SI was <8.1 points and thus by definition could not clinically relevantly improve at follow-up. Therefore, these four patients were excluded from further analyses.
Figure 1The flow chart describes the enrolment of patients. DBS, deep brain stimulation.
The remaining 171 patients were mean aged 61.2 years±10.7 with 10.5 years mean disease duration±4.9 (see Table 1). All main outcomes improved from baseline to follow-up (all p≤0.001). Effect sizes were ‘small’ for H&Y, ‘moderate’ for PDQ-8 SI, NMSS, UPDRS-III and SCOPA-M ADL and ‘large’ for motor complications and LEDD.
Table 1Demographic characteristics and clinical outcomes at baseline and change scores at 6-month follow-up
PDQ-8 SI change scores were available for 161 patients. We observed a considerable interindividual variability of PDQ-8 SI outcomes as 80/161 patients experienced a clinically relevant postoperative PDQ-8 SI improvement (49.4% ‘QoL responders’).
In the subsequent stratification analysis based on preoperative non-motor total burden, severity of motor progression, and levodopa response, the non-motor stratification resulted in the strata ‘mild to moderate’ in 56, ‘severe’ in 50 and ‘very severe’ in 60 patients. Motor progression stratification resulted in the strata ‘minimal to mild’ in 96, ‘moderate’ in 48 and ‘severe’ in 19 patients. The levodopa response stratification resulted in the response strata ‘low’ in 43, ‘partial’ in 55 and ‘full’ in 66 patients. The percentage of missing values was <5% for each scale and, therefore, required no imputations.42 The ‘low’ levodopa response stratum contained patients, in which the clinical indication for DBS treatment was case based on the following indications: medication-refractory tremor (38 %), troublesome dyskinesia (10 %) and on-off motor fluctuations (42 %) and others (10 %).
QoL at baseline in non-motor, motor and levodopa response strataAt baseline, PDQ-8 SI increased with more severe non-motor total burden, from ‘mild to moderate’ through ‘severe’ to ‘very severe’ (p<0.001, see table 2). In contrast, baseline PDQ-8 SI did not differ significantly between the motor or the levodopa challenge test response strata (p=0.105, respectively p=0.947).
Table 2Baseline clinical parameters in non-motor, motor and levodopa response strata
Online supplemental table 1 presents preoperative NMSS total, H&Y, levodopa challenge test response, UPDRS-III, SCOPA-M ADL and motor complications, and LEDD for each stratum in the non-motor, motor and levodopa response stratification. As expected, non-motor, motor and levodopa response stratification resulted in a significant difference in baseline NMSS total, H&Y and levodopa challenge test response in respective strata (all p<0.001). Notably, a difference in baseline H&Y was also observed in the non-motor stratification (p=0.041), a difference in baseline NMSS total in the motor progression stratification (p=0.020) and baseline motor complications differed in all three stratifications (all p<0.005).
In the non-motor strata, we observed between-strata differences for all baseline PDQ-8 domains (communication and bodily discomfort domains: p<0.001, emotional well-being, bodily discomfort and stigma: p<0.05) except for the social support domain (p=0.095), whereas motor progression stratification resulted in significant between-strata differences only in the mobility (p=0.031) and ADL (p=0.014) domains, and levodopa response stratification only in the bodily discomfort domain (p=0.026).
Change scores of clinical outcome parameters at 6-month follow-up in non-motor, motor and levodopa response strataAt 6-month follow-up, non-motor stratification resulted in between-strata differences of PDQ-8 SI change scores (p=0.003, see table 3), whereas we observed no significant between-strata differences in the motor or levodopa response stratification (both p>0.05). Post hoc comparisons showed that postoperative PDQ-8 SI changes of the ‘very severe’ non-motor stratum differed from the ‘mild-to-moderate’ (p=0.004) and the ‘severe’ (p=0.028) non-motor strata.
Table 3Changes of clinical parameters at 6-month follow-up in non-motor, motor and levodopa response stratification
Furthermore, explorative analyses showed that these between-strata differences were driven by the PDQ-8 domains cognition (p=0.033) and communication (p=0.007; see online supplemental figure 1). In contrast, no significant differences of PDQ-8 domain changes were observed between the motor and levodopa response strata. We also explored within-group PDQ-8 domain changes in the non-motor strata and observed that in the ‘very severe’ stratum all PDQ-8 domains improved (all p≤0.013) except for social support (p>0.05). In the ‘very severe’ non-motor stratum, effect sizes were ‘moderate’ in the ADL, cognition and bodily discomfort domains and ‘small’ in all other domains (see online supplemental table 2). In contrast, only two domains, mobility (p=0.028) and ADL (p=0.014), improved in the ‘severe’ non-motor stratum and only one domain, cognition (p=0.004), in the ‘mild-to-moderate’ non-motor stratum, and the effect sizes were generally smaller than in the ‘very severe’ non-motor stratum.
Online supplemental table 3 presents postoperative change scores of NMSS total, H&Y, UPDRS-III, SCOPA-M ADL and motor complications, and LEDD for each stratum in the non-motor, motor and levodopa response stratification. Non-motor and motor progression stratification resulted in significant between-strata differences of postoperative change scores of NMSS total, respectively H&Y Scale (both p<0.001). Notably, only non-motor stratification resulted in a significant between-strata difference of motor complication change scores (p=0.032). The postoperative improvement of motor complications increased from the ‘mild to moderate’ through the ‘severe’ to the ‘very severe’ non-motor stratum.
Within-stratum SDs of PDQ-8 SI at baseline and discriminatory ability of PDQ-8 SI changeIn the overall cohort, the SD of PDQ-8 SI was 16.1 points. Table 2 shows that non-motor stratification resulted in a reduction of within-stratum SD of baseline PDQ-8 SI (range: from 12.3 in the ‘mild-to-moderate’ stratum to 15.5 points in the ‘severe’ non-motor stratum). In contrast, levodopa response stratification reduced SD to a smaller degree (range: 14.1–15.8) and motor progression stratification did not reduce SD in all strata (range: 14.5–16.7). Therefore, non-motor stratification resulted in a greater reduction of SD of baseline PDQ-8 SI than motor and levodopa response stratifications. Compared with the overall cohort (16.1 points), the SD was reduced in all non-motor strata with a maximum reduction of 3.8 points (approximately 24%) in the ‘mild-to-moderate stratum’.
Furthermore, as illustrated in the cumulative distribution curves (see figure 2), in the non-motor stratification, approximately 62% of patients in the ‘very severe’ stratum experienced a clinically relevant PDQ-8 SI improvement compared with approximately 46% and 39% of patients in the ‘mild-to-moderate’ and ‘severe’ strata, respectively. In the motor progression stratification, approximately 56% of patients in the ‘severe’ stratum experienced a clinically relevant PDQ-8 SI improvement compared with approximately 48% in the ‘minimal-to-mild’ strata and approximately 49% in the ‘moderate’ strata. In the levodopa response stratification, the proportion of patients who experienced a clinically relevant PDQ-8 SI improvement ranged from approximately 49% in the ‘partial’ to approximately 50% in the ‘full’ and approximately 52% in the ‘low’ response strata. Therefore, the discriminatory ability for clinically relevant PDQ-8 SI improvement was greater in the non-motor stratification (approximately 23% difference between non-motor strata) than for motor progression and levodopa response stratification (approximately 8% and 3% difference between motor progression, respectively levodopa response strata).
Figure 2Cumulative distribution curves for patients stratified by Non-Motor Symptom (NMS) Scale total burden, Hoehn and Yahr motor stage and levodopa challenge test response.
DiscussionIn this prospective, observational, international multicentre study, we observed that preoperative non-motor total burden is a suitable stratification parameter for postoperative QoL outcome. Specifically, non-motor stratification was clinically useful because of its ability to discriminate QoL responders and QoL non-responders as the absolute risk of QoL non-response was reduced by 24% from the ‘very severe’ to ‘severe’ non-motor stratum. In contrast, the clinical usefulness of motor progression and levodopa response stratifications of QoL outcome was considerably smaller (8%, respectively 3% between-strata absolute risk reduction for QoL non-response). Moreover, in contrast to motor progression and levodopa response stratification, non-motor stratification reduced within-stratum variation of PDQ-8 SI compared with the overall cohort, which demonstrates its statistic feasibility for the identification of homogenous groups of patients.
Clinical efficacy of subthalamic stimulation in the overall cohortIn line with the previous literature, STN-DBS improved QoL, non-motor total burden and motor aspects of PD, and reduced dopaminergic medication requirements.1 The 46.2% mean preoperative levodopa challenge test response was well within the range of previous DBS studies.1 43 We confirm a considerable interindividual variability of postoperative QoL outcome as approximately 50% of our cohort experienced a clinically relevant postoperative QoL improvement, which is in line with previous studies.3–5
QoL impairments in non-motor, motor progression and levodopa response strataWe observed that non-motor stratification resulted in between-strata differences of baseline PDQ-8 SI as patients with more severe non-motor total burden expressed worse preoperative QoL. Data from the Earlystim trial and other cohorts indicates that patients with more severe baseline QoL impairment experience greater postoperative QoL improvement4 39 and our study demonstrates that worse preoperative non-motor total burden, not severity of motor progression or motor response in the levodopa challenge test, is associated with greater preoperative QoL impairment.
A meta-analysis of 22 studies showed that greater preoperative levodopa challenge test response is associated with better postoperative outcome of UPDRS subscales for ADL and motor examination.44 The relationship between the preoperative levodopa challenge test response and postoperative QoL outcome is less clear. We confirm the findings of an analysis of two randomised, controlled studies, which reported similar QoL improvements between patients with low, partial and full preoperative levodopa challenge test response.32
Indication evaluations for deep brain stimulation: levodopa challenge test response matters, but only as a supportive criterion for the diagnosis of PDA recent survey among 207 movement disorders experts from 59 countries reports that a minimum levodopa challenge test response of >50% or >33% is required in 15.2%, respectively 37.3% of respondents.45 This finding is surprising as there is no hard evidence that patients with a clinically established diagnosis of PD and <33% motor response in the levodopa challenge test per se experience negative outcomes following DBS.46 In light of the emerging evidence from several cohorts that postoperative QoL improvements are similar across levodopa response strata,32 the application of an absolute cut-off of levodopa challenge test response likely denies access to DBS treatment for patients with PD and low levodopa response (26.2% of our cohort). The partial levodopa response threshold implemented in some centres would have denied DBS treatment in 59.8% of our cohort.
The cut-off of >33% levodopa challenge test response supports the diagnosis of idiopathic PD as its positive predictive value for a clinical diagnosis of PD is 88.6%.47 In this context, we advocate applying the Movement Disorders Society (MDS) Clinical Diagnostic Criteria for PD as a first step in a case-based approach in DBS indication evaluations.13 The MDS criteria open up the possibility to clinically establish diagnosis of PD, even if patients show <30% motor response in the levodopa challenge test, which can occur, for example, due to gastrointestinal barriers to levodopa transport and absorption. Within the framework of these MDS criteria, the clinically established diagnosis of PD requires the absence of absolute exclusion criteria and red flags and, in patients with <30% levodopa response, the presence of at least two of the following criteria: (1) rest tremor in one limb, which may or may not be medication refractory, (2) troublesome dyskinesia, (3) unequivocal and marked on-off fluctuations, which must have at some point in time included predictable end-of-dose wearing-off and (4) positive results from at least one ancillary test for idiopathic PD with a specificity of 80% or higher. Once PD has been diagnosed according to the MDS criteria, we advocate an individualised case-based evaluation of the expected DBS outcome and benefit. This second step should include, but not be limited to, an assessment of QoL, non-motor and motor aspects of PD, a consideration of antiparkinsonian medication regimens, effects and side effects, an assessment of comorbidities and a careful reflection on the psychosocial and occupational situation of patients. Patients with PD who experience worse preoperative QoL impairments and worse non-motor total burden experience greater postoperative QoL improvements and, closely connected to this point, have a higher chance of experiencing a clinically relevant postoperative QoL improvement.
Toward a case-based approach in indication evaluations for deep brain stimulation for PD including non-motor assessmentsThere is a growing body of evidence which demonstrates the beneficial non-motor effects of STN-DBS: Data from randomised, controlled studies provides class I evidence for beneficial effects up of STN-DBS for up to 2 years on several non-motor aspects of PD, such as anxiety, depression, hyperdopaminergic behavioural disorders, pain/bodily discomfort or urinary symptoms.48–51 Moreover, class IIb evidence shows that non-motor total burden improves from baseline up to 3-year follow-up, which also correlates with an improvement of QoL.52 In the current study, we demonstrate that preoperative non-motor total burden is a clinically useful and statistically feasible stratification parameter for postoperative QoL response and that neither preoperative levodopa challenge test response nor preoperative severity of motor progression demonstrated these characteristics in our cohort.
Moreover, we observed between-strata differences of baseline motor complications in non-motor, motor progression and levodopa response stratifications. However, motor complication change scores were only different between non-motor strata as motor complication improvements were greater with worse preoperative non-motor total burden. Stratifications based on motor aspects of PD (severity of motor progression and levodopa response) were not suited to stratify postoperative motor complications outcome.
Continuing the longstanding tradition of clinical research on the refinement of selection criteria for DBS, we advocate that greater weight should be given to the non-motor aspects of PD. In the context of DBS indication evaluations, levodopa challenge test response should be viewed as a supportive criterion for the diagnosis of PD, not as an absolute exclusion criterion for DBS treatment when specific cut-offs of >33% or >50% are not achieved in individual patients. In our view, in DBS indication evaluations, a case-based, holistic approach may be considered best practice and should also include assessments of non-motor aspects of PD.
LimitationsIn particular, in the ‘severe’ motor progression stratum (n=19), stratification analysis resulted in a small stratum size, whereas all other non-motor, motor progression and levodopa response strata were larger (n>40) and the sample size of the overall cohort was larger than in most DBS studies.1 51 Studies with longer follow-up periods are needed for the stratification of long-term postoperative QoL outcome. Further studies in other target regions, for example, the Globus Pallidus internus, are needed. Another limitation of our stratification analysis is the use of global non-motor, motor progression and QoL scales (NMSS, H&Y, UPDRS-III response in the in a levodopa challenge test, and PDQ-8). Specific motor and NMS or QoL domains may drive the global effects reported in the present study and further work is needed to identify these aspects in detail. However, the scales we used in the present study are recommended by the MDS clinical outcomes committee and are commonly used for the assessment of non-motor and motor aspects of PD,17 28 53 54 and we were interested specifically in global disturbances of NMS and motor symptoms, and QoL.
ConclusionWe report that preoperative non-motor total burden, not severity of motor progression or levodopa challenge test response, is clinically useful and statistically feasible to stratify postoperative QoL outcome in patients undergoing STN-DBS for PD. Our findings highlight that, besides the traditional preoperative assessments of motor symptoms of PD including levodopa challenge test response, examinations of QoL disturbances and non-motor total burden are necessary components for a case-based, holistic approach in indication evaluations for DBS.
Comments (0)