Stroke results in neural damage to the brain and subsequent physical, cognitive, and emotional deficits (Dimyan & Cohen, 2011; Meyer et al., 2015; Ward, 2017). Globally, there are over 13.7 million new strokes each year, and stroke is a leading cause of adult long-term disability (Benjamin et al., 2018; Johnson et al., 2019; Lindsay et al., 2019). Since stroke is a vascular disease, there is vast heterogeneity in both brain and behavioral changes across patients, posing a significant challenge to the development of effective stroke neurorehabilitation strategies (Bernhardt et al., 2016). The recovery process also contributes to additional interindividual variability in the time course and extent of the resolution of neural damage and behavioral deficits.
Given the heterogeneity in both injury and recovery, large samples of diverse patients are needed to accurately study processes related to and supporting stroke recovery. However, a recent systematic review suggested that most studies seeking to determine the best predictors of stroke recovery are underpowered (Kim & Winstein, 2017). This is in part because acquiring large datasets of this nature is logistically and financially prohibitive for a single stroke research site. Recognizing these challenges, the Stroke Recovery and Rehabilitation Roundtable (SRRR) task force developed consensus statements on how to move stroke recovery research forward (Bernhardt et al., 2016; Bernhardt et al., 2019). Two key priority areas include: (a) a better understanding of the neurobiology of spontaneous and treatment-dependent recovery in humans; and (b) characterizing different stroke recovery phenotypes (Bernhardt et al., 2017), with a long-term goal of identifying potential biomarkers that predict stroke recovery (Boyd et al., 2017). Here, we describe the steps that the Enhancing Neuroimaging Genetics through Meta-Analysis (ENIGMA) Stroke Recovery working group has taken to address these priorities using a unique, big data approach with an emphasis on reliable and reproducible methods.
The ENIGMA worldwide consortium consists of over 1,400 researchers from 45 countries who have come together to form 26 different disease working groups as well as 24 additional working groups to study healthy human variation and develop novel, robust methods for neuroimaging and genetic meta-analyses (Thompson, 2019; Thompson et al., 2019; Thompson et al., 2020). Formed in 2009, ENIGMA has resulted in significant, large-scale contributions on the neural correlates of major depression (Schmaal et al., 2016), schizophrenia (Van Erp et al., 2018), bipolar disorder (Hibar et al., 2016), obsessive–compulsive disorder (Boedhoe et al., 2016), and epilepsy (Whelan et al., 2018). ENIGMA methods, discussed in this article, have also resulted in some of the largest studies of genetic correlates of specific brain characteristics, such as genetic variations determining subcortical, hippocampal, and intracranial brain volumes (Grasby et al., 2020; Hibar et al., 2015; Stein et al., 2012). In order to analyze multisite data in these large studies, ENIGMA typically uses a meta-analytic approach in which analysis methods, which are tested for reliability across diverse research sites, are openly shared. Participating research sites can analyze their brain magnetic resonance imaging (MRI) and genetic data locally and share either their site results (meta-analysis) or individual participant results (mega-analysis) with the rest of the working group (Zugman et al., 2020). In this way, many research sites can be brought together to meta- and mega-analyze retrospective MRI and genetic data collected across different research sites around the world. The developed methods perform consistently despite high variability in MRI scanner manufacturers, scanner strengths, and pulse sequences, and the same methods can be applied across different disease working groups, allowing for cross-disorder comparisons despite using different acquisition protocols (Boedhoe et al., 2019).
The ENIGMA Stroke Recovery working group is one of the 26 disease working groups within the ENIGMA consortium. Initially developed by a small group of researchers in 2015, ENIGMA Stroke Recovery has grown to over 100 members and has amassed a dataset of 2,137 MRI scans and counting from 39 research studies across 10 countries (see Table 1 for a summary of the number of scans by institution and geographical region at the time of publication). A primary goal of ENIGMA Stroke Recovery is to provide a reliable infrastructure for the collection and analysis of large, diverse datasets of poststroke brain MRI and behavioral data across research laboratories worldwide. The focus of the ENIGMA Stroke Recovery working group was initially on understanding the neural correlations of poststroke sensorimotor performance, which is affected in up to 80% of individuals after stroke (Rathore, Hinn, Cooper, Tyroler, & Rosamond, 2002). However, this focus has diversified as new members have joined. ENIGMA Stroke Recovery currently includes measures of sensorimotor, cognitive, emotional, and psychosocial behavior. Both ischemic and hemorrhagic stroke are included in the database, along with demographic, clinical and bedside diagnostic information. ENIGMA Stroke Recovery uses pipelines developed by several of the ENIGMA methods working groups (for a recent review, see Thompson et al., 2020, as well as other widely used software packages (Fischl et al., 2002; Fischl & Dale, 2000; Smith et al., 2004; Smith et al., 2007). Given the diversity and the unique challenges associated with stroke-specific brain imaging data, discussed further in the article, the ENIGMA Stroke Recovery working group also requests raw MRI data when available. The raw data are processed by ENIGMA Stroke Recovery research staffs who perform additional quality control and lesion segmentation steps. Reliable and robust systems for data collection and maintenance are thus critical for the success of this collaborative approach.
TABLE 1. Number of T1-weighted MRI scans by geographical region/institute Country City Institute Number of scans Australia Melbourne The Florey Institute of Neuroscience and Mental Health 317 Brazil Sao Paolo University of Sao Paolo 28 Brazil Sao Paolo Albert Einstein Israeli Hospital 15 Canada Toronto University of Toronto/Sunnybrook Research Institute 29 China Tianjin Tianjin Medical University General Hospital 65 Germany Griefswald University of Griefswald 68 Germany Tübingen University of Tübingen 175 Italy Rome IRCCS Santa Lucia Foundation 192 New Zealand Auckland University of Auckland 104 Norway Oslo University of Oslo 265 UK London University College London 50 USA Atlanta Emory University 110 USA Charleston Medical University of South Carolina 174 USA College Park University of Maryland 128 USA Irvine University of California, Irvine 191 USA Los Angeles University of Southern California 189 USA Philadelphia University of the Sciences 37 Total scans 2,137 Note: The total number of T1-weighted MRI scans (N = 2,137) includes data from both individuals with stroke (n = 1,918, or 89.8% of the total dataset) and healthy individuals (n = 219, or 10.2% of the total dataset). A subset of the scans also includes repeated MRIs from the same individual (e.g., longitudinal data; n = 672 scans, or 31.4% of the total dataset).The ENIGMA Stroke Recovery database can be used for many different purposes. First, the data can be used to test whether brain–behavior relationships identified in smaller, more homogeneous samples also exist in larger, more diverse samples. A specific example might be examining whether individuals with left hemisphere stroke show better or worse outcomes than those with right hemisphere stroke, as reports from the literature on this topic are inconsistent (Beuscher et al., 2017; Macciocchi, Diamond, Alves, & Mertz, 1998; Ween, Alexander, D'Esposito, & Roberts, 1996; Wu et al., 2015). Second, these data can be used to identify characteristics, such as specific lesion locations that affect stroke outcomes of interest. For example, the data could be used to identify whether depression is more common following a stroke that impacts the left frontal cortex, or whether injury to specific cortical regions, such as the right dorsal premotor region, is associated with differences in functional outcomes. The data could also be used in a voxel-wise analysis to identify voxels (i.e., three-dimensional pixels) in the poststroke brain that are related to a specific behavior, such as spasticity, mild cognitive impairment or poststroke fatigue. This voxel-wise whole-brain approach provides a granular way to examine entire vascular territories, not limited by specific anatomical boundaries. A third use of the data is in generating new data-driven hypotheses using machine-learning methods. The large size of the dataset makes it uniquely powered for machine-learning methods, which require big datasets to train and test computer algorithms to identify patterns. In particular, this dataset is well suited for unsupervised machine-learning techniques, which can be used to identify clusters, or subgroups, of people who show similar patterns of recovery, and then examine what brain traits are common within those specific subgroups. Notably, as all of the data provided in ENIGMA Stroke Recovery is collected for specific research studies, the secondary use of these data in ENIGMA Stroke Recovery reduces research waste and improves the efficiency and speed with which we can test the reproducibility of existing research findings, identify novel brain–behavior relationships, and generate data-driven hypotheses.
In this article, we provide a comprehensive description of the ENIGMA Stroke Recovery working group's approach to large-scale analyses of multisite retrospective poststroke brain MRI and behavioral data for the broad neuroscience community. Specifically, this article will discuss methods and protocols developed for: (a) data intake, (b) data harmonization, and (c) lesion analysis. In each section, we also highlight future directions and recommendations to facilitate collaborative data sharing (see also Box ).
2 DATA INTAKE 2.1 Data componentsKey components for the initial ENIGMA Stroke Recovery analyses rely on a T1-weighted (T1w) anatomical brain MR image and at least one poststroke behavioral outcome measure. These two elements comprise the minimum data required to join ENIGMA Stroke Recovery. Demographic data are also strongly requested, although not required. A summary of all data components is included in Table 2. Each component is described in more depth below.
TABLE 2. Data elements collected by the ENIGMA Stroke Recovery working group MRI Behavior Demographics RequiredT1-weighted structural MRI
Scanner strength, brand, and model
Alternative if unable to share raw T1 MRIs:
A spreadsheet with FreeSurfer cortical and subcortical measurements, quality control 2D image slices, and lesion masks (registered to a standardized template)
At least 1 behavioral outcome measure
Most common measures at time of publication:
Fugl-Meyer Assessment (72%) NIH Stroke Scale (19%) Motor Activity Log (16%) Modified Ashworth (12%) Action Research Arm Test (11%) Wolf Motor Function Test (9%)None required
Most common demographics at time of publication:
Age Sex Time since stroke/last known well (in days) Lesioned hemisphere Recommended FLAIR Diffusion MRI Resting-state fMRI Lesion masks Longitudinal scans EEGWe suggest collecting measures recommended by the Stroke Recovery and Rehabilitation Roundtable (Bernhardt et al., 2017)
Current recommendations for sensorimotor outcomes can be found in Kwakkel et al. (2017)
Current recommendations for cognitive outcomes can be found in McDonald et al. (2019)
First stroke or multiple strokes Race/ethnicity Hand dominance prior to stroke Therapy received (hours per week) Risk factors for cardiovascular disease (e.g., hypertension, obesity, diabetes smoking) Dementia status Comorbidities Note: The data elements are divided into three main components: MRI, behavioral measures, and demographic data, and further separated into required versus recommended elements. 2.1.1 Magnetic resonance imagingThe high-resolution T1w MRI is used to quantify anatomical variation in regions across the brain, which is then related to the behavioral measures. We use a T1w MRI instead of other MR sequences, which might be more sensitive to the stroke lesion acutely, because high-resolution T1w MRIs are routinely collected during research studies and can be used with FreeSurfer, a software used by all ENIGMA working groups for structural MRI analysis. FreeSurfer performs automated cortical and subcortical brain segmentation (Fischl et al., 2002; Fischl & Dale, 2000) and has reliable performance across different scanner characteristics and T1w acquisition parameters (Han et al., 2006). However, we acknowledge that the T1w MRI is not the best choice for identifying the lesion volume in acute and subacute patients. Therefore, we also collect additional MRI data that can be received and combined for analysis, such as diffusion MRI, T2-weighted fluid-attenuated inversion recovery (FLAIR), as well as other modalities of interest, such as resting-state functional MRI (fMRI).
If raw MRI data cannot be shared, research sites can choose to analyze the data locally using processing scripts found on the ENIGMA website (http://enigma.ini.usc.edu/protocols/); results can then be sent for subsequent analyses (outputs include a spreadsheet with measures of cortical thickness, cortical surface area, and subcortical volumes following FreeSurfer segmentation). However, the ENIGMA Stroke Recovery group has the complicating issue of lesions within the data (see Section 4). Therefore, more stringent quality control, as well as delineation of stroke lesion boundaries, is required for more fine-grained analyses of the lesion's impact on subsequent brain structure and behavior. For this reason, sites not sending raw data are requested to send the FreeSurfer segmentation measures, along with quality control images generated by our in-house code (see Section 2.2 for more detail), and lesion masks normalized to a standard template brain when possible.
2.1.2 Behavioral dataENIGMA Stroke Recovery collects many different types of behavioral data. As mentioned previously, the focus of the initial research question is on the neural correlates of poststroke sensorimotor behavior. To this end, over 80% of the dataset includes measures of sensorimotor performance. However, the growth and diversification of the ENIGMA Stroke Recovery group has led to the collection of additional types of behavioral data, including measures of cognition, mood, dysphagia, and psychosocial well-being (see Section 3 for more information).
2.1.3 Demographic dataDemographic information on the participants is also collected, such as age, sex, time from last known well to imaging, type of stroke, number of previous strokes, stroke location, risk factors, and comorbidities (see Section 3 for more information). There are no exclusionary demographic criteria for entry into the database. There is growing interest in characterizing rehabilitation history, although this is an extremely complex variable to harmonize across research sites and encode. In general, the reported demographic variables can be included as covariates or inclusion/exclusion criteria for specific analyses.
2.2 Data intake workflowWhen a research site joins ENIGMA Stroke Recovery, the site securely transfers de-identified MRI data as well as a comma-separated values (CSV) spreadsheet with all demographic and behavioral outcomes. If the raw MRI data are not available or cannot be shared, the site can run analyses scripts locally (https://github.com/npnl/ENIGMA-Wrapper-Scripts) and send FreeSurfer results in a CSV file along with the behavioral/demographic CSV file. Transfer can be accomplished via a secure file transfer protocol to an ENIGMA Stroke Recovery dedicated Linux server, or by the research site's preferred secure transfer method (e.g., using Box). The complete process, from data intake to data analysis, is displayed in Figure 1 and described in brief below. The scripts and code developed for these processes are freely and publicly available on Github (https://www.github.com/npnl).
ENIGMA Stroke Recovery workflow. Workflow for ENIGMA Stroke Recovery from data intake to data analysis
First, the received data are manually inspected to ensure: (a) good MRI data quality, and (b) appropriate values for the behavioral data. For MRI data quality, we visually inspect the images to ensure there are not large motion artifacts or other visible sources of noise in the data. Data that are of poor quality are excluded from the database. More recently, to standardize this process, we have begun to use the University of Southern California's Laboratory of Neuroimaging (LONI) Quality Control system, which is a freely available, semiautomated, web-based system for quantitatively evaluating MRI image quality (Kim et al., 2019). We ensure that the behavioral data are appropriately coded according to our ENIGMA Stroke Recovery database conventions. For instance, we convert and record time since stroke in units of days and denote the lesioned hemisphere by an integer value where left = 1, right = 2, both = 3, and other = 4. The behavioral data are also inspected for values within the normal range for each measure. For the initial analysis of sensorimotor outcomes, a “primary” sensorimotor outcome that most closely aligns with the existing ENIGMA dataset is selected (see Section 3 for details). Altogether, these steps ensure that the incoming data are consistent with the existing database for subsequent analysis.
Next, we write a site-specific script to reformat the data to conform to the Brain Imaging Data Structure (BIDS; https://bids.neuroimaging.io/) (Gorgolewski et al., 2016). BIDS provides a standardized way to organize and describe neuroimaging and behavioral data. If the data are not consistently named and formatted within each site, we manually reformat the data to conform to the BIDS standard. Having all of the data in BIDS format allows us to quickly and easily analyze ENIGMA Stroke Recovery data using a variety of software tools, many of which are written with the assumption that the data are in BIDS format (Gorgolewski et al., 2017). It also allows all of the code and scripts created for ENIGMA Stroke Recovery analyses to be easily used by others who use BIDS format, thereby increasing the transparency, reproducibility, and impact of this work.
Third, we run FreeSurfer, a brain imaging software package developed to analyze MRI scans, which segments the brain into anatomical regions for morphometric analysis (Fischl et al., 2002). We run two versions of FreeSurfer (version 5.3 and version 6.0) on the MRI data using scripts that automate the processing of all participants in parallel on the LONI computing cluster. FreeSurfer version 5.3 is currently the primary analysis version, which is used to maintain consistency with recent ENIGMA working group analyses. We also analyze all raw data using FreeSurfer version 6.0 because this more recent version has been shown to have improved performance and more accurate brain segmentations compared to FreeSurfer version 5.3. In addition, FreeSurfer version 6.0 also provides improved hippocampal segmentations and analysis of hippocampal subfields in stroke (Khlif et al., 2018), which is an area of research interest given recent findings from our ENIGMA Stroke Recovery working group showing a relationship between poststroke sensorimotor behavior and hippocampal volumes (Zavaliangos-Petropulu et al., 2019).
We then use the ENIGMA scripts mentioned previously to extract standard measures of subcortical volume (from eight subcortical regions of interest, bilaterally), and cortical thickness and cortical surface area (from 34 cortical regions of interest, bilaterally; https://github.com/npnl/ENIGMA-Wrapper-Scripts). All regions are from the standard Desikan-Killany atlas implemented in FreeSurfer (Desikan et al., 2006). These measures provide insight into brain morphometry for both subcortical and cortical regions. ENIGMA scripts are used to generate images of the overlap between each segmented region of interest and the underlying brain and put these into a webpage that can be visually inspected for segmentation accuracy. This is done for each brain. Although FreeSurfer has shown acceptable poststroke brain segmentation performance (Li et al., 2015), it may occasionally fail to create accurate segmentations on lower quality data (e.g., MRIs with lower spatial resolution or greater noise from movement) or on data with lesions that disrupt its surface-based algorithms. In our ENIGMA Stroke Recovery dataset, FreeSurfer fails to produce accurate segmentations in 10–20% of the data. To address this, trained researchers manually inspect each region of interest for each brain MRI. Any regions that are not properly segmented by FreeSurfer are marked as “failed” and excluded from the analyses (see Appendix 1 for additional information about the ENIGMA Stroke Recovery FreeSurfer quality control protocol).
Finally, we use an in-house script that pulls all of each site's FreeSurfer outputs and behavioral data and enters them into a relational database using Structured Query Language (SQL). Data are entered into a SQLite database (https://sqlite.org), a free, relational database engine that allows for simple and intuitive data storage using the SQL language (Owens, 2006). Key benefits of a SQL-based environment—compared to other database options, such as REDCap (Harris et al., 2009) (https://www.project-redcap.org/)—are that it is freely available (not institutionally constrained) and widely used across both research and industry, leading to many options for integrating SQL analyses with popular programming environments such as Python, and statistical packages such as R. SQLite runs quickly and does not require specialized computing resources, and SQLite databases can be easily shared. SQL-based databases are also being used in other large-scale data sharing projects, such as the NIH-funded All of Us research initiative, which aims to gather health-related data about one million or more people living in the United States (Klann, Joss, Embree, & Murphy, 2019). Given these factors, SQLite provides an accessible option for scalable data analysis and for future data sharing and is aligned with existing big data initiatives. Within the SQLite database, there are seven tables that hold information about demographics, behavioral measures, brain measures (two tables, one for each of the FreeSurfer analysis versions), MR scanner information, research site information, and metadata about the database itself. This database is queried to extract the specific data used in each analysis.
2.3 Future directions for improving data intakeA key future priority for updating the data intake process is using artificial intelligence to improve the manual quality control process, capitalizing on the efforts from other ENIGMA working groups (Petrov et al., 2017; Petrov et al., 2018). Currently, performing visual quality control of each segmented FreeSurfer region is a significant bottleneck in our data intake pipeline. Each individual brain has a total of 84 subcortical and cortical regions that should be inspected by a trained investigator, which takes on average about 20 min per brain; performing quality control on so many regions manually is not easy to scale up. In addition, visual quality control is an inherently subjective process. Investigators are intensively trained to ensure good inter-rater reliability, but there is still the potential for human error. Machine-learning algorithms to reliably perform quality control of the segmentations would remove an enormous barrier to performing this work. To facilitate the development of an automated quality control process, we have manually generated ratings on the subcortical volumes of over 1,000 poststroke brain MRIs to date. We anticipate that these data can be used to train and test deep learning algorithms (such as convolutional neural networks) to identify failures in segmentation.
3 DATA HARMONIZATIONA second key challenge that ENIGMA Stroke Recovery faces is harmonizing data collected using different MRI scanners and scanning protocols, as well as using different behavioral outcome measures. Data harmonization refers to all efforts to combine data collected across different sources and with different formats, naming conventions, and measures into one cohesive dataset. Optimizing data harmonization is important because MRI data collected using different scanners and/or scanning protocols can significantly affect analyses and results [e.g., (Zavaliangos-Petropulu et al., 2019)]. For behavioral data, different measures may focus on measuring specific aspects of sensorimotor impairment, function, or quality, and maybe more or less sensitive to a certain range of performance values. Importantly, the harmonization of methods also provides an opportunity to explore the effects of inter-site differences on both MRI and behavioral data analysis, which is difficult to examine otherwise. Here, we describe efforts to address each of these challenges so that we can combine imaging scans and behavioral data obtained at different centers.
3.1 MRI harmonizationENIGMA working groups have developed robust methods for analyzing T1w anatomical MRIs, diffusion MRI, and resting-state fMRI in a way that is reproducible and reliable across multiple research sites (Acheson et al., 2017; Jahanshad et al., 2013; Pizzagalli et al., 2019). While the scans themselves are not directly combined, a number of specific metrics are extracted from each type of imaging sequence so that data across centers can be used in mega- and meta-analyses. The key measures extracted for each sequence are described below. In addition, we provide guidelines for quality control and harmonization in order to ensure that all segmentations are representative of the desired anatomy. In this way, even if the scans themselves are not combined, statistical approaches for combining these extracted features in a meta-analytical or mega-analytical manner can be taken. Detailed instructions and code for analysis and quality control for each method may be found on the ENIGMA website ( http://enigma.ini.usc.edu/protocols/imaging-protocols/).
3.1.1 T1-weighted MRIAs noted previously, a T1w MRI is a key component of the initial ENIGMA Stroke Recovery analyses. T1w MRIs are processed using ENIGMA's structural image processing protocols (publicly available in the link above), which include several methods for analyzing T1w MRIs including cortical and subcortical volume and surface area, sulcal geometry analysis, and vertex-wise subcortical shape analysis. The ENIGMA structural protocols all utilize outputs from FreeSurfer, a brain imaging software package developed to analyze MRI scans of brain tissue, which segments and labels neuroanatomical structures in the data (Fischl et al., 2002). Importantly, FreeSurfer is reliable across research sites and demonstrates good test–retest reliability across scanner manufacturers and field strengths (Han et al., 2006). In addition, as mentioned above in Data Intake (Section 2.2), we perform manual quality control on all FreeSurfer segmentations, so that subsequent structural analyses use only quality-controlled inputs. Trained researchers manually inspect each region of interest for each structural brain MRI. Regions that are not properly segmented by FreeSurfer are marked as “failed” and excluded from further analysis (see Appendix 1 for the full ENIGMA Stroke Recovery FreeSurfer quality control protocol).
3.1.2 Diffusion MRIThe ENIGMA-DTI protocol is used to study fractional anisotropy and diffusion tensor imaging-derived diffusivity measures of whole brain and atlas-defined regions of interest in the white matter, based on the tract-based spatial statistics method implemented in FMRIB Software Library (Smith et al., 2007). While preprocessing diffusion MRI can vary by data quality, the ENIGMA-DTI protocol provides suggestions including steps for motion correction, echo-planar imaging distortion correction, and tensor fitting. The ENIGMA-DTI protocol can be found on the ENIGMA website ( http://enigma.ini.usc.edu/protocols/dti-protocols/) and is detailed elsewhere (Jahanshad et al., 2013). This protocol has been shown to have excellent reproducibility between scanners for the analysis of white matter microstructure (Acheson et al., 2017). The ENIGMA-DTI working group also works on methods for harmonizing multisite diffusion MRI (Zhu, Moyer, Nir, Thompson, & Jahanshad, 2018), and the ENIGMA Stroke Recovery group will use recently recommended mega-analytic methods for diffusion MRI analyses (Boedhoe et al., 2019).
3.1.3 Resting-state fMRIResting-state fMRI (rs-fMRI) offers an approach to understand patterns of synchronized brain activity in the resting state, which can further be decomposed into networks with known functions (e.g., default mode, salience, attention networks) (Biswal et al., 2010; Biswal, Yetkin, Haughton, & Hyde, 1995). Harmonized processing of rs-fMRI in ENIGMA has used one of two pipelines: (a) an echo-planar imaging-based pipeline, based on the Analysis of Functional NeuroImages software, which does not require the use of a co-registered anatomical MRI dataset (Adhikari et al., 2018; Adhikari et al., 2019; Adhikari et al., 2019); and (b) a pipeline known as fMRIprep+, based on the fMRIprep approach (Esteban et al., 2019), which can also be used for the analysis of multisite task-based fMRI (Veer, Waller, Lett, Erk, & Walter, 2019).
3.1.4 Future Directions in Brain Imaging HarmonizationFuture directions include the harmonization of stroke electroencephalography (EEG) data, as the ENIGMA EEG working group is currently developing methods for analyzing resting-state EEG (Smit, 2020; Smit et al., 2016). EEG has garnered growing attention in stroke rehabilitation over the years due to its portability, safety, and lower cost, compared to MRI or fMRI. Advancing our capability to use EEG was also identified as a developmental priority in the SRRR taskforce on biomarkers (Boyd et al., 2017). EEG measures, such as hemispheric asymmetry or frontoparietal coherence, have also been related to and/or predictive of stroke outcomes (Ramos-Murguialday et al., 2013; Zhou et al., 2018).
Finally, although all our existing MRI pipelines offer methods for handling previously collected data, the ideal scenario is a harmonized prospective data collection. We recommend that individual research groups consider using MRI sequences that match the publicly available sequences from the NIH-funded Lifespan Human Connectome Project Aging group https://www.humanconnectome.org/study/hcp-lifespan-aging/project-protocol/imaging-protocols-hcp-aging) or the NIH-funded Alzheimer's Disease Neuroimaging Initiative (ADNI; (http://adni.loni.usc.edu/methods/mri-tool/mri-analysis/). Doing so would expand the reach of one's individual research data by allowing for comparison with large, existing datasets, and help align prospective scanning efforts in ENIGMA Stroke Recovery with that of other groups. Future work may also try to align the ENIGMA diffusion MRI and rs-fMRI data processing protocols with the UK Biobank imaging efforts, which aim to provide multimodal brain imaging data on 100,000 individuals living in the United Kingdom (Alfaro-Almagro et al., 2018).
3.2 Behavioral and demographic data harmonization 3.2.1 Behavioral dataStroke researchers test a broad range of hypotheses, and different behavioral outcomes are selected to address specific hypotheses. In addition, the same behavior may be assessed using multiple measures. Thus, although many researchers in the ENIGMA Stroke Recovery working group study poststroke arm performance, the current ENIGMA Stroke Recovery database has more than 75 unique behavioral measures. This large number of behavioral measures is in line with a study reporting that 144 different outcome measures were used to study poststroke arm rehabilitation across 243 clinical trials (Duncan Millar, van Wijck, Pollock, & Ali, 2019).
In order to organize the many different types of behavioral data, ENIGMA Stroke Recovery relies on the International Classification of Functioning, Disability and Health (ICF) (World Health Organization, 2001), which provides a framework that can be used to categorize different assessments of poststroke outcomes. The ICF model has three levels that can be used to conceptualize behavioral measurements about a person: (a) body functions and structures (measuring the person's impairment), (b) activities (measuring function at the level of the person), and (c) participation (measuring function of the person as a member of society). The ICF framework is used by the SRRR task force to categorize core measures for all stroke recovery trials to collect (Kwakkel et al., 2017). Specifically, the SRRR recommends collecting the Fugl-Meyer Assessment (FMA) (Fugl-Meyer, Jaasko, Leyman, Olsson, & Steglind, 1975) to measure body function and structure for the upper and lower limbs, and the Action Research Arm Test (ARAT) (Lyle, 1981) and a 10-min walk test to measure activity limitations in the upper and lower limbs. Although there was no consensus recommendation on participation, there were recommendations to collect the EuroQoL 5 dimension scale (EQ-5D) as a measure of quality of life (Brooks & Group, 1996) and the modified Rankin Scale as a measure of global disability (Van Swieten, Koudstaal, Visser, Schouten, & Van Gijn, 1988). Finally, the National Institutes of Health Stroke Scale (NIHSS) (Brott et al., 1989) was recommended to measure stroke severity (i.e., global impairment across multiple domains), but not as an outcome measure.
These ICF-based and SRRR-recommended measures are well represented in the ENIGMA Stroke Recovery database. The most common measures include (listed in order of frequency in the database with percent frequency in parentheses): (a) FMA-UE (72%), (b) NIHSS (19%), (c) Motor Activity Log (MAL; 16%,) (Uswatte, Taub, Morris, Vignolo, & McCulloch, 2005), (d) modified Ashworth measure of spasticity (12%) (Bohannon & Smith, 1987), (e) ARAT (11%), and (f) Wolf Motor Function Test (WMFT; 9%) (Wolf et al., 2001). Notably, the most common measures focus on sensorimotor performance of the upper extremity after stroke, which reflects the research focus of the many ENIGMA Stroke Recovery working group members who study this topic. As the working group grows more diverse, so too do the behavioral measures. At present, the database also includes measures of gait and balance, such as the 6-min walk test (Butland, Pang, Gross, Woodcock, & Geddes, 1982), cardiovascular fitness, such as VO2 max (Shephard et al., 1968), cognition (e.g., Mini-Mental State Examination (Folstein, Robins, & Helzer, 1983), Montreal Cognitive Assessment (Nasreddine et al., 2005), and mood [e.g., the Hospital Anxiety and Depression Scale (Zigmond & Snaith, 1983)].
Prospectively, we recommend that stroke researchers collect all of the measures recommended by the SRRR task force (Kwakkel et al., 2017), with a particular emphasis on the FMA, which is contained in 72% of entries in our database and thus the most well-represented measure. However, given that ENIGMA Stroke Recovery currently contains only ret
Comments (0)