Short tandem repeats (STRs) expansions are increasingly recognised as key contributors to neurodegenerative diseases, where both repeat size and sequence composition influence pathogenicity. Over 70 pathogenic STR expansions have been implicated in neurological and neurodegenerative disorders. In amyotrophic lateral sclerosis (ALS), a repeat expansion in C9orf72 accounts for ~40% of familial and ~10% of sporadic cases. Similarly, pathogenic expansions in ATXN1 and ATXN2 cause spinocerebellar ataxia (SCA) types 1 and 2, respectively, while intermediate expansions in these loci have been associated with ALS risk [1].
Recently, a GGC repeat expansion in ZFHX3, encoding a polyglycine tract, was identified as the cause of SCA4 [2]. Expansions exceeding 41 GGC units result in motor neuron degeneration, a hallmark of both SCA4 and ALS. Importantly, the composition of polyglycine-coding STRs can modify disease phenotypes. As seen in NOTCH2NLC, increased interruptions within the repeat sequence are associated with a muscle-weakness-dominant subtype of NOTCH2NLC-realted GGC expansion disorders [3].
Given the genetic overlap between ALS and SCAs, we investigated the size distribution and sequence composition of ZFHX3 GGC expansions in Australian ALS patients. Using ExpansionHunter v5 [4], we genotyped the repeat in whole-genome sequencing data from 802 ALS patients and 6,215 population-matched controls (gnomAD). Alleles ≥24 repeat units were visually validated using REViewer [5]. Case-control associations were assessed using Fisher’s exact test, and TRMotifAnnotator [6] was used to analyse sequence composition.
ZFHX3 GGC repeat sizes ranged from 13–27 units in ALS cases and 5–29 in controls. There were no expansions exceeding the SCA4 pathogenic threshold. Sequence analysis revealed interruptions within the polyglycinetract, including serine, aspartic acid, and glycine conformations.
While ZFHX3 repeat lengths were not linked to the genetic continuum between ALS and SCA in Australian cases, repeat sequence composition may be important for understanding the role of STRs in neurodegenerative diseases.