Skip to main content
Free AccessOriginal Article

Neuer Locus und Kandidatengen für familiäre Legasthenie auf Chromosom 4q

Published Online:https://doi.org/10.1024/1422-4917/a000758

Abstract

Abstract.Objective: Developmental dyslexia is a highly heritable specific reading and writing disability. To identify a possible new locus and candidate gene for this disability, we investigated a four-generation pedigree where transmission of dyslexia is consistent with an autosomal dominant inheritance pattern. Methods: We performed genome wide array-based SNP genotyping and parametric linkage analysis and sequencing analysis of protein-coding exons, exon-intron boundaries and conserved extragenic regions within the haplotype cosegregating with dyslexia in DNA from one affected and one unaffected family member. Cosegregation was confirmed by sequencing all available family members. Additionally, we analyzed 96 dyslexic individuals who had previously shown positive LOD scores on chromosome 4q28 as well as an even larger sample (n = 2591). Results: We found a single prominent linkage interval on chromosome 4q, where sequence analysis revealed a nucleotide variant in the 3’ UTR of brain expressed SPRY1 in the dyslexic family member that cosegregated with dyslexia. This sequence alteration might affect the binding efficiency of the IGF2BP1 RNA-binding protein and thus influence the expression level of the SPRY1 gene product. An analysis of 96 individuals from a cohort of dyslexic individuals revealed a second heterozygous variant in this gene, which was absent in the unaffected sister of the proband. An investigation of the region in a much larger sample further found a nominal p-value of 0.0016 for verbal short-term memory (digit span) in 2,591 individuals for a neighboring SNV. After correcting for the local number of analyzed SNVs, and after taking into account linkage disequilibrium, we found this corresponds to a p-value of 0.0678 for this phenotype. Conclusions: We describe a new locus for familial dyslexia and discuss the possibility that SPRY1 might play a role in the etiology of a monogenic form of dyslexia.

Neuer Locus und Kandidatengen für familiäre Legasthenie auf Chromosom 4q

Zusammenfassung.Fragestellung: Die Legasthenie ist eine spezifische Lese- und Rechtschreibschwäche mit hoher Heritabilität. Um einen neuen Genlocus mit potenziellem Kandidatengen für diese Teilleistungsstörung zu identifizieren wurde hier eine Familie mit, über vier Generationen hinweg augenscheinlich autosomal dominant vererbter, Legasthenie untersucht. Methodik: Basierend auf genomweiter SNP-Genotypisierung wurden parametrische Kopplungsanalysen durchgeführt. Im so identifizierten Kopplungsintervall wurden anschließend bei je einem betroffenen und nicht betroffenen Familienmitglied Sequenzuntersuchungen aller proteinkodierenden Bereiche sowie weiteren konservierten Sequenzabschnitten vorgenommen. Die Kosegregation wurde mittels Sequenzierung bei allen zugänglichen Familienmitgliedern überprüft. Darüber hinaus wurden 96 Individuen mit Legasthenie, die zuvor positive LOD Scores bezüglich Chromosom 4q28 aufgewiesen hatten, untersucht und auch mit einer noch wesentlich größeren Kohorte (n = 2591) Analysen vorgenommen. Ergebnisse: Es wurde ein einzelnes prominentes Kopplungsintervall auf Chromosom 4q identifiziert. Die nachfolgenden Sequenzanalysen ergaben bei den legasthenischen Probanden eine spezifische Nukleotidvariante in der 3’-UTR des im Gehirn exprimierten SPRY1-Gens. Diese Sequenzveränderung könnte die Bindungseffizienz des RNA-bindenden IGF2BP1 Proteins beeinträchtigen und damit Einfluss auf die Expression des SPRY1 Genprodukts haben. Die Analyse von 96 Individuen aus einer Legasthenie-Kohorte ergab darüber hinaus eine zweite heterozygote Variante in diesem Gen, die bei der nicht betroffenen Schwester des Probanden nicht gefunden wurde. Die Untersuchung der Region in einer großen Kohorte (2591 individuen) zeigte einen nominalen p-Wert von 0.0016 für verbales Kurzzeitgedächtnis (Digit Span), eine benachbarte SNV. Nach Korrektur auf die Anzahl lokal untersuchter SNVs und unter Berücksichtigung des Kopplungsungleichgewichts ergab sich ein p-Wert von 0.0678 für diesen Phänotyp. Schlussfolgerungen: Diese Arbeit beschreibt einen neuen Genlocus für erbliche Legasthenie und diskutiert die Möglichkeit einer Beteiligung des SPRY1 Gens im Rahmen der Ätiologie dieser Teilleistungsstörung.

Introduction

Developmental dyslexia is a specific reading and writing disability with a prevalence of around 5–10 % in school-age children (Becker et al., 2014). The highly variable phenotype is characterized not only by difficulties with phonological awareness, phonological decoding, orthographic coding, auditory short-term memory, and/or rapid naming (Peterson & Pennington, 2012), but also by sensory deficits (rapid auditory processing), attentional deficits, and perceptual deficits (the magnocellular-dorsal theory) (Stein, 2014). Developmental dyslexia has a high impact on school and psychosocial development. A compelling body of evidence suggests that developmental dyslexia has a strong genetic component, and heritability estimates range between 40 % and 80 % (Davis et al., 2001; Gayán et al., 2001; Gayán & Olson, 2003). Still, the molecular genetic basis of this disorder is largely unknown, with no gene consistently implicated across a larger number of studies (Schumacher et al., 2007). Although most dyslexia cases are probably caused by a complex interaction of several genes as well as environmental influences, the occurrence of highly loaded pedigrees with a transmission pattern consistent with autosomal dominant inheritance (such as the one we report here) suggests the possibility that monogenic subforms of dyslexia do exist. In such families, the genetic effect may be caused by variants in single genes with high penetrance. Parametric and nonparametric linkage studies have suggested several chromosomal loci, with some of them having received support from independent observations, for example, 1p36 (DYX8) (Rabin et al., 1993), 2p16 (DYX3) (Fagerheim et al., 1999), 2q22 (Raskind et al., 2005), 3q13 (DYX5) (Nopola-Hemmi et al., 2001), 4q13 (Fisher et al., 2002), 6p21 (DYX2)(Cardon et al., 1994), 7q32 (Kaminen et al., 2003), 8q12.3 (Gialluisi et al., 2019), 15q21 (DYX1) (Smith et al., 1983), 18p11 (DYX6) (Fisher et al., 2002), 18q12.2 (Gialluisi et al., 2019), 21q22 (Fisher et al., 2002), and Xq27 (DYX9) (de Kovel et al., 2004) (see also Figure 1d).

In addition, ten candidate genes (DYX1C1 (Taipale et al., 2003), KIAA0319 (Francks et al., 2004), DCDC2 (Meng et al., 2005), ROBO1 (Hannula-Jouppi et al., 2005), KIAA0319L (Couto et al., 2008), CNTNAP2 (Peter et al., 2011), MRPL19 (Scerri et al., 2012), C2ORF3 (Scerri et al., 2012), CEP63 (Einarsdottir et al., 2015), and NCAN (Einarsdottir et al., 2017) have been reported to be associated with dyslexia.

Here, we present the findings from a study in which we investigated a well-documented four-generation pedigree, where dyslexia follows a clear pattern of autosomal dominant inheritance (Grimm et al., 2006). SPRY1, the gene containing the putatively causative sequence alteration we identified, was subsequently screened for variants in a cohort of 96 well-characterized dyslexic individuals from a previous study (Konig et al., 2011), revealing a second, independent SPRY1 variant possibly associated with this phenotype.

Subjects and Methods

Subjects and Sampling

All experiments were carried out following the relevant ethical guidelines and regulations. Ethics votes were granted by the Ethics Committees of the Universities of Marburg and Würzburg for the genetic studies of dyslectic individuals on which this report is based (DFG Gr506/5-1, Re471/9-1, Schu988/2-1/2-3). Assessment included testing of literacy and cognitive skills as well as an evaluation of medical records concerning the family history of reading and writing abilities. The diagnostic criteria were applied specifically correspondent to the Multiaxial Classification of Child and Adolescent Psychiatry Disorders (Remschmidt et al., 2001; WHO, 1996):

  • Reading and spelling performance significantly lower than over average IQ (significance determined by the 1.5 SD criterion as also described by Schulte-Körne; Schulte-Korne, 2010).
  • Positive family history of developmental dyslexia, delay of acquisition of language skills in preschool years.
  • Scholastic difficulties did not readily remit with increased help at school and at home.
  • The specific learning deficit was not explained by mental retardation or uncorrected visual, hearing, motor, or other neurological or psychological impairments.
  • The reading and spelling impairment cannot also be attributed to socioeconomic disadvantage or lack of education.

The dyslexia phenotype of all affected individuals fulfills the criteria from the ICD-10 Diagnostic Guidelines as reviewed by, among others, Thambirajah (2010). Unaffected individuals were identified by normal school performance and the absence of any conspicuous features (see also the electronic supplementary material [ESM] 1). The pedigree of the (German) family is shown in Figure 1a. Included were all individuals from whom DNA samples could be obtained. Together with the informed consent of all living individuals, we obtained blood samples to isolate the DNA from all family members in the pedigree. The Epstein Barr virus-transformed lymphoblastoid cell lines were generated from one patient and his unaffected daughter.

Linkage Analysis

We genotyped six affected and seven healthy individuals from three generations of the family using the Illumina Linkage IV panel (Illumina, 9885 Towne Center Drive, San Diego, CA, 92121–1975, USA) and performed genetic linkage analysis on the resulting 5,695 marker variants distributed over all chromosomes. The genotyping data were converted into appropriate formats, using the ALOHOMORA software (Ruschendorf & Nurnberg 2005). To verify the interindividual relationships, we used standard quality-control routines, including the graphical representation of relationship errors (GRR) (Abecasis et al., 2002) and PedCheck (O’Connell & Weeks, 1998). Merlin (Abecasis et al., 2002) and GeneHunter (Kruglyak et al., 1996) tools were applied to perform genome-wide multipoint linkage analysis, assuming an autosomal dominant mode of inheritance and complete penetrance as also previously described (Garshasbi et al., 2006). Subsequently, we chose the pseudomarker program (Goring & Terwilliger, 2000; Hiekkalinna et al., 2011) (updated implementation at http://www.jurgott.org/linkage/runPM.html) as it has two advantages over other approaches: (1) It treats marker allele frequencies as nuisance parameters, that is, it estimates allele frequencies separately in numerator and denominator of the likelihood ratio for the LOD score, so that results become virtually independent of marker allele frequencies. (2) It allows for the combined estimation of linkage and linkage disequilibrium between marker variant and putative disease variant, although this second advantage is not relevant in a small pedigree as analyzed here.

Array-CGH Analysis

In order to exclude insertions and deletions, we performed array-CGH using a high-resolution 1M oligonucleotide array (SurePrint G3 Human CGH Microarray Kit, 1x1M, Agilent Technologies, Santa Clara, CA, UDA) as described previously (Lohan et al., 2014), which showed no abnormalities in the tested individuals.

Variant Screening and in Silico Analysis

A total of 16,092 regions (Online Resource for coordinates [Agilent Technologies Deutschland GmbH, Herrenberger Str. 130, D-71034 Böblingen, Germany]) comprising 474,282 bp including all exons and exon-intron boundaries as well as among vertebrates conserved regions from within the interval, flanked by rs1880863 and rs906372, were enriched with custom-made Agilent SureSelect DNA capture arrays (Agilent Technologies Deutschland GmbH, Herrenberger Str. 130, D-71034 Böblingen, Germany) as previously described (Najmabadi et al., 2011) and deep sequenced by generating 50bp reads covering the enriched sequences at an average depth of 953 for sample 1890 and 2010 for sample 1893 (Online Resource Table 1 for further details) with a SOLiD 5500XL sequencer (Life Technologies GmbH, Frankfurter Straße 129B, D-64293 Darmstadt, Germany) according to the manufacturer’s instructions. The total length of all enriched regions is 516,851 bp. We then aligned the reads obtained from the sequencing runs with the human reference genome (hg19) using bfast-0.6.5 (Homer et al., 2009). Variant screening was performed with samtools (H. Li et al., 2009) using reads that mapped unambiguously to the target region. Variant calls had to be supported by at least 10 nonidentical reads, a Phred-like quality score ≥ 1 and an allelic percentage of 20–80 %. We retained variants with a minor allele frequency (MAF) below 0.01 and also filtered using sample 1893 (nondyslexic family member), dbSNP132, refseq exons, silent variants, and 6503 exomes available through the Exome Variant Server (NHLBI, 2013). In addition, we performed Sanger sequencing in order to safeguard against false-positive results. We screened SPRY variants in the dyslexia cohort (Konig et al., 2011) by Sanger Sequencing. Furthermore, we used a collection of 100 samples from unrelated individuals (dyslexia status not having been previously evaluated) that matched the intellectual status of the individuals of the family investigated here (higher education entrance qualification and academic studies) as controls for identified variants.

Table 1 In silico pathogenicity prediction results

For the in silico prediction of functional effects of the identified variants, we used a variety of prediction tools: MutationTaster2 (Schwarz et al., 2014) (http://www.mutationtaster.org), FATHMM (http://fathmm.biocompute.org.uk/fathmmMKL.htm), SNPdryad (http://snps.ccbr.utoronto.ca:8080/SNPdryad/), SNAP2 (https://rostlab.org/services/snap/). (http://mutationassessor.org/r3/), Variant Effect Predictor (http://www.ensembl.org/info/docs/tools/vep/index.html). In addition, we used the CADD (Kircher et al., 2014) and REVEL tools (Ioannidis et al., 2016).

Dyslexia Cohort

For additional analysis, we employed 96 dyslexic individuals, phenotyped according to the same criteria applied for the pedigree described, who had previously shown positive LOD scores (LOD > 0) for chromosome 4q28, from a German cohort of 248 dyslexia families described by Konig et al. (2011). These had been recruited through a single-proband sib-pair study design with a detailed assessment of dyslexia and related cognitive traits (Konig et al., 2011). The power to detect a locus explaining 1 % of trait variance at this locus employing a p-value criterion of .05 is 10 %, reflecting the comparatively low power of the sibship design in these settings. Furthermore, we performed analyses in a previously described multinational dataset (Gialluisi et al., 2019). This dataset consists of individuals from nine cohorts of reading-impaired and typically developing children of European ancestry with a total sample size of between 2,562 and 3,468, depending on the phenotype considered. The power to detect a genome-wide signal (p < 5e-08), explaining 1.5 % of the phenotypic variance for a specific trait, in this dataset is beyond 75 %. In the regional replication scenario employed here, the power for explaining 1 % of the variance is beyond 99 % (setting a p-value threshold of .01). For a more detailed discussion of power in this cohort, see Gialluisi et al. (2019). All power calculations were done using the Genetic Power Calculator website (Purcell et al., 2003).

Gene Expression Analysis

Commercially available RNA from various adult brain tissues (cerebellum, cerebral cortex, corpus callosum, diencephalon, frontal lobe, hippocampus, medulla oblongata, occipital lobe, parietal lobe, temporal lobe, thalamus) was analyzed by RNA-Sequencing on a 5500xl SOLiD System (Life Technologies, Carlsbad, CA, USA) as previously described (Esmaeeli-Nieh et al., 2016). Raw sequencing data were submitted to read mapping, using the Lifescope software analysis tool (Life Technologies, Carlsbad, CA, USA) and reference genome GRCh37/hg19. Statistical analysis was carried out using R (R Core Team 2020). Normalization and expression analysis was performed using the R packages EDASeq (Risso et al., 2011) and DESeq (Anders & Huber 2010), respectively.

Results

In the pedigree investigated in our study, 8 out of a total of 15 offspring of affected parents were affected with dyslexia. Assuming simple dominant inheritance, a theoretical number of 7.5 would have been expected. Comparing these numbers using a chi2 test as suggested by Emery (Emery, 1976), we found no significant difference between the expected and the observed values. Still, linkage analysis under a fully penetrant dominant, affected-only analysis showed only minimal lod scores, so we allowed for incomplete penetrance of 0.96 for susceptibility genotypes, a penetrance for phenocopies of 0.01, and specified a disease allele frequency of 0.005 (the exact parameter values are not too relevant as long as incomplete penetrance is specified). A maximum LOD score of 2.61 occurred at basepair position 130,292,021 (rs318539) on chromosome 4q27-28.3 (Figure 1). If linkage analysis is carried out allowing for linkage disequilibrium (LD) between the marker and the putative disease locus, the result is somewhat stronger, and the p-value of .000220 translates into a LOD score of approximately 2.68. While this result falls short of the traditional threshold of a LOD score of 3 for genome-wide significance, it is nonetheless suggestive (Lander & Kruglyak, 1995), also because it belongs to a single peak exceeding any other peak by more than 1 LOD unit (Figure 1a). Finally, it should also be noted that the maximum LOD scores of 2.61 and 2.68 are very close to the theoretical maximum LOD score obtainable in this family, which is 2.71.

Figure 1 (a) Family pedigree and haplotypes (box) within linkage interval (SNP IDs indicated on the left) and genome-wide LOD score plot showing a single peak (arrow) on chromosome 4 exceeding any other peak by more than 1 unit. The corresponding linkage interval between rs1880863 and rs906372 is indicated (red box) on the ideogram of chromosome 4. (b) Curve of LOD scores (y-axis, red line) at variants surrounding the peak LOD score, graphed against base pair positions (x-axis, in units of MB) on chromosome 4. The red line reflects a LOD score drawn 2 units below the maximum LOD score, whose intersections with the LOD score curve provide the endpoints of a 95 % confidence interval (obtained as previously recommended by Conneally et al., 1985). (c) LOD scores at the position of SPRY1 in dyslexia cohort (Konig et al., 2011). 97 families had positive LOD scores (indicated by grey shading). (d) Known dyslexia loci. The newly identified locus on chromosome 4 is highlighted (black circle).

The marker rs318539 is part of a haplotype between the markers rs1880863 and rs906372 which is shared among all affected members (see also supplemental Table 1 for phenotype information) of the three genotyped generations (Figure 1a), spans 11,653,660 bp on the long arm of chromosome 4, and contains 31 REFSEQ genes. Array CGH analysis did not show any copy number variants in a DNA sample from the index patient at this locus. The exonic regions of the genes as well as sequences conserved among vertebrates within this region were screened for variants using targeted enrichment and next generation sequencing. After we filtered out known frequent variants, a heterozygous variant (hg19 chr4:g.124324233C>T) in the 3’ UTR of SPRY1, the human ortholog of the Drosophila sprouty gene 1, remained (Figure 2), which cosegregated with dyslexia in the investigated family, as verified by sequencing in all available family members. This variant can be found in dbSNP (Sherry et al., 2001) build 134 and above as rs150615428. The MAF (T-allele) based on 2,504 individuals from the 1,000 genomes project (Genomes Project et al., 2015) is 0.005, and in all carriers, it was found in a heterozygous state. Data from the available exome studies or repositories (200 Danish exomes; Y. Li et al., 2009), Exome Variant Server (NHLBI, 2013), and ExAC data (Consortium, 2015) did not contain this alteration, though the investigated genomic regions did not include the position of rs150615428, so that these data sets could not be used for determining an allele frequency.

Figure 2 Sequence alterations affecting SPRY1(a) in the 3’UTR or (b) the terminal exon. (c) Schematic representation of SPRY1 (above) and protein sequence of the gene product (below). Arrows indicate the positions of the respective sequence alterations.

Interestingly, when screening SPRY1 in a collection of 100 samples from unrelated individuals (dyslexia status was not previously evaluated) who match the intellectual status of the individuals of the family investigated here (higher education entrance qualification and academic studies) as a more matching set of controls, we also detected one case of heterozygous for rs150615428. Because the samples were anonymized, the phenotype of the carrier of this variant could unfortunately not be determined.

Thus, taking into account the reported 24 cases from the 1000 Genome Project and the one case from our control cohort, we found the allele carrying the DNA alteration to be present in 25 out of at least 2,604 individuals (i. e. 5,208 chromosomes) with a resulting theoretical MAF of 0.0048, which is consistent with the MAF reported in dbSNP (http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?rs=rs​150615428, accessed 08/2016).

The rs150615428 variant is predicted by MutationTaster2 (Schwarz et al., 2014) to be disease-causing (http://www.mutationtaster.org). This was supported by the FATHMM tool (http://fathmm.biocompute.org.uk/fathmm​MKL.htm), which predicted rs150615428 to be deleterious (see Table 1 for pathogenicity prediction results from various software tools). Moreover, according to the doRiNA database (Anders et al., 2012), g.124324233C>T is situated in the center of an IGF2BP1 binding site and therefore most likely affects the interaction of IGF2BP1, an RNA-binding protein that controls the cytoplasmic fate of its target mRNAs (Stohr & Huttelmaier, 2012), with SPRY1 mRNA. Thus, it could affect the expression level of SPRY1. In order to ascertain whether SPRY1 is expressed in relevant parts of the human brain, we carried out RNA-Seq experiments with RNA obtained from different parts of the human brain. This analysis showed that SPRY1 is expressed in all investigated brain regions (cerebellum, cerebral cortex, corpus callosum, diencephalon, frontal lobe, hippocampus, medulla oblongata, occipital lobe, parietal lobe, temporal lobe, thalamus; see also Table 2).

Table 2 SPRY1 expression in the human brain

In order to find additional evidence for the relevance of alterations that affect the expression, cellular localization, and/or function of this putative candidate gene for dyslexia, we analyzed SPRY1 in 96 dyslexic individuals from a German cohort of 248 dyslexia families (Konig et al., 2011). These families had been recruited through a single-proband sib-pair study design with detailed assessment of dyslexia and related cognitive traits. All 96 investigated families had previously provided positive LOD scores for chromosome 4q28 (Konig et al., 2011). The sequence analysis of SPRY1 in these probands, however, revealed only one sequence variant in this gene in a single individual (LMR-2022-1; LOD score 0.015): A missense variant Y29C (rs141161959; hg19:g.124322832A>G, see also Figure 2b and 2c) with reported frequencies of 0.001-0.002 for the G allele. This variant was also determined to be disease-causing by a variety of prediction tools including MutationTaster2 (Schwarz et al., 2014), FATHMM, and SNPdryad. Also, the mutation assessor tool predicted a possible impact and only the Variant Effect Predictor software predicted this variant to be tolerated (see also Table 1). In addition, we checked the variants using the CADD (Kircher et al., 2014) and REVEL tools (Ioannidis et al., 2016), both of which yielded scores that do not exclude pathogenicity for the described variants (Table 1).

No individuals homozygous for the G allele were described (dbSNP build 138). Subsequent sequence analysis of two additional family members showed that this sequence variant was inherited from the father, whose dyslexia phenotype is unknown. Still, both the unaffected sibling and the proband’s mother are homozygous for the unaltered sequence.

Both variants described here were submitted to the public Leiden Open Variation Database 3.0 (http://www.lovd.nl/) and have the following identifiers: #00100429 (g.124324233C>T) and 00100431 (g.124322832A>G).

By investigating the region in a much larger sample, we further found that, for the verbal short-term memory phenotype digit span in n = 2,591 individuals, a nominal p-value of 0.0016 was observed for the SNV rs159864 (Gialluisi et al., 2019) neighboring the SPRY1 gene. After correcting for the investigated local number of SNVs and taking linkage disequilibrium into account, this corresponds to a p-value of 0.0678 for digit span. Correcting for the possible tests of the other phenotypes would add another five tests as noted in the originating manuscript (Gialluisi et al., 2019). In the original genome-wide testing of linkage in the extended family, a correction for the family is implicit in the LOD score threshold of 3 for significance. This threshold would correspond to a much stricter p-value than 0.05 required to declare genome-wide significance at 0.05. For the lookup in the family-based sample, no correction for genome-wide analysis needs to be done as the lookup is strictly only on that region and, because of the high general serial autocorrelation of LOD score curves, amounts to a single test. Finally, for the lookup in the data from Gilaluisi et al. we took into account the number of SNVs, linkage disequilibrium, and the correlation between phenotypes, thus implementing a correction of the family-wise error rate by taking these factors into account.

Discussion

We report here on a family with 8 out of 15 members affected by developmental dyslexia. In line with previous studies reporting autosomal dominant inheritance in dyslexia and related phenotypes (Chapman et al., 2003; Gilger et al., 1994; Wijsman et al., 2000), we assumed a comparable mode of inheritance and thus performed genetic linkage analysis leading to the identification of a previously undescribed locus on chromosome 4q28, which is supported by reaching the near maximum LOD score obtainable in this family. This does not overlap with published loci and contains a putative new candidate gene. The only limitation in this context is the fact that we could not obtain DNA samples from the first generation of the investigated pedigree, as they were not available for genotyping and linkage analysis. The inclusion of more generations would have made it possible to reach even more significant LOD scores. Our experimental strategy is in keeping with previous studies, where the authors used a very similar combination of linkage analysis and next generation sequencing for the successful identification of pathogenic sequence alterations with an autosomal dominant mode of inheritance in linkage regions with comparable LOD scores (Bowen et al., 2011; Sobreira et al., 2010).

Interestingly, several other loci on chromosome 4q (e. g., 4q13.1, 4q22.1, and 4q32.3) have been associated with ADHD. This is of note, because developmental dyslexia and ADHD were observed to occur in both clinical and community samples side by side in individuals more often than would be expected by chance. However, these loci do not overlap with the linkage interval we investigated here, and, more importantly, ADHD was not observed in any of the affected individuals in the pedigree we studied.

A variant screen in all known exonic regions as well as conserved extragenic sequences within the cosegregating haplotype yielded a DNA variant (rs150615428) that affects the SPRY1 gene. Interestingly, a second SPRY1 variant (rs141161959) was found in one out of 96 families who had previously shown positive LOD scores for chromosome 4q28. Both variants are predicted to be disease-causing by most prediction tools. In the family we report on here, the dyslexia phenotype appears to follow an autosomal dominant mode of inheritance, with a penetrance that appears to be 100 %; that could, however, result from a genetic background favoring the development of dyslexia, while in other carriers of this variant, a different genetic background could prevent the occurrence of the phenotype. A similar situation has been observed concerning variants in BRCA1 and BRCA2, for which penetrance estimates vary substantially between different studies, and for which it has been shown that these differences are caused by genetic and environmental modifiers (Milne & Antoniou, 2016).

SPRY1 belongs to a family of four genes that respond to FGF signaling and whose gene products seem to function as negative feedback regulators (Faedo et al., 2010; Hanafusa et al., 2002; Klein et al., 2006; Mahoney Rogers et al., 2011; Minowada et al., 1999). In mice, Spry proteins are required for the development of several tissues, including the hind- and midbrain (Basson et al., 2005; Klein et al., 2006; Mahoney Rogers et al., 2011; Shim et al., 2005). Interestingly, the repression of Spry1 can lead to an increase in the activation of the Erk (Ras-extracellular signal-regulated kinase) signaling pathway (Thum et al., 2008). This highly conserved pathway is also essential for different aspects of neuronal signaling and is thus involved in biochemical events connected with learning and the formation of memory (Adams & Sweatt, 2002; Mazzucchelli & Brambilla, 2000; Orban et al., 1999).

Among the 60,706 exomes investigated by the Exome Aggregation Consortium (ExAC) (Consortium, 2015), only six loss-of-function variants are reported in this gene, none of which was observed in a homozygous state. Also, the vast majority of single nucleotide alterations were found only in a heterozygous state. This observation could tempt one to speculate that a complete loss of functional SPRY1 might not be compatible with life, and that changes in the dosage of the gene product can already have an effect. Such dosage effects would most likely become apparent in evolutionarily young features with a therefore comparatively low degree of mechanistic redundancy. This could be assumed for the processes involved in the development and maintenance of human cognitive features, let alone for the complex integration of several such features, as is required for reading and writing abilities.

On the molecular level, SPRY proteins can inhibit the Ras-MAPK pathway (Hanafusa et al., 2002; Kim & Bar-Sagi, 2004; Mason et al., 2006). Variants in members of this signaling pathway can result in a variety of congenital disorders, the so-called RASopathies (for a review, see, e. g., Rauen, 2013). Among the hallmarks shared by these disorders is the impairment of cognitive functions. One could therefore speculate that a mild disturbance in the regulation of this pathway might well interfere with the acquirement of scholastic skills. Such a presumably mild effect on RAS-MAPK signaling would be in line with the predicted interference of the 3’UTR variant we report here with an IGF2BP1 binding site. IGF2BP1 is an RNA-binding protein, which plays a role in nuclear export, subcellular localization, stability, and translation of its target mRNAs. It could be speculated that impaired IGF2BP1 binding to SPRY1 transcripts affects the amounts of available SPRY1 gene products in the cell, which in turn might lead to the downstream disturbances that are eventually responsible for the dyslexia phenotype observed in the carriers of this variant.

Still, bearing in mind that a causative variant could also be located elsewhere in this locus and would cosegregate equally well, we also screened among vertebrates conserved extragenic regions in the linkage interval, which, however, failed to lead to the discovery of any further interpretable potentially causative variants.

Taken together, our results suggest a new locus for developmental dyslexia and provide tentative evidence that could be seen to point to the possible involvement of SPRY1 in the etiology of this condition.

Electronic supplementary material

The electronic supplementary material (ESM) is available with the online version of the article at https://doi.org/10.1024/1422-4917/a000758.

Literature

Andreas W. Kuss, Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany, Felix-Hausdorff-Str. 8, 17475 Greifswald, Germany, E-mail