Article
Published: 03 January 2022

Neural responses to affective speech, including motherese, map onto clinical and social eye tracking profiles in toddlers with ASD

Nature Human Behaviour volume 6, pages 443–454 (2022)

Abstract

Affective speech, including motherese, captures an infant’s attention and enhances social, language and emotional development. Decreased behavioural response to affective speech and reduced caregiver–child interactions are early signs of autism in infants. To understand this, we measured neural responses to mild affect speech, moderate affect speech and motherese using natural sleep functional magnetic resonance imaging and behavioural preference for motherese using eye tracking in typically developing toddlers and those with autism. By combining diverse neural–clinical data using similarity network fusion, we discovered four distinct clusters of toddlers. The autism cluster with the weakest superior temporal responses to affective speech and very poor social and language abilities had reduced behavioural preference for motherese, while the typically developing cluster with the strongest superior temporal response to affective speech showed the opposite effect. We conclude that significantly reduced behavioural preference for motherese in autism is related to impaired development of temporal cortical systems that normally respond to parental affective speech.

Main

Social and language development are inextricably linked in typical infants as a result of experience during caregiver–infant interactions^1,2. It is theorized that social and language development and learning, as well as affect and emotion development, in infants are experience-expectant processes that involve highly affective parental speech, such as infant-directed speech or motherese, as an early input during caregiver–infant interactions³. Motherese is a highly compelling form of positive affective speech; it has human-unique and special characteristics including higher pitch, slower tempo and exaggerated speech contours, accompanied by heightened positive affect. Motherese occurs in many and diverse cultures, and some theorize that this type of affective speech has a genetic basis that emerged evolutionarily in humans^4,5,6. The prevailing view, then, is that, across cultures and languages, strongly affective speech creates and maintains a caregiver–infant social and language interaction which promotes learning. Implicit is that the infant’s seemingly ‘automatic’ neural response to motherese mediates this development and learning. Typically developing (TD) infants attend to and prefer motherese over other forms of adult speech^3,7,8,9,10, and a small number of behavioural and neuroimaging studies suggest that TD infants may process motherese differently from non-speech sounds^11,12,13,14,15. If such attention enhances social and language learning, then it would be predicted that enhanced or reduced neural responsiveness to such affective speech might be associated with enhanced or reduced early-age social and language ability. However, this hypothesis remains largely untested.

Deficits in joint caregiver–child social interactions and in social and language learning and development are early-age signs of autism spectrum disorder (ASD), so it is essential to understand the reasons for these deficits^16,17,18. While the behavioural literature has thoroughly described these diagnostically core deficits in ASD^19,20, there is remarkably scant evidence about how these deficits arise and whether a key actor may be abnormally reduced neural responsiveness of infants with ASD to affective speech, including motherese. There are no studies that characterize impairments in neural processing of motherese and related types of affective speech in ASD, nor any carefully controlled eye tracking studies of attention to motherese in ASD. Potentially, if neural responses to motherese and other affective speech in infants with ASD are reduced, then attention to highly affective speech, such as motherese, as well as social and language abilities might likewise be reduced.

Therefore, the main study hypothesis is that, across TD toddlers and toddlers with ASD, enhanced or reduced attention to affective speech would be associated with enhanced or reduced neural responsiveness to affective speech as well as with better or poorer early-age social and language abilities. To investigate this hypothesis, we measured functional magnetic resonance imaging (fMRI) responses to three different levels of affective speech—mild affect speech, moderate affect speech and motherese—during natural sleep in large samples of toddlers with ASD and TD toddlers. Because motherese is a strongly positive and, for infants, compelling form of affective speech, by including motherese along with milder affective speech we have a strong test of whether neural and behavioural responses to positive affective speech are reduced in toddlers with ASD. Additionally, by obtaining fMRI activation to affective speech during natural sleep, we were able to measure how the toddler’s brain ‘intrinsically’ responds to motherese and non-motherese affective speech independent of volition, arousal, interest, motivation, attention, awareness, expectation and cooperation, which could alter neural responses recorded during wakefulness. We also determined whether such neural responses are correlated with early-age social and language abilities in toddlers with ASD and TD toddlers. Then, to determine whether neural responses to affective speech as well as social and language abilities were associated with behavioural preference for motherese, we used active, gaze-contingent eye tracking, which provides strong evidence of volitional preference as opposed to passive looking and objectively quantifies preference for a female speaking motherese. We chose to test eye tracking on motherese because it provides a strong test of behavioural responsiveness to age-relevant, highly compelling affective speech, as opposed to eye tracking to mildly affective speech.

As shown schematically in Fig. 1, we first tested whether neural responses to affective speech including motherese are in fact reduced in toddlers with ASD. Next, we tested the prediction that neural responses to affective speech are correlated with social and language developmental measures across toddlers with ASD and TD toddlers. Lastly, we tested our main hypothesis above using a two-stage procedure. In the first stage, we used a data-driven, unbiased method (that is, similarity network fusion (SNF)^21,22) to identify clusters of individuals whose combined neural and clinical measures are maximally similar to each other and maximally different from individuals in other clusters. In the second stage, we tested whether individuals in different neural–clinical clusters differed in preference for and attention to motherese based on the gaze-contingent eye tracking preferential looking paradigm. In this SNF approach, rather than determining diagnostic categories a priori through clinical measures, the similarity and dissimilarity in patterns of the different kinds of measures across all individuals drives subject subgroups/clusters independent of diagnosis. This method takes into account all neural and clinical data equally in each individual and reveals bio-behavioural dimensionality.
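To make the first stage more concrete, the fusion step of similarity network fusion can be sketched in plain Python. This is a simplified illustration under stated assumptions, not the authors' pipeline: the toy data, the parameter values (`k`, `mu`, `iters`) and all function names are hypothetical, the per-iteration normalization used in published implementations is omitted, and the final clustering step (typically spectral clustering on the fused matrix) is not shown.

```python
import math

def pairwise_dist(X):
    """Euclidean distance matrix for a list of feature vectors."""
    return [[math.dist(a, b) for b in X] for a in X]

def affinity(D, k, mu):
    """Scaled exponential similarity with a locally adaptive bandwidth.
    Assumes no duplicate subjects, so the bandwidth eps is always > 0."""
    n = len(D)
    # Mean distance from each subject to its k nearest neighbours (self excluded).
    knn = [sum(sorted(D[i][j] for j in range(n) if j != i)[:k]) / k for i in range(n)]
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            eps = (knn[i] + knn[j] + D[i][j]) / 3.0
            W[i][j] = math.exp(-D[i][j] ** 2 / (mu * eps))
    return W

def full_kernel(W):
    """Global similarity: diagonal is 1/2, off-diagonal entries of a row sum to 1/2."""
    n = len(W)
    P = []
    for i in range(n):
        off = sum(W[i][j] for j in range(n) if j != i)
        P.append([0.5 if i == j else W[i][j] / (2.0 * off) for j in range(n)])
    return P

def local_kernel(W, k):
    """Local similarity: keep only each subject's k most similar neighbours."""
    n = len(W)
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        nbrs = sorted((j for j in range(n) if j != i),
                      key=lambda j: W[i][j], reverse=True)[:k]
        total = sum(W[i][j] for j in nbrs)
        for j in nbrs:
            S[i][j] = W[i][j] / total
    return S

def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def mean_matrix(mats):
    n = len(mats[0])
    return [[sum(M[i][j] for M in mats) / len(mats) for j in range(n)] for i in range(n)]

def snf(views, k=2, mu=0.5, iters=10):
    """Fuse one similarity network per data type into a single network."""
    Ws = [affinity(pairwise_dist(X), k, mu) for X in views]
    Ps = [full_kernel(W) for W in Ws]
    Ss = [local_kernel(W, k) for W in Ws]
    for _ in range(iters):
        # Each view diffuses the average of the OTHER views through its own
        # local neighbourhood structure: P_v <- S_v * mean(P_others) * S_v^T.
        Ps = [
            matmul(matmul(Ss[v], mean_matrix([P for u, P in enumerate(Ps) if u != v])),
                   [list(r) for r in zip(*Ss[v])])
            for v in range(len(views))
        ]
    return mean_matrix(Ps)

# Toy data (hypothetical): six subjects, two data types, values assumed standardized.
neural = [[0.9, 1.0], [1.0, 1.1], [1.1, 0.9], [0.1, 0.2], [0.2, 0.1], [0.0, 0.0]]
clinical = [[1.0], [1.2], [0.9], [-1.0], [-0.8], [-1.1]]
fused = snf([neural, clinical])
```

In this toy example, subjects 0–2 and 3–5 form two groups in both data types, so fused similarities within a group end up larger than those between groups. No diagnostic label enters the computation, which is the sense in which the grouping is data-driven.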

**Fig. 1: Experimental design and data analysis flow chart.**

Results

Variable affective speech levels across fMRI language paradigms

We collected fMRI data from three language paradigms with variable affect levels: mild affect speech (the story language paradigm), moderate affect speech (the Karen language paradigm) and motherese (the motherese paradigm). For detailed descriptions of these language paradigms, see Methods section. The speech stimuli in each language paradigm included female voices speaking with varying levels of affective valence and speech. To determine whether there were perceptual differences in affect levels across the three language paradigms, we conducted two computer-based surveys in TD adults (survey 1, n = 19; survey 2, n = 15) and compared the ratings of affect levels between language paradigms using paired two-tailed t tests. Results from both survey 1 (Supplementary Fig. 1a; motherese versus moderate affect speech, t(18) = 20.5, P < 0.001, Cohen’s d = 4, 95% CI 2.82–5.2; motherese versus mild affect speech: t(18) = 20.2, P < 0.001, Cohen’s d = 5.4, 95% CI 3.25–7.47; moderate affect versus mild affect speech: t(18) = 11.7, P < 0.001, Cohen’s d = 1.9, 95% CI 1.34–2.42) and survey 2 (Supplementary Fig. 1b; motherese versus moderate affect speech: t(14) = 47.7, P < 0.001, Cohen’s d = 17.9, 95% CI 8.15–27.6; motherese versus mild affect speech: t(14) = 73.6.4, P < 0.001, Cohen’s d = 31.2, 95% CI 12.03–50.43; moderate versus mild affect speech: t(14) = 20.4, P < 0.001, Cohen’s d = 10.3, 95% CI 2.68–17.91) demonstrated that TD adults rated the motherese vignettes as having the strongest positive affect, the Karen language vignettes as having moderate affect, and the story language vignettes as having mild affect.
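As an illustration of the paired comparison described above, the following sketch computes a paired t statistic and one common paired-design effect size (the mean difference divided by the standard deviation of the differences). The ratings are hypothetical, and the source does not state which variant of Cohen's d was reported, so the effect size here is an assumption rather than a reconstruction of the published values.

```python
import math
from statistics import mean, stdev

def paired_t(a, b):
    """Return (t, df, d_z) for paired samples a and b."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    sd = stdev(diffs)                       # sample SD of the paired differences
    t = mean(diffs) / (sd / math.sqrt(n))   # paired t statistic
    d_z = mean(diffs) / sd                  # one of several paired effect sizes
    return t, n - 1, d_z

# Hypothetical affect ratings from four raters for two paradigms.
motherese = [5, 6, 7, 8]
moderate = [3, 5, 4, 6]
t, df, d_z = paired_t(motherese, moderate)
```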

Similar activation patterns in TD adults and TD toddlers

We collected a total of 241 fMRI language datasets (see Supplementary Table 1 for scan details): 200 fMRI datasets from 71 toddlers (see Table 1 for demographic information and clinical scores) and 41 fMRI datasets from 14 adults. We first compared whole-brain activation patterns in sleeping TD toddlers and awake TD adults, and found no statistically significant differences in activation patterns for each of the three language paradigms (Supplementary Fig. 2a). To quantify the per cent signal change to language stimuli, we used two language-relevant regions of interest (ROIs) from the meta-analytic activation map in Neurosynth (https://neurosynth.org/) with the term ‘language’ that included left and right temporal regions. These ROIs were identical to those used in previous papers^23,24. Per cent signal change was significantly lower in sleeping TD toddlers than in awake TD adults (Supplementary Fig. 2b). These results show that, during sleep, TD toddlers have similar temporal cortical activation patterns during language processing, albeit with reduced activation strength compared with that of awake passively listening adults.

Table 1 Demographic information and clinical test scores for toddlers with ASD and TD toddlers

Reliability of affective speech activation in toddlers

We also evaluated the test–retest reliability of brain activation to language paradigms with varying levels of affective prosody within individual toddlers with ASD and TD toddlers across time. The test–retest scans were divided into two groups based on intervals between initial and retest scans: short-term retest (1–4 months after initial scans) and long-term retest (12–15 months after initial scans). The overall test–retest reliability (initial scans versus all retest scans) was quantified with intraclass correlation coefficients, which showed moderate to good reliability for moderate affect speech and motherese paradigms, and moderate reliability for the mild affect speech paradigm in the left temporal ROI but poor reliability in the right temporal ROI (Supplementary Fig. 3).
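For readers unfamiliar with intraclass correlation, here is a minimal sketch of one widely used form, ICC(2,1) (two-way random effects, absolute agreement, single measurement). The text above does not say which ICC form was used, so this choice is an assumption, and the scores below are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1) for a list of rows, one per subject, each holding k session values."""
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(r) / k for r in scores]
    col_means = [sum(r[j] for r in scores) / n for j in range(k)]
    # Mean squares for subjects (rows), sessions (columns) and residual error.
    ms_rows = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_cols = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    ss_err = sum(
        (scores[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n) for j in range(k)
    )
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)

# Hypothetical per cent signal change at an initial scan and a retest scan.
stable = [[0.10, 0.10], [0.20, 0.20], [0.30, 0.30]]   # identical sessions
shifted = [[1, 2], [2, 3], [3, 4]]                    # constant offset at retest
icc_stable = icc_2_1(stable)
icc_shifted = icc_2_1(shifted)
```

With identical sessions the coefficient is 1. With a constant offset between sessions it drops to 2/3 even though the ranking of subjects is preserved, because this form penalizes systematic session differences.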

Reduced neural response to speech in toddlers with ASD

We found robust and significant activation in temporal language regions in TD toddlers but reduced activation in toddlers with ASD in all three levels of affective prosody (Fig. 2a). There were no significant differences in whole-brain activation between TD toddlers and toddlers with ASD. However, an ROI-based analysis, using two-sample two-tailed t tests, demonstrated significant group differences (TD versus ASD) across three language paradigms in both left temporal ROI (mild affect speech: t(52) = 2.99, P = 0.005, Cohen’s d = 0.89, 95% CI 0.31–1.47; moderate affect speech: t(62) = 2.61, P = 0.012, Cohen’s d = 0.68, 95% CI 0.17–1.2; motherese: t(60) = 2.3, P = 0.026, Cohen’s d = 0.59, 95% CI 0.06–1.12) and right temporal ROI (mild affect speech: t(52) = 2.7, P = 0.011, Cohen’s d = 0.81, 95% CI 0.24–1.39; moderate affect speech: t(62) = 2.74, P = 0.009, Cohen’s d = 0.73, 95% CI 0.21–1.25; motherese: t(60) = 2.48, P = 0.017, Cohen’s d = 0.66, 95% CI 0.12–1.19) (Fig. 2b). Thus, TD toddlers had stronger temporal cortical activation across all three types of affective speech as compared to toddlers with ASD, who had significantly weaker responses.
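The ROI-based group comparison can be sketched as a pooled-variance two-sample t test with Cohen's d. The data are hypothetical and the pooled-variance form is an assumption, since the source does not specify whether equal variances were assumed.

```python
import math
from statistics import mean, variance

def two_sample_t(x, y):
    """Return (t, df, d) for independent samples using a pooled variance."""
    n1, n2 = len(x), len(y)
    pooled_var = ((n1 - 1) * variance(x) + (n2 - 1) * variance(y)) / (n1 + n2 - 2)
    d = (mean(x) - mean(y)) / math.sqrt(pooled_var)   # Cohen's d
    t = d / math.sqrt(1 / n1 + 1 / n2)                # t follows directly from d
    return t, n1 + n2 - 2, d

# Hypothetical per cent signal change in one ROI for two groups.
td_group = [2, 4, 6]
asd_group = [1, 2, 3]
t, df, d = two_sample_t(td_group, asd_group)
```

Writing t in terms of d makes the relationship explicit: for fixed group sizes, the two statistics differ only by the factor sqrt(1/n1 + 1/n2).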

**Fig. 2: Reduced language-related brain activation in toddlers with ASD as compared to TD toddlers.**

Correlations between fMRI and social/communication ability

We further investigated correlations between a toddler’s neural response to affective speech and his/her social and communication abilities. The results of mixed-effects models showed significant correlations between fMRI activation and Vineland socialization and communication scores across individuals and language paradigms (left temporal ROI: Vineland communication scores, t(48) = 2.4, P = 0.02, marginal R² = 0.068; left temporal ROI: Vineland socialization scores, t(50) = 2.73, P = 0.009, marginal R² = 0.08; right temporal ROI: Vineland communication scores, t(50) = 2.58, P = 0.013, marginal R² = 0.094; right temporal ROI: Vineland socialization scores: t(51) = 3.23, P = 0.002, marginal R² = 0.13) (Fig. 3 and Supplementary Table 2).

**Fig. 3: Scatterplots showing correlations between brain activation to language and social communication abilities in toddlers.**

Neural correlates of behavioural motherese preference

Given the reduced neural activation to motherese and other affective stimuli among toddlers with ASD, we next examined a behavioural measure of motherese preference and how it relates to motherese-related activation in toddlers with ASD. Gaze-contingent eye tracking during a visual preference paradigm including both motherese and non-motherese, non-social stimuli is a strong behavioural test of attention to and preference for age-relevant and compelling affective speech. As such, when visual attention preference was measured using this task in TD toddlers and toddlers with ASD, unlike TD toddlers, individuals with ASD showed substantially and significantly reduced percentage fixation towards motherese, preferring the non-motherese, non-social computer ‘techno’ sounds (Fig. 4a; TD versus ASD: t(52) = 3.25, P = 0.001, Cohen’s d = 0.83, 95% CI 0.25–1.4, two-sample one-tailed t test). Next, we examined whether fMRI activation to motherese was correlated with eye tracking-based attention to motherese in toddlers with ASD and TD toddlers using Pearson’s correlation (one-tailed test). As shown in Fig. 4b, the TD group exhibited a significant positive correlation in the left temporal ROI (r(21) = 0.407, P = 0.03), but no statistically significant correlations were observed for the right temporal ROI (r(21) = 0.186, P = 0.2). The ASD group showed no statistically significant evidence for either left (r(29) = 0.007, P = 0.49) or right temporal ROI (r(29) = −0.097, P = 0.7).
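The correlation analysis can be illustrated with a short sketch that computes Pearson's r and converts a correlation to its t statistic on n − 2 degrees of freedom. The function names and toy data are hypothetical. The p value itself would require a t distribution, which is left out here to keep the example dependency-free.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def r_to_t(r, df):
    """t statistic for testing r against zero, with df = n - 2."""
    return r * math.sqrt(df) / math.sqrt(1 - r * r)

# Hypothetical activation (x) and percentage fixation (y) for four toddlers.
r = pearson_r([1, 2, 3, 4], [2, 4, 5, 9])

# The reported left temporal correlation in TD toddlers, r(21) = 0.407,
# corresponds to a t statistic of about 2.04.
t_td_left = r_to_t(0.407, 21)
```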

**Fig. 4: Gaze-contingent eye tracking measures of preference for motherese and correlations with neural response to motherese in toddlers with ASD and TD toddlers.**

Mapping motherese preference to SNF neural–clinical clusters

While toddlers with ASD exhibited reduced activation in response to language independent of affect levels, the significant, positive relationship between per cent signal change and percentage fixation to motherese among TD toddlers suggests that a subset of toddlers perhaps exhibit both reduced activation to and behavioural preference for motherese stimuli. As such, we next applied SNF to determine whether we could objectively identify neural–clinical subgroups that directly map onto eye tracking measures of preference for motherese. Using SNF, we identified four fMRI–clinical phenotypically distinct clusters of toddlers, including two predominantly TD and two completely ASD clusters (Fig. 5a). At the individual level, 100% of TD toddlers fell into two clusters: cluster 1 (12 TD, 2 ASD) and cluster 2 (8 TD, 3 ASD), and 83.3% of toddlers with ASD also fell into two clusters: cluster 3 (0 TD, 14 ASD) and cluster 4 (0 TD, 11 ASD).

**Fig. 5: TD and ASD subgroups with distinct fMRI–clinical patterns and correlations with behavioural preference for motherese.**

Visual inspection of Fig. 5b,c shows that cluster 1 toddlers had the highest neural activation and best clinical performances among clusters, while cluster 4 toddlers had low activation and very low clinical scores (for all clinical variables, see Supplementary Fig. 4). The composite activation and clinical scores of individuals in clusters 2 and 3 generally fell between those of clusters 1 and 4. In this way, SNF provides insight into the bio-dimensional, multi-modal subgroups underlying TD and ASD neural–clinical heterogeneity.

SNF also enables follow-up statistical analyses of different clusters for further characterization, interpretation and hypothesis generation. We statistically examined the main effects of cluster (clusters 1, 2, 3 and 4), paradigm (mild affect speech, moderate affect speech and motherese) and hemisphere (left and right temporal ROIs) as well as their interactions on fMRI activation using a three-way ANOVA. Results showed main effects of cluster (F(3,46) = 11.75, P < 0.001) and hemisphere (F(1,46) = 9.5, P = 0.003) as well as a significant cluster × language paradigm effect (F(6, 92) = 2.83, P = 0.014) and a significant cluster × hemisphere effect (F(3, 46) = 3.68, P = 0.019) on fMRI activation (Supplementary Table 3). Follow-up tests showed statistically significant cluster effects involving fMRI responses to motherese. Specifically, two-sample two-tailed t tests, after correcting for multiple comparisons using the false discovery rate correction, demonstrated that right temporal response to motherese in cluster 1 was significantly greater than other clusters (cluster 1 versus 2: t(23) = 3.39, P = 0.0025, Cohen’s d = 1.31, 95% CI 0.39–2.23; cluster 1 versus 3: t(26) = 3.45, P = 0.0022, Cohen’s d = 1.31, 95% CI 0.45–2.16; cluster 1 versus 4: t(23) = 4.7, P = 0.0001, Cohen’s d = 1.76, 95% CI 0.78–2.73). Cluster differences in right temporal activation and effect size were greatest between cluster 1 and 4, with cluster 4 having the least right temporal activation to motherese among the different clusters. These can be qualitatively seen in Supplementary Fig. 4b.
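The false discovery rate correction mentioned above is commonly implemented with the Benjamini–Hochberg procedure. The sketch below assumes that procedure (the text does not name the specific method) and uses hypothetical p values, not the ones reported in this section.

```python
def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted p values in the original order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])   # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical raw p values from four follow-up comparisons.
raw = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(raw)
```

A comparison is declared significant at a chosen false discovery rate q when its adjusted p value is at most q.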

We then tested the hypothesis that cluster 1 toddlers would have greater fMRI responses to motherese than to the moderately affective but non-motherese speech paradigm, while individuals with ASD in cluster 4 would have smaller differential fMRI responses to motherese. The statistical analysis was performed with non-parametric Mann–Whitney one-tailed tests because data in cluster 4 were not normally distributed as verified by the Shapiro–Wilk test (left temporal ROI, P = 0.045; right temporal ROI, P = 0.034). Indeed, in the right temporal ROI, cluster 1 toddlers exhibited a differential increase to motherese (that is, motherese versus moderate affect speech) which was significantly greater than that observed in toddlers with ASD in cluster 4 (z = 2.03, P = 0.022, effect size r = 0.41, 95% CI 0.05–0.71). In fact, cluster 1 had 22% greater and cluster 4 had 58% less fMRI activation to motherese than to moderate affect speech. At the individual level, 71% of cluster 1 toddlers had greater activation to motherese than to moderate affect speech, while 82% of cluster 4 toddlers with ASD had less activation to motherese than to moderate affect speech (χ²(1, 25) = 5.03; P = 0.025, Cramer’s V = 0.53, 95% CI 0.16–0.85). This differential activation pattern between clusters 1 and 4 was absent for the left temporal ROI (z = 0.77, P = 0.23, effect size r = 0.15, 95% CI 0.01–0.51). In addition, cluster 4 toddlers with ASD showed a marginally statistically significant difference in right temporal activation to motherese relative to moderate affect speech when compared with toddlers with ASD in cluster 3, who had 38% greater activation to motherese than to moderate affect speech (z = 1.48, P = 0.075, effect size r = 0.3, 95% CI 0.02–0.66).
Thus, toddlers with ASD in cluster 4 displayed a distinctive pattern of (1) less neural response to motherese as well as a within-individual decrease in the differential fMRI response to motherese relative to moderate affect speech; (2) very poor social, language and cognitive abilities; and (3) severe ASD symptoms (Fig. 5b,c and Supplementary Fig. 4b,c). Notably, cluster 1 toddlers showed a neural–clinical pattern opposite from those of cluster 4, namely robust activation to all three affective speech paradigms, increased differential motherese activation relative to moderate affect speech and the highest social, language and cognitive abilities.
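The individual-level χ² test and the Mann–Whitney effect sizes reported above can be checked with a short worked example. The cell counts are an inference, not stated in the source: 71% of the 14 toddlers in cluster 1 is taken as 10, and 82% of the 11 toddlers in cluster 4 is taken as 9. Under that assumption, the reported χ² is reproduced when Yates' continuity correction is applied, Cramer's V matches the uncorrected statistic, and the effect size r matches z divided by the square root of N. The source does not say which of these conventions were used.

```python
import math

def chi2_2x2(a, b, c, d, yates=False):
    """Chi-square statistic for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(0.0, diff - n / 2)   # Yates' continuity correction
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# Rows: cluster 1, cluster 4. Columns: motherese > moderate, motherese < moderate.
# Counts are inferred from the reported percentages (an assumption).
table = (10, 4, 2, 9)
n_total = sum(table)

chi2_corrected = chi2_2x2(*table, yates=True)     # about 5.03
chi2_raw = chi2_2x2(*table)                       # about 7.00
cramers_v = math.sqrt(chi2_raw / n_total)         # about 0.53 for a 2x2 table

# Mann-Whitney effect size r = z / sqrt(N), using the reported z and N = 25.
r_right = 2.03 / math.sqrt(n_total)               # about 0.41
r_left = 0.77 / math.sqrt(n_total)                # about 0.15
```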

Finally, we compared eye tracking measures of attention to and preference for motherese (1 of the 12 motherese segments utilized in the fMRI paradigm) across the four neural–clinical clusters. As examined by two-sample two-tailed t tests (Fig. 5d), toddlers with ASD in cluster 4 had significantly lower percentage fixation towards motherese compared with those in clusters 1 and 2 (cluster 4 versus 1: 41% versus 79.28%, t(19) = −3.95, P = 0.0009, Cohen’s d = 1.86, 95% CI 0.75–2.98; cluster 4 versus 2: 41% versus 78.83%, t(16) = −3.82, P = 0.001, Cohen’s d = 1.86, 95% CI 0.66–3.07). There was a marginally statistically significant difference in percentage fixation between cluster 4 and cluster 3 (41% versus 61.76%, t(18) = −1.72, P = 0.051, Cohen’s d = 0.74, 95% CI −0.25 to 1.73).

Discussion

Our within-individual fMRI, clinical, eye-tracking design provides unique evidence in support of the long-standing behaviour-based theory that a toddler’s increased neural responses to motherese and other affective speech may increase behavioural responses to motherese utterances and lead to increased social and language abilities at early ages. We found in TD toddlers and toddlers with ASD that greater or lesser neural responses to affective speech in temporal cortex were associated with greater or lesser social and language abilities. Then, using a data-driven SNF and clustering approach, we disentangled TD and ASD neural and clinical heterogeneity into four subgroups and showed that cluster 1 (12 TD, 2 with ASD) toddlers with greater differential neural responses to motherese also have greater visual attention preference for motherese and better social and language skills than toddlers in cluster 4 (11 with ASD), who have lesser neural responsiveness to motherese affective speech, very poor social and language abilities, and reduced eye-tracking-related attention to motherese. A distinctive finding among toddlers in cluster 4 was that motherese stimuli evoked less of a neural response than did moderate affect speech, which is in direct contrast with cluster 1 toddlers, who exhibited increased neural responses to motherese. Clusters 2 and 3 contained individuals with somewhat intermediate neural–clinical phenotypes. Overall, these varying phenotypic characteristics across clusters indicate that social preference and language development are intertwined across a wide spectrum of social and language ability and disability. Moreover, these neural responsiveness effects were not confounded by volition, arousal, interest, motivation, attention, awareness, expectation and cooperation.
Further, clustering results suggest that high neural response TD toddlers and low neural response toddlers with ASD stand at opposite ends of the neural–affective–response and social–language ability spectrum. Overall, these results indicate that the biology and behaviour of both TD toddlers and toddlers with ASD are multi-dimensional.

Our study points to the early-age neural bases of the core social deficits and reduced responsiveness to parental affective speech that first emerge in infants and toddlers who develop ASD. As such, reduction of a normal neural response to affective language stimuli is already present at the early age of clinical onset in most toddlers with ASD. This may be a biomarker of foundational dysregulation of social–emotional neural development that could underpin the development of associated social, cognitive and behavioural functions. Indeed, one of the first early signs of ASD in babies and infants is a sharp reduction of behavioural response to mother’s affective speech^25,26. The present study, conducted in sleeping toddlers with ASD, provides evidence for the possible early-age neural basis of reduced behavioural preferences for motherese utterances. In cluster 4, it is robustly evident that reduced neural responses to affective speech, including motherese, are associated with reduced volitional behavioural attention to motherese and poor social and language abilities.

Because this reduced neural response to affective speech, including motherese, is observed during natural sleep, it cannot be attributed to attention, arousal, momentary distractions or competing motivations. As compared with cluster 1 toddlers, toddlers with ASD in cluster 4 have a strikingly weak neural response in the superior temporal cortex that actually differentially diminishes to motherese. We hypothesize that, if such a weak response occurs in the awake baby or infant with ASD, it could effectively disconnect the foundational caregiver–infant loop in which affective speech such as motherese enhances the experience-expectant process of social, language, emotional and cognitive development and learning in infants^27,28. Such a foundational dysfunction, we further hypothesize, might undermine not only the early caregiver–infant experience-expectant process but also later efforts through behavioural therapy to socially engage toddlers with ASD. Many early interventions that show success for some individuals hinge critically upon the idea of changing this attribute of early ASD development^29,30,31. The hope is that early intervention will increase engagement between the child and the social world and enable experience-related neuroplasticity to divert a child towards more TD trajectories.

Motherese, found in many diverse cultures^4,5,6, is an especially strong affective tool that may augment the human-special caregiver–infant interactive loop and is thought to have a genetic basis that emerged evolutionarily in humans. As such, very early neural, behavioural and experience-expectant responses and processes in the infant are likely to be driven genetically to some significant degree and evolutionarily emergent in human infants. Thus, since ASD is highly genetic with heritability of about 81%^32,33, the marked deficits in neural and behavioural responses we identify in toddlers with ASD in cluster 4 might very well involve genetic effects, a hypothesis that should be pursued with brain-genetic studies. Also, ASD is a prenatal multi-stage, multi-process disorder that begins in the first trimester with disruption of proliferation and neurogenesis and continues throughout the second and third trimesters with disorder of neurite outgrowth, synaptogenesis and neural network function^34,35. Social and language impairments are the consequences. Indeed, dysregulation of gene expression in ASD is highly correlated with ASD social symptoms in infants and toddlers³⁶. We also identified gene co-expression networks in ASD that are associated with neural hypoactivation to language stimuli and with abnormal cortical growth in toddlers with ASD who had very poor language development^24,37. Identified genes and networks include language-relevant, ASD-associated, human-specific and prenatal genes, as well as those involved in cell proliferation and excitatory neuron development. These several studies indicate that prenatal genetic dysregulation in ASD may lead to early-age impairment in social and language development and the reduced neural responses to affective speech we describe herein. These disruptions could act as important impediments to socially engaging with the caregiver. It is of major importance that future work directly test this theory.

Understanding early-age clinical and neural ASD heterogeneity is a major challenge. Heterogeneity among TD toddlers is equally important to address but is often overlooked, leading to weakened power to detect and characterize differences between toddlers with ASD and TD toddlers. Here, we demonstrate the power of the unbiased data-driven SNF/clustering method that was used for the first time in the ASD field to resolve the neural bases of early-age social and language heterogeneity in ASD. Our data identified an ASD subgroup with a distinctive neural–clinical pattern that may suggest a poor prognosis, and another ASD subgroup with somewhat better clinical, neural and behavioural characteristics that might suggest a better prognosis. Several individuals with ASD fell in clusters 1 and 2, opening the possibility that SNF and clustering may have identified those rare toddlers with ASD who later have optimal outcomes. Long-term follow-up studies on these individuals will be valuable to test these possibilities. Notably, this method allowed us to additionally resolve the heterogeneity among TD. The relevance of the neural–clinical subgroups to the behaviour of interest was then quantitatively measured and validated using gaze-contingent eye tracking assessments of toddlers volitionally choosing to view a female telling a story in motherese or computer ‘techno’ sounds and images. This gaze-contingent eye tracking assessment, as shown in a similar method³⁸, is a tool to simulate social interaction behaviours in ASD. The present results show that social and language ability and behavioural preference for motherese are linked to how strongly temporal cortex responds to affective speech, including motherese, in toddlers across the neurodevelopmental spectrum from typical to language and social impaired.

Lastly, another exciting finding is that our purely unbiased data-driven SNF/clustering method appears to have replicated, in a new sample of toddlers using new paradigms, the presence of two main ASD subgroups with good and poor language ability that we previously identified using a subjective stratification approach^23,24. Previously, we arbitrarily stratified ASD based on a child being above (‘ASD good’) or below (‘ASD poor’) one standard deviation on the norm of Mullen expressive and receptive language scales. A similar pair of ASD subgroups emerged from the purely unbiased SNF/clustering method while also resolving TD toddler heterogeneity. Therefore, these may be reliable diagnostic and prognostic ASD subgroups that are also aetiologically and biologically meaningful. As such, future work should investigate how they may open early-age diagnostic, prognostic and/or treatment avenues for biomarker discovery.

The current study has three possible limitations that are worth discussion. The first limitation is that we collected fMRI data during natural sleep without electroencephalography as doing both simultaneously is an extremely challenging dual procedure in non-sedated toddlers³⁹. Thus, sleep stages were not monitored directly. Research has shown that, as compared with TD children, children with ASD have a longer latency before falling asleep (for example, 5 min (ref. ⁴⁰)), lower rapid eye movement (REM, a sleep phase featured by random rapid movements of eyes) sleep percentage (for example, 14.5% versus 22.6%⁴⁰), and greater non-REM sleep percentage as well as a slight difference in anterior–posterior distribution of non-REM sleep^40,41,42,43. In our study, fMRI scans started after each child was sound asleep at the beginning of the night, and by waiting about 20–30 min after sleep onset, data were collected during non-REM sleep. So, potential sleep differences between ASD and TD would not be expected to differentially account for ASD versus TD brain activation differences.

Another limitation involves the three affective language paradigms used in the present study. By definition and design, the motherese vignettes were distinct from the mild and moderate affect speech vignettes by being higher in affect, high-pitched, lyrical and sing-songy in intonational quality. The mild and moderate affect speech vignettes did not have these more vivid motherese qualities and had comparatively lower levels of positive affect, as demonstrated by our affect testing procedure. Another difference was that, while motherese and moderate affect speech had only forward speech, mild affect speech had both forward and backward speech. However, by including these different affective speech designs, we demonstrated several important points. First, inclusion of the mild affect speech (with its backward speech segment) enabled us to demonstrate that our previous results from toddlers with ASD versus TD toddlers^23,24,44,45 could be replicated in an entirely different cohort of toddlers using the identical speech stimuli. This highly substantial and successful replication shows the robustness of our fMRI approach and findings in toddlers with ASD. Second, inclusion of three different levels of affective speech enabled us to demonstrate the strong generalizability of affective language effects across paradigms in toddlers with ASD, wherein ASD had significantly reduced temporal neural responses in all three language paradigms. Third, despite these stimulus differences across paradigms, we also observed that cluster 1 individuals showed the predicted pattern of increased right temporal activation with higher levels of affect. This, in turn, revealed that the largest neural activation differences were seen in motherese (strong affective speech) between cluster 1 and cluster 4. Fourth, again despite differences in paradigm stimuli, there were no TD toddler–adult differences in activation patterns evoked by affective speech, including motherese.

A third limitation worth noting is that auditory sensory deficits in ASD might account for the different neural response to the stimuli during sleep. Figure 5b shows that higher-ability TD and lower-ability ASD are at the opposite ends, with lower-ability TD and higher-ability ASD in the middle. Although the hypothesis of deficits in lower-level sensory processing was not directly tested herein, it seems implausible that the brain activation patterns across clusters are matched by abilities in low-level auditory sensory processing across TD toddlers as well as toddlers with ASD. In addition, we previously analysed the question of general auditory processing in mild affect speech by specifically examining activation responses within the primary auditory cortex, but did not find any significant differences between ASD and TD groups²³.

In conclusion, in a one-of-a-kind fMRI study of ASD, we resolve ASD and TD neural–social–language–symptom heterogeneity into four discrete subgroups and show robust and systematic differences in how the brain of toddlers with ASD and TD toddlers responds to varying levels of affective speech, including motherese. We show that these neural differences are associated with differences in volitional behavioural preference for social–emotional motherese utterances and relate to social and communication developmental differences. Our findings support the long-standing behaviour-based theory that neural activity elicited by affective speech such as motherese may be important in driving infants to engage with caregivers in social and language learning. We speculate that enhanced neural responsiveness leads to such learning, while weaker neural responsiveness may impede or preclude it. This hypothesis predicts that neural and behavioural deficits together may be a biomarker of foundational dysregulation of social–emotional neural development and learning. As such, different ASD neural–clinical–behavioural subgroups were identified that may benefit from different treatment approaches.

Methods

This study was approved by the University of California, San Diego Institutional Review Board. Informed consent was obtained from parents or guardians of toddlers and from adult participants. Families were compensated up to $850 upon study completion. In the present study, data collection and analysis were not performed blind to the conditions of the experiments.

Participants

Toddlers were recruited through community referral and a population-based screening method in collaboration with paediatricians via the Get SET Early Approach⁴⁶, formerly known as the 1-Year Well-Baby Check-Up Approach^16,18. All toddlers participated in clinical assessments, including the Autism Diagnostic Observation Schedule (ADOS)⁴⁷, Mullen Scales of Early Learning⁴⁸ and Vineland Adaptive Behavior Scales⁴⁹. Toddlers who received their initial diagnostic and clinical evaluations at <36 months were invited to return for repeat evaluations until they reached 48 months. Clinical scores at the most recent visit were used as a best estimate of a child’s abilities (Table 1). Clinical testing occurred at the University of California, San Diego Autism Center of Excellence. Adult participants were recruited by word of mouth.

Clinical scores and fMRI scans were collected from 71 toddlers (41 with ASD, 30 TD; 53 male, 18 female, 14–55 months old). The distribution of intervals between the age at fMRI and clinical data collection is shown in Supplementary Fig. 5a. Toddlers were considered TD if their diagnosis at outcome was TD and their Mullen Early Learning Composite scores fell within two standard deviations of the group mean. This allows us to examine activation patterns along a continuum of language and cognitive abilities in TD children. A subset of toddlers (4 with ASD, 6 TD) had test–retest fMRI scans collected at intervals ranging from 1 to 15 months after the initial scan. fMRI scans were also obtained from 14 TD adults (6 male, 8 female, 20–37 years old). No statistical methods were used to pre-determine sample size, but our toddler sample sizes are similar to those reported in prior publications^23,45.

Sleep fMRI

Scans of toddlers were conducted during natural sleep, which has been proven to yield robust activation in toddlers with ASD and TD toddlers^44,45,50,51. For the sleep fMRI, on the day of fMRI scan, parents were instructed to eliminate naps from their child’s typical routine, keep their child awake while at home and arrive at the scanner 1 h past their child’s normal bedtime. In an attempt to standardize stages of sleep during scanning, babies were placed on the scanner bed approximately 20 min after sleep onset. Previous research has shown that sleep fMRI acquisition success is confined to non-REM stage 3 sleep³⁹, further leading to homogeneity of sleep state among successfully acquired scans.

Language paradigms

We presented three paradigms using female voices, one with high levels of positive affect (motherese paradigm, referred to as motherese), a second with comparatively moderate positive affect (Karen language, referred to as moderate affect speech) and a third with comparatively milder positive affect (story language, referred to as mild affect speech) (Affect level testing). Average peak frequency was 354 Hz (s.d. 67 Hz, range 258–469 Hz) for motherese, 236 Hz (s.d. 41 Hz, range 211–375 Hz) for moderate affect speech and 275 Hz (s.d. 35 Hz, range 258–328 Hz) for mild affect speech. Average beats per minute were 59 (s.d. 21, range 20–93) for motherese, 77 (s.d. 27, range 20–119) for moderate affect speech and 60 (s.d. 21, range 44–88) for mild affect speech. These three paradigms were presented in a block design (20 s stimulus, 20 s rest), and each speech vignette served as a stimulus. The order of paradigm presentation varied across participants.

The motherese paradigm consisted of 12 high-affect, age-appropriate vignettes (8 min 5 s), each spoken by a different female using high-pitched, intonational, lyrical and sing-songy speech characteristic of motherese^3,9. Thus, by definition and design, motherese differed from moderate and mild affect speech by having higher pitch and affect. The moderate affect speech consisted of 18 different age-appropriate nursery story vignettes, each spoken by different females (12 min 5 s) with moderate levels of affect.
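As a rough illustration of the block timing described above, the following Python sketch builds a stimulus-on/off boxcar sampled at the 2.5 s repetition time. It assumes one vignette per 20 s stimulus block and that each run begins with a stimulus block; the block order and the handling of the extra 5 s in the reported run lengths are not stated here, so both are assumptions, and the function names are hypothetical.

```python
def block_onsets(n_vignettes, stim_s=20.0, rest_s=20.0):
    """Onset time (s) of each stimulus block, assuming stimulus comes first."""
    return [i * (stim_s + rest_s) for i in range(n_vignettes)]


def boxcar(n_vignettes, tr_s=2.5, stim_s=20.0, rest_s=20.0):
    """1 for volumes acquired during a stimulus block, 0 during rest."""
    cycle_s = stim_s + rest_s
    n_volumes = int(n_vignettes * cycle_s / tr_s)
    return [1 if (v * tr_s) % cycle_s < stim_s else 0 for v in range(n_volumes)]


motherese = boxcar(12)  # 12 vignettes: 480 s of stimulus/rest cycles
moderate = boxcar(18)   # 18 vignettes: 720 s of stimulus/rest cycles
```

Under these assumptions, 12 and 18 stimulus/rest cycles last 8 min and 12 min, which agrees with the reported run lengths of 8 min 5 s and 12 min 5 s apart from the 5 s that the sketch does not model.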

The mild affect speech has two vignettes spoken by a single female with comparatively milder positive affect and largely absent motherese attributes. While moderate affect speech and motherese had only forward speech, mild affect speech had both forward and backward speech. As brain activation to forward and backward speech stimuli did not differ in the present study, forward and backward speech stimuli were combined, with all speech versus rest as our main contrast. The mild affect speech has been previously described and used in our earlier ASD fMRI results^23,24,44,45 with two entirely different cohorts of toddlers, neither one overlapping with the present individuals. In those previous studies, we used it to develop predictors of language outcome among toddlers with ASD²² and to identify gene expression patterns associated with fMRI language hypoactivation in ASD with poor language developmental outcomes²³. There were several reasons we included the mild affect speech stimuli. First, it enabled us to test whether our previous large-sample fMRI results from toddlers with ASD versus TD toddlers^23,24,44,45 could be replicated in an entirely different cohort of toddlers using the identical speech stimuli. Successful replication of effects would be a strong indicator of the robustness of our fMRI language activation approach and findings in TD as well as toddlers with ASD. Additionally, including this paradigm along with new paradigms enabled us to demonstrate the generalizability of language effects across paradigms in toddlers with ASD. 
Second, by including several stimuli with variable levels of affect, we could examine whether: (a) ASD had significantly reduced temporal neural responses in all three types of language affect paradigms, irrespective of low-level basic stimulus features; (b) there were similarities between TD toddlers and adults in cortical regions activated by the three language affect paradigms, irrespective of low-level basic stimulus features; and (c) TD toddlers would show a predicted pattern of increased right temporal activation with enhanced affective prosody.

Affect level testing

Two computer-based surveys were administered to TD adults to test affect levels of language paradigms.

Each fMRI paradigm consists of unique language segments, that is, 2 mild affect speech, 18 moderate affect speech and 12 motherese segments. For survey 1, each unique segment was presented in random order (same order for each participant). TD adults (n = 19) were instructed to listen to each segment and respond using a Likert scale of 1–5, with a rating of 1 indicating the least amount of affect and a rating of 5 indicating the most.

Survey 2 consisted of 18 trials, each containing a mild affect speech segment, a moderate affect speech segment and a motherese segment. Presenting all three stimulus types allowed for evaluation of differences in affect level across all language paradigms. TD adults (n = 15) then rated each segment using a Likert scale of 1–3, with a rating of 1 indicating the least amount of affect, 2 indicating some affect and 3 indicating very strong affect.

fMRI data acquisition

All fMRI data were collected in a 3 T GE scanner at the University of California, San Diego Center for Functional MRI. Functional images were acquired with a multi-echo echo planar imaging protocol (echo time (TE) 15 ms, 28 ms, 42 ms, 56 ms; repetition time (TR) 2,500 ms; flip angle 78°; matrix size 64 × 64; slice thickness 4 mm; field of view 256 mm; 34 slices). Structural images were acquired using a T1-weighted 3D magnetization-prepared rapid gradient-echo sequence (field of view 256 mm; TE 3.172 ms; TR 8.142 ms; flip angle 12°).

Imaging data preprocessing

Functional data were preprocessed using the multi-echo independent component analysis pipeline ‘meica.py’^52,53 implemented in AFNI⁵⁴ and Python. First, the first four volumes of each run were discarded to allow for magnetization to reach steady state. Next, motion correction parameters were calculated based on the first TE images (TE 15 ms) using a rigid-body alignment procedure. Slice timing correction was implemented for functional images of each TE, which were then normalized to an age-matched infant template⁵⁵. The time series of four TEs were combined into a single time series⁵⁶. Both principal and independent component analyses were applied to denoise the data through isolation of thermal (that is, random) noise from structured signals (that is, blood oxygenation level dependent (BOLD, an indirect measure of neural activity via the relative levels of oxyhaemoglobin and deoxyhaemoglobin) and non-BOLD signals) and separation of BOLD and non-BOLD signals. Only the BOLD-like components were retained in the preprocessed images, which were then spatially smoothed with an 8 mm full width at half maximum Gaussian kernel.

Head motion was quantified via framewise displacement (FD)⁵⁷. For adults and sleeping toddlers, head motion was minimal (mean FD <0.12 mm). There were no group differences either between ASD and TD groups or between adults and toddler groups (Supplementary Table 4).
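For readers unfamiliar with FD, the sketch below computes it from six rigid-body motion parameters per volume. It assumes the widely used definition in which absolute frame-to-frame changes in the three translations (mm) are summed with the three rotations converted to arc length on a 50 mm sphere; the radius, the radian units and the parameter order are assumptions, not details reported here.

```python
def framewise_displacement(motion, radius_mm=50.0):
    """FD for each consecutive pair of volumes.

    `motion` holds one row per volume: three translations in mm followed by
    three rotations in radians (assumed order and units).
    """
    fd = []
    for prev, cur in zip(motion, motion[1:]):
        translation = sum(abs(c - p) for p, c in zip(prev[:3], cur[:3]))
        rotation = sum(abs(c - p) for p, c in zip(prev[3:], cur[3:]))
        # Rotations become millimetres of arc on a sphere of the given radius.
        fd.append(translation + radius_mm * rotation)
    return fd


# Hypothetical motion estimates for three volumes.
motion = [
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.1, 0.05, 0.0, 0.001, 0.0, 0.0],
]
fd = framewise_displacement(motion)
mean_fd = sum(fd) / len(fd)
```

The mean of these values across a run corresponds to the mean FD covariate used in the group models.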

Whole-brain analyses

First-level and second-level whole-brain activation analyses were conducted with the general linear model in SPM12 (https://www.fil.ion.ucl.ac.uk/spm/software/spm12/). Events in first-level models were based on the canonical haemodynamic response function and its temporal derivative. To take into account the repeated measurements, that is, retest scans, we ran second-level whole-brain analyses with mixed-effects models using the 3dMVM program⁵⁸ in AFNI:

$$\begin{array}{l}{\mathrm{brain}}\,{\mathrm{activation}}\\ = \beta _0 + \beta _1 \times {\mathrm{group}} + \beta _2 \times {\mathrm{age}} + \beta _3 \times {\mathrm{gender}} + \beta _4 \times {\mathrm{meanFD}} + \varepsilon.\end{array}$$

In mixed-effects models, brain activation to each language paradigm (that is, speech versus rest contrast) served as a dependent variable. Individuals were treated as a random effect, which allows for fixed effects (that is, age, gender and mean FD) to vary for each individual.

Using a similar approach, we conducted whole-brain analyses with adult data for each language paradigm. However, only within-group tests were performed as all adult participants had typical development.

Resulting activation maps were corrected for multiple comparisons with the family-wise error approach using the 3dClustSim program in AFNI (voxel wise P = 0.005, cluster size >138 voxels for adults and >186 voxels for toddlers). This spatial cluster correction took into account spatial autocorrelation by using the ‘-acf’ option in 3dClustSim.

Calculation of per cent signal changes in temporal regions

Two language-relevant ROIs from the meta-analytic activation map in Neurosynth (https://neurosynth.org/) with the term ‘language’, including left and right temporal regions, were used for ROI analysis (Fig. 2b). These ROIs were identical to those used in previous papers^23,24. Given that a toddler template was used for toddler samples, ROIs were co-registered to the toddler template using FSL’s ‘flirt’ function^59,60. For each language paradigm, per cent signal changes were calculated with first-level models in speech versus rest contrast for all toddlers and adults.

Stability and validation of fMRI activation in toddlers

Given the challenges relating to implementing sleep imaging with toddlers, test–retest is rarely examined, but it is essential for determining the rigour of this approach. Additional key questions surround the degree to which functional activation patterns vary along the dimension of sleep and awake states, and developmental periods such as between toddlers and adults or in individuals across time. Here, we took steps towards filling these gaps and tested: (1) whether brain activation to language stimuli in sleeping TD toddlers is similar to that in passively listening awake adults and (2) whether brain activation patterns are stable and reproducible in TD individuals and individuals with ASD across time.

These questions were addressed by comparing brain activation patterns between TD adults and TD toddlers, and by computing intraclass correlation coefficients of brain activation within toddlers who were scanned multiple times at intervals of 1–15 months, respectively.
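To make the test–retest measure concrete, the following sketch computes one common form of the intraclass correlation coefficient from repeated scans. The specific form is not stated here, so the choice of ICC(3,1) (two-way mixed, single measurement, consistency) is an assumption, as are the function name and the toy values.

```python
def icc_consistency(scores):
    """ICC(3,1) from a table with one row per toddler and one column per scan."""
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(col) / n for col in zip(*scores)]
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)  # between toddlers
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)  # between sessions
    ms_rows = ss_rows / (n - 1)
    ms_error = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical per cent signal change at scan 1 and scan 2 for four toddlers.
shifted = icc_consistency([[0.2, 0.3], [0.5, 0.6], [0.9, 1.0], [0.4, 0.5]])
noisy = icc_consistency([[0.2, 0.5], [0.5, 0.3], [0.9, 1.0], [0.4, 0.6]])
```

In the first toy table every toddler shifts by the same amount between scans, so the rank order and spacing are preserved and the consistency ICC is at its maximum; the second table adds session-specific noise and yields a lower value.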

Activation differences between ASD versus TD and TD versus adults

Next, we compared per cent signal change values (extracted from prior temporal ROIs relevant to language processing) between toddlers with ASD versus TD toddlers as well as between sleeping TD toddlers versus awake adults using two-sample two-tailed t tests, excluding repeated time points. Data distribution was assumed to be normal, but this was not formally tested.

Brain–behaviour correlation analysis

Using mixed-effects models similar to those described above, but including signal change values for all three language paradigms and with Vineland socialization or communication scores as a predictor of interest (age, gender and mean FD as control variables/fixed effects; both language paradigms and individuals as random effects), we investigated the relevance of brain activation to a child’s social and communication abilities assessed by the Vineland Adaptive Behavior Scales⁴⁹. The Vineland socialization and communication scores and scan data from all 71 individuals, including test and retest scan data, were used in the mixed-effects models.

Motherese eye tracking paradigm

We used a novel eye tracking motherese paradigm that utilized gaze-contingent technology wherein a toddler’s gaze activates what he/she sees and hears. In this gaze-contingent paradigm, motherese and techno movies were presented side by side on the screen, and toddlers could choose to watch a movie depicting an actress telling a story using motherese or computer ‘techno’ sounds and images. The motherese vignette used in this eye tracking task was 1 of the 12 motherese stimuli included in fMRI experiments. Gaze-contingent paradigms provide strong evidence of volitional preference and attention, as opposed to passive looking. Our gaze-contingent design is a variant of other gaze-contingent eye tracking designs that simulate social interactions³⁸.

Eye tracking was conducted using Tobii software (Tobii Studio; Tobii Pro Lab), and fixation data were collected using a velocity threshold of 0.42 pixels per ms (Tobii Studio Tobii Fixation Filter) or 0.03 degrees per ms (Tobii Pro Lab Tobii IV-T Fixation Filter). Of the 71 toddlers (41 with ASD, 30 TD; 53 male, 18 female, 14–55 months old) who participated in the eye tracking assessment, 54 toddlers (31 with ASD, 23 TD; 41 male, 13 female, 12–42 months) had moderate or good eye tracking performance and total looking time >50% and were therefore included in the analysis. For 37 toddlers, eye tracking data were collected prior to the MRI scan, while 17 toddlers completed the task after the MRI scan. The distribution of intervals between the age at fMRI and eye-tracking data collection is displayed in Supplementary Fig. 5b. Percentage fixation to motherese was calculated as a ratio of total fixation time to motherese and total fixation time to both motherese and techno displays; the sum of percentage fixation to motherese and techno was 100. Preference for motherese was characterized by per cent fixation to motherese, which was compared between individuals with ASD and TD individuals.
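The preference measure defined above reduces to a simple ratio, sketched here in Python. The function name and the example fixation times are hypothetical.

```python
def percent_fixation_to_motherese(motherese_s, techno_s):
    """Share of total fixation time (both displays) spent on the motherese movie."""
    total = motherese_s + techno_s
    if total <= 0:
        raise ValueError("no fixation time recorded on either display")
    return 100.0 * motherese_s / total


# Hypothetical toddler: 30 s fixating motherese, 10 s fixating techno.
preference = percent_fixation_to_motherese(30.0, 10.0)
techno_share = 100.0 - preference
```

Because the denominator includes only the two displays, the motherese and techno percentages always sum to 100, as stated in the text.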

Clustering analysis using SNF

SNF is a novel approach for capturing heterogeneity in multiple types of patient data and forming clusters or subgroups. The method reduces noise by aggregating across multiple types of data, detects common and complementary signals from different types of data and reveals the importance of each data type to patient similarity. SNF disentangles neural and clinical heterogeneity by grouping together individuals with the greatest similarity of patterns derived from combining normalized data from all data types. To do so, SNF normalizes all modality values and integrates patterns of values into a single fused network score per individual. As such, it assigns each toddler to clusters based on score similarity. It does not depend on or return conventional parametric statistics. Toddlers in each cluster have maximal similarity to each other in composite patterns of neural and clinical features, and maximal differences from toddlers in other clusters.

We used SNF²¹ to integrate fMRI brain activation in three language paradigms and clinical measures and then used the Louvain algorithm⁶¹ to detect clusters of the similarity network. For this analysis, we included a subset of 50 of the 71 toddlers (30 with ASD, 20 TD) who had successful scans of all three language paradigms. The analysis was performed with six ROI variables (left and right temporal activation for each of the three language paradigms) and 14 clinical variables (three ADOS variables: ADOS social affect, ADOS restricted and repetitive behaviour, and ADOS total; six Vineland variables: Vineland communication, Vineland socialization, Vineland daily living skill, Vineland motor, Vineland adaptive behaviour and Vineland domain total; five Mullen variables: Mullen fine motor, Mullen receptive language, Mullen visual reception, Mullen expressive language and Mullen early learning composite) in R with the SNFtool package. We included both subscale and composite or total variables as they provide different but complementary information. First, ROI and clinical data were normalized separately. Next, pairwise distance matrices between individuals were calculated for ROI or clinical data. Affinity matrices (networks) were computed based on distance matrices. Each affinity matrix is equivalent to a similarity network where nodes are samples (for example, individuals), and weighted edges represent pairwise sample similarities. Network fusion that iteratively updates every network was then performed, making two networks more similar to each other with every iteration. After a few iterations, two networks converged to a single network. Specifically, the similarity matrices were computed by setting the nearest-neighbour parameter to 20 and setting the hyperparameter to 0.5. The iteration for convergence was set as 20. All these parameters were set following the original SNF paper²¹. 
We constructed the network with the strongest 15% connecting partners of each individual and ran the clustering analysis with the Louvain community algorithm. The clusters were visualized with Cytoscape⁶².
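The fusion steps above can be sketched in plain Python for two data views. This is a simplified illustration patterned on the scaled exponential kernel and cross-network update described for SNF, not the SNFtool implementation: it omits SNFtool's per-iteration normalization, stops before Louvain clustering, and its reading of the "strongest 15% connecting partners" rule is an assumption. All function names and data values are hypothetical, and k is reduced to 3 because the toy sample has only six individuals.

```python
import math


def zscore_columns(rows):
    """Standardize every feature (column) to mean 0 and s.d. 1."""
    cols = []
    for col in zip(*rows):
        mean = sum(col) / len(col)
        sd = math.sqrt(sum((v - mean) ** 2 for v in col) / (len(col) - 1))
        cols.append([(v - mean) / sd if sd > 0 else 0.0 for v in col])
    return [list(r) for r in zip(*cols)]


def affinity(rows, k, mu):
    """Scaled exponential similarity on pairwise Euclidean distances."""
    n = len(rows)
    d = [[math.dist(rows[i], rows[j]) for j in range(n)] for i in range(n)]
    kk = min(k, n - 1)
    # Mean distance from each sample to its kk nearest other samples.
    near = [sum(sorted(d[i][j] for j in range(n) if j != i)[:kk]) / kk
            for i in range(n)]
    return [[math.exp(-d[i][j] ** 2 /
                      (mu * ((near[i] + near[j] + d[i][j]) / 3 + 1e-12)))
             for j in range(n)] for i in range(n)]


def full_kernel(w):
    """Row-normalized similarity with self-similarity fixed at 1/2."""
    n = len(w)
    out = []
    for i in range(n):
        off = sum(w[i][j] for j in range(n) if j != i)
        out.append([0.5 if j == i else w[i][j] / (2 * off) for j in range(n)])
    return out


def local_kernel(w, k):
    """Keep each sample's k strongest similarities (self included), renormalized."""
    n = len(w)
    out = []
    for i in range(n):
        keep = set(sorted(range(n), key=lambda j: w[i][j], reverse=True)[:k])
        total = sum(w[i][j] for j in keep)
        out.append([w[i][j] / total if j in keep else 0.0 for j in range(n)])
    return out


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(r) for r in zip(*a)]


def fuse(view_a, view_b, k=20, mu=0.5, iterations=20):
    """Fuse two views into one sample-by-sample similarity matrix."""
    w_a = affinity(zscore_columns(view_a), k, mu)
    w_b = affinity(zscore_columns(view_b), k, mu)
    p_a, p_b = full_kernel(w_a), full_kernel(w_b)
    s_a, s_b = local_kernel(w_a, k), local_kernel(w_b, k)
    for _ in range(iterations):
        # Each network is updated from the other network, diffused through
        # its own local neighbourhood structure.
        p_a, p_b = (matmul(matmul(s_a, p_b), transpose(s_a)),
                    matmul(matmul(s_b, p_a), transpose(s_b)))
    n = len(p_a)
    return [[(p_a[i][j] + p_b[i][j]) / 2 for j in range(n)] for i in range(n)]


def strongest_edges(sim, fraction=0.15):
    """Edges from each sample to its strongest `fraction` of other samples."""
    n = len(sim)
    keep = max(1, round(fraction * (n - 1)))
    edges = set()
    for i in range(n):
        partners = sorted((j for j in range(n) if j != i),
                          key=lambda j: sim[i][j], reverse=True)
        edges.update(frozenset((i, j)) for j in partners[:keep])
    return edges


# Toy data: six individuals forming two well-separated groups in both views.
roi = [[0.9, 1.0], [1.0, 0.9], [0.95, 1.05], [0.1, 0.2], [0.2, 0.1], [0.15, 0.12]]
clinical = [[100, 105], [98, 110], [104, 101], [60, 62], [65, 58], [58, 66]]
fused = fuse(roi, clinical, k=3)
edges = strongest_edges(fused)
```

In this toy case the fused similarities within each group are much larger than those between groups, so the retained edges connect only members of the same group; a community detection algorithm such as Louvain would then be run on the resulting network.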

Further, SNF clustering results were examined for how identified clusters compare on an independent measure of interest. In the original SNF paper, identified genomic patient clusters were compared for outcome differences in cancer survival times of patients. Here, the measure of interest regarding identified clusters was behavioural response to the socially compelling motherese vignette, that is, a test of each child’s current social preference for motherese.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The tidy data used in this study are publicly available at https://github.com/Yaqiongxiao/asdmotherese_fmriSNF.

Code availability

Complete R code for implementing all analyses reported in this article is available at https://github.com/Yaqiongxiao/asdmotherese_fmriSNF.

References

Kuhl, P. K. Is speech learning ‘gated’ by the social brain? Dev. Sci. 10, 110–120 (2007).
Kuhl, P. K. Brain mechanisms in early language acquisition. Neuron 67, 713–727 (2010).
Saint-Georges, C. et al. Motherese in interaction: at the cross-road of emotion and cognition? (A systematic review). PLoS ONE 8, 1–17 (2013).
Kuhl, P. K. et al. Cross-language analysis of phonetic units in language addressed to infants. Science 277, 684–686 (1997).
Grieser, D. A. L. & Kuhl, P. K. Maternal speech to infants in a tonal language: support for universal prosodic features in motherese. Dev. Psychol. 24, 14–20 (1988).
Falk, D. Prelinguistic evolution in early hominins: whence motherese? Behav. Brain Sci. 27, 491–541 (2004).
Cooper, R. P. & Aslin, R. N. Preference for infant-directed speech in the first month after birth. Child Dev. 61, 1584 (1990).
Fernald, A. Four-month-old infants prefer to listen to motherese. Infant Behav. Dev. 8, 181–195 (1985).
Kuhl, P. K., Coffey-Corina, S., Padden, D. & Dawson, G. Links between social and linguistic processing of speech in preschool children with autism: behavioral and electrophysiological measures. Dev. Sci. 8, F1–F12 (2005).
Pegg, J. E., Werker, J. F. & McLeod, P. J. Preference for infant-directed over adult-directed speech: evidence from 7-week-old infants. Infant Behav. Dev. 15, 325–345 (1992).
Saito, Y. et al. Frontal cerebral blood flow change associated with infant-directed speech. Arch. Dis. Child. Fetal Neonatal Ed. 92, F113–F116 (2007).
Santesso, D. L., Schmidt, L. A. & Trainor, L. J. Frontal brain electrical activity (EEG) and heart rate in response to affective infant-directed (ID) speech in 9-month-old infants. Brain Cogn. 65, 14–21 (2007).
Sulpizio, S. et al. fNIRS reveals enhanced brain activation to female (versus male) infant directed speech (relative to adult directed speech) in young human infants. Infant Behav. Dev. 52, 89–96 (2018).
Zangl, R. & Mills, D. L. Increased brain activity to infant-directed speech in 6- and 13-month-old infants. Infancy 11, 31–62 (2007).
Zhang, Y. et al. Neural coding of formant-exaggerated speech in the infant brain. Dev. Sci. 14, 566–581 (2011).
Pierce, K. et al. Detecting, studying, and treating autism early: the one-year well-baby check-up approach. J. Pediatr. 159, 458–465.e6 (2011).
Pierce, K., Courchesne, E. & Bacon, E. To screen or not to screen universally for autism is not the question: why the task force got it wrong. J. Pediatr. 176, 182–194 (2016).
Pierce, K. et al. Evaluation of the diagnostic stability of the early autism spectrum disorder phenotype in the general population starting at 12 months. JAMA Pediatr. 173, 578–587 (2019).
Bacon, E. C. et al. Rethinking the idea of late autism spectrum disorder onset. Dev. Psychopathol. 30, 553–569 (2018).
Bruinsma, Y., Koegel, R. L. & Koegel, L. K. Joint attention and children with autism: a review of the literature. Ment. Retard. Dev. Disabil. Res. Rev. 10, 169–175 (2004).
Wang, B. et al. Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 11, 333–337 (2014).
Pai, S. & Bader, G. D. Patient similarity networks for precision medicine. J. Mol. Biol. 430, 2924–2938 (2018).
Lombardo, M. V. et al. Different functional neural substrates for good and poor language outcome in autism. Neuron 86, 267–277 (2015).
Lombardo, M. V. et al. Large-scale associations between the leukocyte transcriptome and BOLD responses to speech differ in autism early language outcome subtypes. Nat. Neurosci. 21, 1680–1688 (2018).
Klin, A. Listening preferences in regard to speech in four children with developmental disabilities. J. Child Psychol. Psychiatry 33, 763–769 (1992).
Klin, A. Young autistic children’s listening preferences in regard to speech: a possible characterization of the symptom of social withdrawal. J. Autism Dev. Disord. 21, 29–42 (1991).
Ferjan Ramírez, N., Lytle, S. R., Fish, M. & Kuhl, P. K. Parent coaching at 6 and 10 months improves language outcomes at 14 months: a randomized controlled trial. Dev. Sci. 22, e12762 (2019).
Ferjan Ramírez, N., Lytle, S. R. & Kuhl, P. K. Parent coaching increases conversational turns and advances infant language development. Proc. Natl Acad. Sci. USA 117, 3484–3491 (2020).
Bacon, E. C. et al. Measuring outcome in an early intervention program for toddlers with autism spectrum disorder: use of a curriculum-based assessment. Autism Res. Treat. 2014, 964704 (2014).
Dawson, G. et al. Randomized, controlled trial of an intervention for toddlers with autism: the early start Denver model. Pediatrics 125, e17–e23 (2010).
Kasari, C., Freeman, S. & Paparella, T. Joint attention and symbolic play in young children with autism: a randomized controlled intervention study. J. Child Psychol. Psychiatry Allied Discip. 47, 611–620 (2006).
Sandin, S. et al. The heritability of autism spectrum disorder. J. Am. Med. Assoc. 318, 1182–1184 (2017).
Bai, D. et al. Association of genetic and environmental factors with autism in a 5-country cohort. JAMA Psychiatry 76, 1035–1043 (2019).
Courchesne, E. et al. The ASD living biology: from cell proliferation to clinical phenotype. Mol. Psychiatry 24, 88–107 (2019).
Courchesne, E., Gazestani, V. H. & Lewis, N. E. Prenatal origins of ASD: the when, what, and how of ASD development. Trends Neurosci. 43, 326–342 (2020).
Gazestani, V. H. et al. A perturbed gene network containing PI3K–AKT, RAS–ERK and WNT-β-catenin pathways in leukocytes is linked to ASD genetics and symptom severity. Nat. Neurosci. 22, 1624–1634 (2019).
Lombardo, M. V. et al. Atypical genomic patterning of the cerebral cortex in autism with poor early language outcome. Sci. Adv. 7, eabh1663 (2021).
Vernetti, A. et al. Simulating interaction: using gaze-contingent eye-tracking to measure the reward value of social signals in toddlers with and without autism. Dev. Cogn. Neurosci. 29, 21–29 (2018).
Manning, J. H., Courchesne, E. & Fox, P. T. Intrinsic connectivity network mapping in young children during natural sleep. Neuroimage 83, 288–293 (2013).
Buckley, A. W. et al. Rapid eye movement sleep percentage in children with autism compared with children with developmental delay and typical development. Arch. Pediatr. Adolesc. Med. 164, 1032–1037 (2010).
Devnani, P. A. & Hegde, A. U. Autism and sleep disorders. J. Pediatr. Neurosci. 10, 304–307 (2015).
Goldman, S. E. et al. Defining the sleep phenotype in children with autism. Dev. Neuropsychol. 34, 560–573 (2009).
Lehoux, T., Carrier, J. & Godbout, R. NREM sleep EEG slow waves in autistic and typically developing children: morphological characteristics and scalp distribution. J. Sleep. Res. 28, 1–6 (2019).
Article Google Scholar
Redcay, E. & Courchesne, E. Deviant functional magnetic resonance imaging patterns of brain activity to speech in 2–3-year-old children with autism spectrum disorder. Biol. Psychiatry 64, 589–598 (2008).
Article PubMed PubMed Central Google Scholar
Eyler, L. T., Pierce, K., Courchesne, E., Cheng, A. & Barnes, C. C. A failure of left temporal cortex to specialize for language is an early emerging and fundamental property of autism. Brain 135, 949–960 (2012).
Article PubMed PubMed Central Google Scholar
Pierce, K. et al. Get SET early to identify and treatment refer autism spectrum disorder at 1 year and discover factors that influence early diagnosis. J. Pediatr. 236, 179–188 (2021).
Article PubMed Google Scholar
Lord, C., Elsabbagh, M., Baird, G. & Veenstra-Vanderweele, J. Autism spectrum disorder. Lancet 392, 508–520 (2018).
Article PubMed PubMed Central Google Scholar
Mullen, E. M. Mullen Scales of Early Learning (American Guidance Service, 1995).
Sparrow, S., Cicchetti, D. & Balla, D. Vineland-II Scales of Adaptive Behavior: Survey Form Manual (American Guidance Service, 2005).
Dehaene-Lambertz, G., Dehaene, S. & Hertz-Pannier, L. Functional neuroimaging of speech perception in infants. Science 298, 2013–2015 (2002).
Article CAS PubMed Google Scholar
Redcay, E., Kennedy, D. P. & Courchesne, E. fMRI during natural sleep as a method to study brain function during early childhood. Neuroimage 38, 696–707 (2007).
Article PubMed Google Scholar
Kundu, P., Inati, S. J., Evans, J. W., Luh, W. M. & Bandettini, P. A. Differentiating BOLD and non-BOLD signals in fMRI time series using multi-echo EPI. Neuroimage 60, 1759–1770 (2012).
Article PubMed Google Scholar
Kundu, P. et al. Integrated strategy for improving functional connectivity mapping using multiecho fMRI. Proc. Natl Acad. Sci. USA 110, 16187–16192 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cox, R. W. AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Comput. Biomed. Res. 29, 162–173 (1996).
Article CAS PubMed Google Scholar
Shi, F. et al. Infant brain atlases from neonates to 1- and 2-year-olds. PLoS ONE 6, e18746 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kundu, P. et al. Multi-echo fMRI: a review of applications in fMRI denoising and analysis of BOLD signals. Neuroimage 154, 59–80 (2017).
Article PubMed Google Scholar
Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L. & Petersen, S. E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage 59, 2142–2154 (2012).
Article PubMed Google Scholar
Chen, G., Adleman, N. E., Saad, Z. S., Leibenluft, E. & Cox, R. W. Applications of multivariate modeling to neuroimaging group analysis: a comprehensive alternative to univariate general linear model. Neuroimage 99, 571–588 (2014).
Article PubMed Google Scholar
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825–841 (2002).
Article PubMed Google Scholar
Jenkinson, M. & Smith, S. A global optimisation method for robust affine registration of brain images. Med. Image Anal. 5, 143–156 (2001).
Article CAS PubMed Google Scholar
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008, P10008 (2008).
Article Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the parents and children in San Diego who participated in our research, without whom this work would not be possible. We are also fortunate to work with wonderful paediatricians and family practice physicians spanning a range of medical groups including UCSD, Sharp Rees-Stealy, Scripps, Rady-Children’s Primary Care Medical Group, Chula Vista Pediatrics, Graybill Medical Group, Grossmont Pediatrics, Linda Vista Health Care Center, Mills Pediatrics, North County Health Services, San Diego Family Care and Sea Breeze Pediatrics. We are grateful for their support. This work was supported by NIDCD grant 1R01DC016385 awarded to E.C. and K.P.; NIMH grants R01MH118879 and R01MH104446 awarded to K.P.; and European Research Council grant 755816 awarded to M.V.L. and E.C. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations

Autism Center of Excellence, Department of Neurosciences, University of California, San Diego, La Jolla, CA, USA
Yaqiong Xiao, Teresa H. Wen, Disha Goel, Karen Pierce & Eric Courchesne
Department of Psychology, University of Miami, Coral Gables, FL, USA
Lauren Kupis
Department of Psychiatry, University of California, San Diego, La Jolla, CA, USA
Lisa T. Eyler
VISN 22 Mental Illness Research, Education, and Clinical Center, VA San Diego Healthcare System, San Diego, CA, USA
Lisa T. Eyler
Point Loma Pediatrics, UC San Diego Health Physician Network, San Diego, CA, USA
Keith Vaux
Laboratory for Autism and Neurodevelopmental Disorders, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, Rovereto, Italy
Michael V. Lombardo
Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridge, UK
Michael V. Lombardo
Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA
Nathan E. Lewis

Authors

Yaqiong Xiao
Teresa H. Wen
Lauren Kupis
Lisa T. Eyler
Disha Goel
Keith Vaux
Michael V. Lombardo
Nathan E. Lewis
Karen Pierce
Eric Courchesne

Contributions

E.C., K.P., L.T.E. and L.K. conceived the idea and designed the study. L.K., D.G., T.H.W. and K.V. recruited the participants. L.K., D.G., T.H.W., Y.X., L.T.E. and E.C. collected the data. Y.X. conceived and performed all analyses. E.C., M.V.L. and N.E.L. aided in data analyses. E.C., K.P. and M.V.L. obtained grant funding. Y.X. and E.C. wrote the manuscript. All authors contributed to editing the manuscript.

Corresponding authors

Correspondence to Yaqiong Xiao, Karen Pierce or Eric Courchesne.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Human Behaviour thanks Laura Edwards and Giorgia Silani for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Tables 1–4 and Figs. 1–5.

About this article

Cite this article

Xiao, Y., Wen, T.H., Kupis, L. et al. Neural responses to affective speech, including motherese, map onto clinical and social eye tracking profiles in toddlers with ASD. Nat Hum Behav 6, 443–454 (2022). https://doi.org/10.1038/s41562-021-01237-y

Received: 21 October 2020
Accepted: 22 October 2021
Published: 03 January 2022
Issue Date: March 2022
DOI: https://doi.org/10.1038/s41562-021-01237-y
