- Open Access
Linguistic camouflage in girls with autism spectrum disorder
Molecular Autismvolume 8, Article number: 48 (2017)
Autism spectrum disorder (ASD) is diagnosed more frequently in boys than girls, even when girls are equally symptomatic. Cutting-edge behavioral imaging has detected “camouflaging” in girls with ASD, wherein social behaviors appear superficially typical, complicating diagnosis. The present study explores a new kind of camouflage based on language differences. Pauses during conversation can be filled with words like UM or UH, but research suggests that these two words are pragmatically distinct (e.g., UM is used to signal longer pauses, and may correlate with greater social communicative sophistication than UH). Large-scale research suggests that women and younger people produce higher rates of UM during conversational pauses than do men and older people, who produce relatively more UH. Although it has been argued that children and adolescents with ASD use UM less often than typical peers, prior research has not included sufficient numbers of girls to examine whether sex explains this effect. Here, we explore UM vs. UH in school-aged boys and girls with ASD, and ask whether filled pauses relate to dimensional measures of autism symptom severity.
Sixty-five verbal school-aged participants with ASD (49 boys, 16 girls, IQ estimates in the average range) participated, along with a small comparison group of typically developing children (8 boys, 9 girls). Speech samples from the Autism Diagnostic Observation Schedule were orthographically transcribed and time-aligned, with filled pauses marked. Parents completed the Social Communication Questionnaire and the Vineland Adaptive Behavior Scales.
Girls used UH less often than boys across both diagnostic groups. UH suppression resulted in higher UM ratios for girls than boys, and overall filled pause rates were higher for typical children than for children with ASD. Higher UM ratios correlated with better socialization in boys with ASD, but this effect was driven by increased use of UH by boys with greater symptoms.
Pragmatic language markers distinguish girls and boys with ASD, mirroring sex differences in the general population. One implication of this finding is that typical-sounding disfluency patterns (i.e., reduced relative UH production leading to higher UM ratios) may normalize the way girls with ASD sound relative to other children, serving as “linguistic camouflage” for a naïve listener and distinguishing them from boys with ASD. This first-of-its-kind study highlights the importance of continued commitment to understanding how sex and gender change the way that ASD manifests, and illustrates the potential of natural language to contribute to objective “behavioral imaging” diagnostics for ASD.
N.B.: In this paper, our terminology is drawn from World Health Organization definitions . The word “sex” refers to genetic makeup, and “gender” refers to a socio-cultural construct. Consistent with a recent review of sex differences in ASD , we explicitly acknowledge the significant and inevitable overlap between the two. Here, we use the words “girl” and “boy” to refer to biological sex.
Autism spectrum disorder (ASD) is a behaviorally defined condition predominantly found in males [10, 19, 40, 49]. Recent research suggests that girls with ASD may “camouflage” real struggles with social communication by engaging in social mimicry and behaving in ways that are superficially typical, thus complicating diagnosis [7, 34]. For example, nonverbal communication (e.g., gesture) is broadly impaired in ASD . Using 3D motion capture, a recent study showed that girls with ASD gesture in ways that are more vibrant and noticeable than boys with ASD, despite similar struggles with social communication . In this way, girls effectively modified their behavior to mask a traditional area of weakness. In the present study, we use granular language-based analysis to explore another way in which girls with ASD achieve this social camouflage effect: by producing sex-typical filled pauses during naturalistic conversation.
In the course of normal conversation, interlocutors pause, revise, and re-work their utterances, a process termed disfluency . Disfluencies affect how speakers are perceived by others, and elevated rates of speech disfluencies have been linked to negative social perceptions of the speaker [8, 22, 45]. Pauses are disfluencies that occur during natural speech and can either remain unfilled (silent pauses) or be filled by words like um, uh, like, and you know (filled pauses). UM and UH, in particular, are thought to be social pragmatic hesitation markers with communicative value (e.g., facilitating comprehension by signaling to the listener that the speaker needs more time to finish communicating their current thought, or that they would like to hold/cede the floor ). However, there is growing evidence to suggest that UM and UH are not the same; UH is used to signal a short delay while UM may be used to signal more significant delays . Large-scale cross-linguistic studies of spoken and written language show that these two markers are used with varying frequency by people from different demographic groups [1, 59]. For example, UM is used relatively more often by women, educated individuals, and younger generations than by men, less educated individuals, and older generations. In contrast, UH is used relatively more frequently by men, less educated individuals, and older generations than by women, more educated individuals, and younger generations (M. [37, 59]). This pattern is robust across American English, British English, Scottish English, Dutch, Norwegian, German, Danish, and Faroese .
Filled pauses in ASD
A significant body of research suggests that children and adults with ASD produce unusually high rates of various speech disfluencies [16, 35, 42, 50, 51, 54], but only three studies have specifically examined UM and UH. First, 4- to 8-year-olds with ASD use UM at lower rates than children without ASD . This finding suggests that UM and UH may hinge on distinct cognitive processes, with UM more powerfully affected by the presence of autism . Subsequent research showed that UM use by 8- to 21-year-olds normalizes when overall ASD symptoms drop off [28, 46], again lending support to the hypothesis that UM is associated with social competence. In a third study, language samples from children with ASD were compared to samples from children with specific language impairment, and individual variation in UM use was found to be a marker of pragmatic ability rather than a consequence of general language skill . Taken together, these studies suggest that UM and UH are used differently by children with ASD, and that UM ratios [um/(um + uh)] associate with ASD symptoms.
One major limitation of this line of research is that sex differences in filled pause use have never been studied in individuals with ASD. If females have higher UM ratios than males , slight sex ratio differences across diagnostic groups could lead to erroneous results. For example, Gorman et al.  included 10% girls in their ASD group and 30% girls in their typical group. Elevated UM ratios in the typical group relative to the ASD group might thus be a consequence of sex ratio imbalances (because girls may produce higher UM ratios than boys)—and the TD group was enriched for girls. This observation, that slight differences in sex ratio by diagnostic group could change the results of scientific studies in a variety of domains, has implications for how we evaluate the presence and severity of ASD symptoms in girls and boys, and will influence the hunt for behavioral markers of ASD more broadly. In the case of UM and UH specifically, if girls with ASD exhibit gender-normative speech patterns, it could bias perception of their ASD symptoms. The current literature is unable to address this question, primarily out of an absence of speech data from females with ASD. Apart from Gorman et al. , the other two papers that explored UM vs. UH in children with ASD either did not report sample sex ratios at all , or included insufficient numbers of female participants with ASD to assess potential effects of sex on filler production (3 out of 24 or 12.5% in ).
The present study fills this gap with a large sample of children on the spectrum, including a representative proportion of girls (N = 65, 16 girls, 25%), along with a small comparison sample of typical children (N = 17, 9 girls, 53%). We hypothesized that girls and boys with ASD would show typical sex differences in filled pauses (i.e., girls would produce higher UM ratios than boys with ASD, driven by boys producing more UH than girls). We further hypothesized, based on prior research showing that scores on the Social Communication Questionnaire (SCQ;  higher scores indicate more lifetime ASD symptoms) decrease as UM ratio increases [24, 28], that higher UM ratios would be associated with fewer lifetime ASD symptoms. To address the possibility that girls in our ASD sample were more socially adept than boys, we compared parent reports of social communication ability by sex (using the Vineland Adaptive Behavior Scales-2nd edition, (VABS; ), and explored whether better social communication in girls explained sex differences in filled pause ratios. Finally, we analyzed subgroups of boys with ASD, and gauged the specificity of our UM/UH results by comparing boys and girls on seven additional linguistic features. Language samples were drawn from the interview section of the ADOS module 3 (requiring phrase speech; ). These samples provide a naturalistic back-and-forth conversation about social topics, and have been shown to contain meaningful rates of speech disfluencies in children with ASD [24, 26, 42].
Sixty-five children with ASD aged 6–17 contributed language samples during research visits to the Center for Autism Research at the Children’s Hospital of Philadelphia (Table 1). Children were recruited from the larger Philadelphia area, and 85% of the sample was White according to parent report. Research-reliable clinical psychologists (100% female) used expert clinical judgment to make diagnoses of ASD according to DSM-IV-TR criteria  informed by the ADOS-2 Module 3 and the Autism Diagnostic Interview-Revised (ADI-R; ). Exclusion criteria included extreme prematurity (< 32 weeks) or low birth weight, genetic or medical history that explained ASD symptoms, uncorrected auditory or visual impairment, significant psychiatric conditions (e.g., active mood disorders or psychosis, but not common comorbidities like anxiety or ADHD), and contraindications for an MRI scan (e.g., braces). Inclusion criteria for the present subsample included the ability to use phrase speech, since all conversational data were drawn from Module 3 ADOS evaluations. Boys and girls did not differ significantly on age, General Conceptual Ability (GCA) using the Differential Abilities Scales-II (DAS-II; all participants had GCAs > 75), or ASD symptoms as measured by ADOS scores (Table 1). Despite statistically insignificant differences in chronological age, there were mean differences; we therefore included this variable in our analysis, to check for possible age influences on filled pauses. In addition, we recruited a small sample of typically developing children (N = 17; age in years: M = 11.32, SD = 2.21; GCA: M = 104, SD = 15) for the purposes of comparison.
Research-reliable clinicians administered the ADOS to participants as part of 1- or 2-day study battery (rescored as the ADOS-2, referred to as the ADOS-2 throughout this paper). The ADOS-2 was administered in a quiet room equipped with audio and video recording equipment. Parents consented to using these recordings for research purposes. Participant IQ was estimated using the DAS-II , and parents provided additional information via the ADI-R , the VABS , and the Social Communication Questionnaire (SCQ;  see the “Measures” section below). Families were compensated for time and travel. This research was conducted with approval and oversight from the Institutional Review Board of the Children’s Hospital of Philadelphia.
All measures were administered in person, by telephone (for the ADI-R, if needed), or via US postal service (for parent questionnaires).
Autism Diagnostic Observation Schedule-2nd Edition (ADOS-2; ). The ADOS-2 is a semi-structured psychodiagnostic assessment for ASD, often used in combination with parent history (collected via ADI-R or SCQ combined with parent interview). ADOS-2 calibrated severity scores estimate the extent to which an individual is affected by ASD symptoms (0–10, with 0 representing least affected and 10 most affected ). Calibrated severity scores for the subdomains of social affect (0–10) and repetitive behaviors/restricted interests (0–10) were calculated as well [25, 27].
Differential Abilities Scales-II (DAS-II; ). The DAS-II is designed to estimate general intelligence in individuals aged 2–18 years. The DAS-II provides standard scores that are analogous with full-scale IQ (DAS-II General Conceptual Ability), verbal IQ (DAS-II verbal composite score), and performance IQ (DAS-II nonverbal composite score).
Social Communication Questionnaire (SCQ; ). The SCQ is a parent report questionnaire assessing current and lifetime autism symptoms. This study reports lifetime SCQ scores, with higher scores indicating greater lifetime autism symptomology.
Vineland Adaptive Behavior Scales (parent/caregiver rating form)-2nd edition (VABS; ). The VABS is a parent report measure of their child’s adaptive behavior. It includes multiple domains of adaptive function, including socialization and communication. Higher scores indicate more proficient adaptive behavior.
The largest continuous segment of the “interview questions” section of the ADOS-2 (in minutes) was orthographically transcribed using a specification developed in collaboration with the University of Pennsylvania’s Linguistic Data Consortium (LDC). When possible, we transcribed the “Emotions,” “Social Difficulties and Annoyance,” “Friends, Relationships, and Marriage,” and “Loneliness” sections. The “Break” section, which occurs within these sections, was not transcribed. Mean overall transcription length was 19.61 min per participant. Samples from boys (mean = 19.15 min) vs. girls (mean = 20.65) did not significantly differ in duration, Mann-Whitney Z = −1.48 p = 0.14. Each audio sample was segmented into utterances (separated by breath pauses) by an undergraduate student and checked by an experienced annotator. Speech segments were transcribed by a different undergraduate student, and adjudicated by a different experienced annotator. Segmentation, transcription, and adjudication were conducted using audio files only, blind to diagnosis. Nearly 50% of recordings were fully transcribed by two independent annotators, with pre-adjudication word-level agreement averaging 92%. Time-stamped textual transcriptions were aligned with audio files using a force-aligner developed at the LDC .
Word count was calculated by summing all words produced by each speaker (totwords). Instances of UM and UH were identified within each transcript, and summed. Three variables were calculated: average UM production (tot_um/totwords), average UH production (tot_uh/totwords), and um_ratio, which is the rate of UM produced relative to total filled pauses [tot_um/(tot_um + tot_uh)]. Total filled pause rate was calculated by dividing the sum of pauses filled by UM or UH by the total number of words produced by an individual [(tot_um + tot_uh)/totwords]. Average latency to respond to the ADOS-2 administrator was calculated by dividing the sum duration of all clinician-to-participant inter-turn pauses by the total number of pauses of that transition type. Speech rate was calculated by dividing the total number of words produced by each speaker by the summed duration of all segments for that speaker. An outlier-robust measure of dispersion in participants’ fundamental frequency (F0) distribution (median absolute deviation from the median), captured pitch variation. All time variables are reported in milliseconds.
Three generalized linear mixed effects logistic regression models assessed the relative effects of age, IQ, sex, and ASD symptoms on “tot_um,” “tot_uh,” and “um_ratio” in participants with ASD (software: R, package: lme4; ). In each case, the word of interest was coded as a “hit” while all other words were coded as “misses.” Continuous variables were z-scored for interpretability, and participant identity was included with a random intercept. UM and UH variables were non-normal and contained outliers; analyses with and without outliers were found to yield the same pattern of results, so outliers were retained to capture the heterogeneity inherent in samples with ASD. Nonparametric Mann-Whitney U tests were used in place of standard t tests to account for non-normality (software: SPSS 23; Z is reported). Means and standard deviations are reported for the typically developing comparison group, but due to the small sample size, only nonparametric Mann-Whitney U tests were conducted. Spearman’s Rho (ρ) was used to characterize correlational relationships between UM and UH variables and questionnaire scores for participants with ASD. Effect sizes for Mann-Whitney U tests are reported using r = Z/[sqrt(N)] . Following Cohen (1988), an r value of 0.1 is considered a small effect, 0.3 is a medium effect, and 0.5 is a large effect.
Predictors of UM and UH in ASD
Generalized linear mixed effects logistic regression models revealed significant associations between sex, VABS socialization scores and average UH production in the ASD group, as well as UM ratio, but no association between sex and socialization and average UM production (Table 2). Age, GCA, SCQ, and VABS communication scores did not account for significant explanatory variance in the three filled pause variables, nor did interactions between any two variables. The pattern of results did not change when verbal and nonverbal composite scores were entered instead of DAS-II GCA.
Boys with ASD produced more UH relative to total words produced (mean = 0.0084, SD = 0.0077, 95% CI = 0.0062–0.0106) than girls with ASD (mean = 0.0044, SD = 0.0044, 95% CI = 0.0022–0.0066), mean difference = −0.004, 95% CI = −0.0081–0.00004, Z = −1.98, p = 0.02, r = 0.29. There was also a significant sex difference in UM ratio (mean difference = 19%, 95% CI = 0.04–0.34, Z = −2.16, p = 0.03, r = 0.26). Whereas girls produced UM during approximately 75% of pauses filled by UM or UH (SD = 17%; CI = 0.66–0.85), boys produced UM during an average of 56% of such pauses (SD = 29%, CI = 0.48–0.65). Average UM use did not differ in boys (mean = 0.0143, SD = 0.0156, 95% CI = 0.0098–0.0188) as compared to girls (mean = 0.0161, SD = 0.0125, 95% CI = 0.0094–0.0227), mean difference = 0.0017, 95% CI = −0.0069–0.0103, Z = −1.19, p = 0.24, r = 0.15, suggesting that greater UH use by boys (or reduced UH use by girls) drove the significant difference in the um/uh ratio.
UM and UH in typical controls
A similar pattern emerged in our sample of typically developing children. Typical boys produced more UH relative to total words produced (mean = 0.0083, SD = 0.0050, 95% CI = 0.0042–0.0124) than did typical girls (mean = 0.0032, SD = 0.0027, 95% CI = 0.0011–0.0053; mean difference = −0.0051, 95% CI = −0.0092–(−0.0010), Z = −2.32, p = 0.02, r = 0.56; Fig. 1). Similar to the ASD group, mean UM production relative to total words produced was comparable between typical boys (mean = 0.0301, SD = 0.0129, 95% CI = 0.0193–0.0408) and girls (mean = 0.0234, SD = 0.0184, 95% CI = 0.0011–0.0053; mean difference = −0.0067, 95% CI = −0.0233–0.0100, Z = −1.06, p = 0.29, r = 0.26). However, in contrast to the pattern seen in children with ASD, UM ratios were high for both typical girls (mean = 85%, SD = 14%, 95% CI = 74–95%) and boys (mean = 78%, SD = 15%, 95% CI = 65–91%; mean difference = 0.0673, 95% CI = −0.0827–0.2173, Z = −0.87, p = 0.39, r = 0.21.
Diagnostic group differences in UM and UH
Given our small TDC sample, we suggest that the following results be interpreted with caution. Overall, typical children filled more pauses than children with ASD (Table 3; Z = −2.33, p = 0.02, r = 0.26). Mann-Whitney U tests suggest that girls with ASD and typical girls used comparable levels of UM (p = 0.34) and UH (p = 0.61), and had similar UM ratios (p = 0.16; Fig. 1). In contrast, boys with ASD produced significantly less UM (relative to total words produced) than typical boys (p = 0.002), and typical boys produced higher UM ratios than boys with ASD (p = 0.06). Boys in the ASD and TDC groups did not differ on average UH (p = 0.63).
Filled pauses and ASD symptoms
Generalized linear mixed effects logistic regression models revealed that UM ratio and average UH production are associated with VABS socialization scores in the ASD group. To rule out the possibility that slightly lower VABS scores in girls relative to boys in our sample unduly influenced the observed relationship between socialization and disfluency type, we explored whether relationships between filled pauses and VABS socialization scores held within boys and girls with ASD, separately (Fig. 2). Spearman’s Rho correlations revealed a positive association between VABS socialization scores and UM ratio for boys (ρ = 0.30, p = 0.04) but not for girls (ρ = 0.007, p = 0.98; 2a). Increased UH use was associated with lower socialization scores for boys (ρ = −0.31, p = 0.04; 2b) and girls at similar strength (ρ = −0.32) but likely failed to reach significance in girls due to the smaller sample size. In contrast to similar relationships between average UH and socialization in boys and girls, opposite trends were observed in average UM; the relationship between UM and socialization scores was positive and insignificant in boys, and it was negative and insignificant in girls (2c). This pattern of results suggests that UM ratio and average UM are not sex-robust indicators of pragmatic ability or social impairment in ASD. UH, on the other hand, appears to be a promising indicator of poor socialization as reported by parents on the VABS, across both sexes.
Parsing heterogeneity in ASD
Every girl in the ASD group produced UM during at least 40% of filled pauses, and some produced UM during 100% of filled pauses. In contrast, boys with ASD ranged from 0 to 100% of pauses filled by UM vs. UH. This suggests the possibility of multiple subgroups of boys: primarily UMers and primarily UHers. We explored this heterogeneity among boys with ASD by splitting into three equal groups (by UM ratio) and comparing the top and bottom thirds of the boy group to girls. The bottom third of boys produced UH during filled pauses 60% of the time or more, and were categorized as UHers. Boys that produced UM during 78% or more of filled pauses were categorized as UMers. No group differences survived correction for multiple comparisons in a multivariate ANOVA (all ps > 0.21; Table 4), indicating that the relationship between filled pause use and Vineland socialization is more dimensional than categorical and may be sensitive to sample size.
Specificity of filled pause differences
Finally, we considered the possibility that filled pauses represent just one of many low-level speech differences between boys and girls with ASD. To assess the specificity of variation in filled pause type, we conducted pairwise comparisons across a variety of other linguistic features. Results revealed that girls and boys with ASD did not differ on average total word count, filled pause rate, speech rate, number of turns, duration of turns, response latency, or pitch variation (Table 5).
Subtle linguistic markers influence how parents, teachers, clinicians, and peers perceive an individual’s social skills during everyday conversation. For example, UM is a social pragmatic marker  used more often by younger people and women as compared to older people and men, and UH is used relatively more often by older people and men than by younger people and women . Unusually low UM ratios have been reported in children/adolescents with ASD , and have been argued to mark difficulties with social communication . In the present study, we asked whether a third variable not considered in prior research—speaker sex—might be important for understanding filled pause differences in children with ASD. Our results showed that girls with ASD and typical girls/boys exhibited higher UM ratios than boys with ASD, while girls in both diagnostic groups suppressed UH relative to their male counterparts. Importantly, filled pause differences in boys and girls with ASD in this study were not attributable to increased social pragmatic ability in girls, as girls and boys in our sample had equivalent social communication skills and comparable autism symptom severity. The present findings suggest that UH suppression and higher UM ratios may serve as “linguistic camouflage” to normalize the way a girl with ASD sounds relative to same-aged typical peers, while elevated rates of UH (relative to UM) may cause boys with ASD to sound particularly atypical. Intentional or not, overtly typical-sounding speech in girls could have some benefits (e.g., allowing a child to more easily “blend in” with classmates), while simultaneously complicating the detection of ASD, leading to missed or delayed diagnosis, and misdiagnoses that are more common in girls than boys .
Sex differences and UM Ratio
This is the first study to show that girls and boys with ASD show sex-specific patterns of filled pause use, and to compare these patterns in girls and boys without ASD. Importantly, girls with ASD in our study did not fill more pauses than boys; they filled them differently. Like typical children and children with specific language impairment , girls in the present study used UM during > 70% of all filled pauses. Boys with ASD, in contrast, used UM during only 55% of filled pauses. In contrast to prior research , we did not find relationships between ASD symptoms as measured by the SCQ and average UM production in the sample as a whole, nor did we find relationships between average UM and VABS socialization or communication scores. This indicates that even though girls are using typical-sounding speech patterns (as reflected by the UM ratio), it is not necessarily reflected in improved parental perceptions of their social communicative ability.
Why might girls with ASD fill conversational pauses with “typical-sounding” words, and suppress atypical words like UH, despite comparable social communication deficits to boys? One hypothesis is that girls are not using filled pauses as communicative tools, but rather produce higher UM ratios (and lower rates of UH) as a form of unconscious social mimicry or scripting [7, 30, 32, 33, 36]. According to this hypothesis, a girl with ASD might produce typical UM ratios to appear less atypical and improve her chances of successfully integrating during social situations. However, she may not necessarily understand the social meaning behind different filled pause types, and struggle with social communication as much as boys that do not “normalize” their UM ratios. Indeed, our results suggest that girls’ linguistic camouflaging is more successful in some ways than others. Although girls with ASD produced higher UM ratios than boys with ASD due to UH suppression, their overall filled pause rate was still lower than typical participants, resulting in incomplete camouflage. This may be due to the nature of our sample; research-based ascertainment differs from population- or clinic-based approaches, and may have biased our sample toward girls with more pronounced symptoms. Thus, it is possible that girls who have not yet been identified as autistic or referred for an evaluation will engage in more successful (or complete) linguistic mimicry.
Another likely explanation for the sex differences reported here hinges on powerful forces of gender socialization. Indeed, research on typically developing children shows that parent perceptions, play practices, and styles of interaction differ systematically by child sex, with infant girls hearing more language, receiving more eye contact, and having more opportunities to engage in social-emotional interaction than infant boys . These gendered caregiving differences have already been in full effect for 2 to 3 years before most ASD diagnoses are made [6, 9, 15, 17, 23], and may act as pressure on young girls to conform to “girl” expectations that include pragmatic competence. As Goldman  pointed out, “children (that are later diagnosed with ASD) are raised like any other children according to their sex. They are perceived as girls or boys and are taught to play, talk, and interact in accordance with the particular gender-based rules of their families.” Thus, perhaps girls with ASD are not using filled pauses communicatively, but instead are responding to forces of gender socialization that expect girls to sound less pragmatically impaired, by suppressing UH and producing higher UM ratios. Since girls with ASD and typical girls are subject to the same gender-based influences during the first few years, it follows that their language might be similarly affected. Indeed, we found that typical girls also suppress UH and produce higher UM ratios than typical boys. This is only one possible hypothesis among a confluence of factors that likely relate to the differences observed in this study. Developmental research focused on younger children is needed to chart the causal pathways that lead to this type of linguistic variation in girls.
Sex differences and UH use
Consistent with prior research documenting UM and UH differences in adult men and women, our results suggest that sex differences in UH production are robust in children; boys in our sample produced more than twice as many UHs (0.84% for boys with ASD, 0.83% for TDC boys) as girls (0.44% for girls with ASD, 0.32% for TDC girls). In addition, we found negative relationships between UH production and perceived social ability across both sexes in ASD; this builds on a number of studies linking speech disfluencies to negative social perceptions of speakers [8, 22, 45].
What caused elevated UH in boys with ASD (and typical boys)? Unmeasured variation in basic cognition could explain this finding, since speakers tend to pause more often, and for longer, right before producing syntactically complex utterances or utterances that exceed their linguistic competency [4, 53]. Given that boys and girls in our sample did not differ significantly on IQ, however, basic cognitive differences are unlikely to have caused elevated UH use by boys (IQ was also not a significant predictor of any language variable in our models). Another possible explanation is language-based, since people with uneven language profiles tend to produce more disfluencies [5, 12, 39, 43, 55–57]. Our groups had comparable verbal IQ scores, but it is still possible that boys in our sample had slightly compromised or uneven language abilities relative to girls, who tend to have better language than boys in typical samples as well . Future research is warranted to explore this possibility.
A third explanation, that poor coordination between motor and language systems results in greater UH use by boys with ASD relative to girls, stems from a growing body of research demonstrating subtle motor control differences in ASD . For instance, people who produce high rates of stuttering-like disfluencies often struggle with speech- and non-speech motor tasks, suggesting a general deficiency in integrating sensory and motor control information (Smits Bandstraand De Nil, 2009; Webster, 1997; Smith et al., 2012; Louckset al., 2007; Max & Gracco, 2005). Elevated UH rates in boys with ASD may therefore indicate disrupted coordination across motor and language systems, and could provide clues about a global underlying motor-based pathobiology that partially accounts for social communication problems in ASD . However, we know little about how the motor features of ASD may differ by sex; if UH is a marker of asynchrony across motor and language systems that disproportionately affects boys, and elevated UH production drives reduced UM ratios in boys, it could be that motor dysregulation leads to pragmatic problems even when the cognitive aspects of language are grossly intact. We did not measure motor ability in this study, leaving open the possibility that these differences explain some amount of variance, which may or may not vary by sex; future research will address this question.
Do sex-linked behavior differences necessitate sex-specific diagnostics and treatments for ASD?
Combined with a recent study showing gestural camouflaging in girls with ASD , the linguistic camouflage effect identified in this study suggests that subtle sex differences in behavioral domains relevant to ASD (e.g., language) could contribute to girls being missed—or misdiagnosed—due in part to a male-focused conceptualization of ASD and male-normed diagnostic tools. Computational approaches might be particularly useful as an objective way to “see through” various types of social camouflage in the context of screening and diagnosis, and could add to a battery of sex-normed diagnostic and characterization tools for ASD (at least one, the SRS-2, is already normed by sex [13, 31, 33]). In addition to sex-specific identification and characterization tools, behavior-dependent intervention outcome measures may also need to differ by sex. The goal of naturally and automatically integrating information about putative sex differences into the hunt for behavioral biomarkers could lead to creative new approaches that are more sensitive to the way ASD manifests in boys and girls. Notably, our results are consistent with recent large-scale research showing that sex differences in the cognitive and motor profiles of infant siblings of children with ASD are not unique to ASD risk, but rather reflect broader sex differences in the general population . In their conclusion, Messinger and colleagues highlighted the need to compare girls with ASD to typically developing girls (and boys with ASD to typically developing boys). We echo this suggestion.
This study differs from prior research in significant ways. Whereas some studies quantify a variety of disfluencies , the current research focused only on UM and UH. Unlike Gorman et al. , we counted multiple contiguous instances of UM and UH (i.e., if UM was repeated more than once during a filled pause, we counted it more than once. This allowed us to capture stuttering-like filled pauses). Our sample included the largest number of girls with ASD to date, reflecting a 4:1 ratio of boys-to-girls (close to the generally accepted average ratio in ASD). However, we did not include equal numbers of girls and boys with ASD, which led to power issues when examining variables within each sex separately, and we included only a small sample of typically developing control participants. As with some prior studies, we only focused on the interview section of the ADOS-2, and only used Module 3 recordings—future research should expand to new age ranges, ability levels, and ADOS-2 sections and modules.
An inherent limitation of analyzing language produced during semi-structured evaluations is that clinicians are free to ask questions in any order, or to skip questions completely if appropriate from a clinical perspective. Although all clinicians in the current study were research-reliable PhD-level psychologists administering the ADOS in a single center, it is likely that not all children were given probes in the same order. In addition, all clinicians in this study were female, which means that language samples from male participants may have included more UM than if they had been interviewed by a male clinician . Future research will systematically control for the sex of the interlocutor, to assess effects of sex match vs. mismatch on language variables in individuals with ASD. Finally, it is possible that clinicians differed in how often they asked questions vs. used open-ended comments to start conversations. These variations were limited to the extent possible, but are inherent in any research that relies on ADOS evaluations.
We did not examine the influence of co-occurring disorders or dimensions, such as anxiety, attention deficit/hyperactivity disorder, and executive dysfunction in ASD, on filled pause use. These nuances are ripe avenues for future research. Consistent with findings from other studies of children with ASD, our analyses revealed no relationships between age, IQ, and filled pause type. The lack of an age effect may be due to the age range of our sample (which does not extend into adulthood). Future research with older men and women with ASD is needed to elucidate how our findings change across the lifespan.
The present research represents a number of advances over prior studies. Our results are based on longer language samples than prior research (e.g., 20 min vs. 60 s in ). We analyzed UM/UH data from the largest group of individuals with ASD reported to date (65 in this study vs. 50 in  and 24 in ). Importantly, this is the very first study to examine sex differences in speech disfluencies in children with ASD, made possible by the inclusion of a relatively large number of girls with ASD (25% of our sample vs. 10% and 12.5% in prior work). This is also the first time UM and UH have been explored in typically developing boys and girls, who were found to produce even higher UM ratios than adult men and women [58–60]. Finally, the present research shows that the conclusions of past studies of UM and UH in children with ASD were not entirely wrong. Rather, they were correct for boys with ASD, and incorrect for girls.
Prior research shows that children and adolescents with ASD produce more fillers when cognitive demands are high , which for individuals with social challenges, may occur more often when discussing social topics. Although others have shown that UM is used relatively less frequently by children with ASD across a variety of tasks , future research should specifically compare children's responses to highly social questions and less-social questions (e.g., about objects or interests). The discrepancy in UM and UH production between different question types that vary by social load may be even more informative than relative usage collapsed across an entire interaction. Due to constraints related to the semi-structured conversation format employed in the ADOS-2, we plan to conduct this subsequent experiment in a more controlled format.
Interestingly, Irvine and colleagues suggested that reduced UM use in ASD might contribute to the perception of speech as pedantic or stilted . Based on our current findings, we propose that elevated UH during conversation might in fact drive the impression of pedantry, more so than reduced UM. In fact, high rates of UH (without corresponding increases in UM) might be so unusual in young people  that they contribute to some individuals with ASD sounding older than they appear (thus, the “little professor”). Future studies with older and larger samples of individuals with ASD, as well as human raters, are needed to test this hypothesis.
We found that filled pauses vary by sex in ASD, but the everyday consequences of these variations are unknown. Do typical-gendered language patterns in girls with ASD contribute to older age at diagnosis, missed diagnoses, or misdiagnosis? Given that early detection and early intervention are critical for maximizing functional outcomes in ASD , issues of under-referral and inadequate measurement for girls with ASD are not trivial. Outside of diagnostic issues, it is possible that suppressed UH and typical UM ratios in girls correlate with the perception of social normalcy by peers, thus helping to establish or maintain friendships. In addition, perhaps girls with ASD suppress UH more when talking to peers than when talking with adults, mimicking typical behavior and increasing their chances of social affiliation in classrooms or on playgrounds. Future studies in naturalistic settings, with a variety of interlocutors, are needed to answer these questions.
ASD experts make diagnostic decisions based on observable behavior, and subtle differences in how a child moves or talks will influence the way they are perceived. Gender socialization or social mimicry may lead to “camouflaged” behavior in girls with ASD, which, combined with widely held gender biases about how girls and boys should behave and true biological sex differences, likely complicate efforts to effectively identify and treat boys and girls with ASD. Recent attempts to reduce bias by directly sampling behavior and using objective, computational measurement tools hold promise over existing parent report and clinician rating scales , but even these new tools will likely be influenced by variables such as age, sex, gender socialization, socio-economic status, physical and mental health, and home and cultural environment. The findings reported here, identifying “linguistic camouflage” in girls with ASD, highlight the importance of continued commitment to understanding the complex web of biological and environmental factors that influence ASD emergence and presentation.
Autism Diagnostic Observation Schedule-2nd Edition
Autism spectrum disorder
Differential Abilities Scales-II–2nd Edition
Social Communication Questionnaire
Typically developing control
Vineland Adaptive Behavior Scales
Acton EK. On gender differences in the distribution of um and uh. University of Pennsylvania Working Papers in Linguistics. 2011;17(2):2.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (4th ed., Text Revision). Washington, DC: American Psychiatric Association; 2000.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders, 5th edition: DSM-5. 5th ed. Washington, D.C: American Psychiatric Publishing; 2013.
Anderson JD. Phonological neighborhood and word frequency effects in the stuttered disfluencies of children who stutter. J Speech Lang Hear Res. 2007;50(1):229–47. https://doi.org/10.1044/1092-4388 (2007/018).
Anderson JD, Pellowski MW & Conture EG. Childhood stuttering and dissociations across linguistic domains. Journal of Fluency Disorders. 2005;30(3):219–253. https://doi.org/10.1016/j.jfludis.2005.05.006.
Anderson S, Ertac S, Gneezy U, List JA, Maximiano S. Gender, competitiveness, and socialization at a young age: evidence from a matrilineal and a patriarchal society. Rev Econ Stat. 2013;95(4):1438–43. https://doi.org/10.1162/REST_a_00309
Attwood T. The complete guide to Asperger’s syndrome. London, England: Jessican Kingsley Publishers; 2006.
Boyle MP. Identifying correlates of self-stigma in adults who stutter: further establishing the construct validity of the Self-Stigma of Stuttering Scale (4S). J Fluen Disord. 2015;43:17–27. https://doi.org/10.1016/j.jfludis.2014.12.002
Cheslack-Postava K, Jordan-Young RM. Autism spectrum disorders: toward a gendered embodiment model. Soc Sci Med. 2012;74(11):1667–74. https://doi.org/10.1016/j.socscimed.2011.06.013
Christensen DL. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2012. MMWR. Surveillance Summaries, 65. 2016. Retrieved from http://www.cdc.gov/mmwr/volumes/65/ss/ss6503a1.htm.
Clark HH, Fox Tree JE. Using uh and um in spontaneous speaking. Cognition. 2002;84(1):73–111.
Clark CE, Conture EG, Walden TA & Lambert WE. Speech-Language Dissociations, Distractibility, and Childhood Stuttering. American Journal of Speech-Language Pathology. 2015:24(3)480. https://doi.org/10.1044/2015_AJSLP-14-0198.
Constantino JN, Charman T. Gender bias, female resilience, and the sex ratio in autism. J Am Acad Child Adolesc Psychiatry. 2012;51(8):756–8.
Cook J. From movement kinematics to social cognition: the case of autism. Philos Trans R Soc B. 2016;371(1693):20150372. https://doi.org/10.1098/rstb.2015.0372
Daniels AM, & Mandell DS. Explaining differences in age at autism spectrum disorder diagnosis: a critical review. Autism. 2013;1362361313480277.
De Marchena A, Eigsti I-M. The art of common ground: emergence of a complex pragmatic language skill in adolescents with autism spectrum disorders. J Child Lang. 2016;43(01):43–80. https://doi.org/10.1017/S0305000915000070
Eagly AH, Wood W. The origins of sex differences in human behavior: evolved dispositions versus social roles. Am Psychol. 1999;54(6):408–23. https://doi.org/10.1037/0003-066X.54.6.408
Elliott CD. Differential Ability Scales®-II-DAS-II. San Antonio: Harcourt Assessment; 2007. Retrieved from http://www.pearsonclinical.com/education/products/100000468/differential-ability-scales-ii-das-ii.html
Elsabbagh M, Divan G, Koh Y-J, Kim YS, Kauchali S, Marcín C, et al. Global prevalence of autism and other pervasive developmental disorders: global epidemiology of autism. Autism Res. 2012;5(3):160–79. https://doi.org/10.1002/aur.239
Field A. Discovering statistics using IBM SPSS statistics. 4th ed. London, England: SAGE Publications Inc.; 2005. Retrieved from https://us.sagepub.com/en-us/nam/discovering-statistics-using-ibm-spss-statistics/book238032%20.
Fox Tree JE. Listeners’ uses of um and uh in speech comprehension. Mem Cogn. 2001;29(2):320–6. https://doi.org/10.3758/BF03194926
Gabel RM. Effects of stuttering severity and therapy involvement on attitudes towards people who stutter. J Fluen Disord. 2006;31(3):216–27. https://doi.org/10.1016/j.jfludis.2006.05.003
Goldman S. Opinion: sex, gender and the diagnosis of autism—a biosocial view of the male preponderance. Res Autism Spectrum Disord. 2013;7(6):675–9. https://doi.org/10.1016/j.rasd.2013.02.006
Gorman K, Olson L, Hill AP, Lunsford R, Heeman PA, & van Santen JPH. “Uh” and “um” in children with autism spectrum disorders or language impairment. Autism Res. 2016, n/a-n/a. https://doi.org/10.1002/aur.1578.
Gotham K, Pickles A, Lord C. Standardizing ADOS scores for a measure of severity in autism spectrum disorders. J Autism Dev Disord. 2009;39(5):693–705. https://doi.org/10.1007/s10803-008-0674-3
Heeman PA, Lunsford R, Selfridge E, Black L, Van Santen J. Autism and interactional aspects of dialogue. In Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue. Association for Computational Linguistics. 2010. pp. 249–52. Retrieved from http://dl.acm.org/citation.cfm?id=1944551.
Hus V, Gotham K, & Lord C. Standardizing ADOS domain scores: separating severity of social affect and restricted and repetitive behaviors. J Autism Dev Disord. 2012. https://doi.org/10.1007/s10803-012-1719-1.
Irvine CA, Eigsti I-M, Fein DA. Uh, um, and autism: filler disfluencies as pragmatic markers in adolescents with optimal outcomes from autism spectrum disorder. J Autism Dev Disord. 2016;46(3):1061–70. https://doi.org/10.1007/s10803-015-2651-y
Kim SH, Lord C. The behavioral manifestations of autism spectrum disorders. In The neuroscience of autism spectrum disorders. Elsevier. 2013. pp. 25–37. Retrieved from http://linkinghub.elsevier.com/retrieve/pii/B9780123919243000028.
Kopp S, Gillberg C. Girls with social deficits and learning problems: autism, atypical Asperger syndrome of a variant of these conditions. Eur Child Adolesc Psychiatry. 1992;1(2):89–99. https://doi.org/10.1007/BF02091791
Kopp S, Gillberg C. The Autism Spectrum Screening Questionnaire (ASSQ)-Revised Extended Version (ASSQ-REV): an instrument for better capturing the autism phenotype in girls? A preliminary study involving 191 clinical cases and community controls. Res Dev Disabil. 2011;32(6):2875–88. https://doi.org/10.1016/j.ridd.2011.05.017
Kreiser NL, White SW. ASD in females: are we overstating the gender difference in diagnosis? Clin Child Fam Psychol Rev. 2014;17(1):67–84. https://doi.org/10.1007/s10567-013-0148-9
Lai M-C, Lombardo MV, Auyeung B, Chakrabarti B, Baron-Cohen S. Sex/gender differences and autism: setting the scene for future research. J Am Acad Child Adolesc Psychiatry. 2015;54(1):11–24.
Lai M-C, Lombardo MV, Pasco G, Ruigrok ANV, Wheelwright SJ, Sadek SA, et al. A behavioral comparison of male and female adults with high functioning autism spectrum conditions. PLoS One. 2011;6(6):e20835. https://doi.org/10.1371/journal.pone.0020835
Lake JK, Humphreys KR, Cardy S. Listener vs. speaker-oriented aspects of speech: studying the disfluencies of individuals with autism spectrum disorders. Psychon Bull Rev. 2011;18(1):135–40. https://doi.org/10.3758/s13423-010-0037-x
Lehnhardt F-G, Falter CM, Gawronski A, Pfeiffer K, Tepest R, Franklin J, Vogeley K. Sex-related cognitive profile in autism spectrum disorders diagnosed late in life: implications for the female autistic phenotype. J Autism Dev Disord. 2016;46(1):139–54. https://doi.org/10.1007/s10803-015-2558-7
Liberman M. Young men talk like old women. 2005. Retrieved August 3, 2016, from http://itre.cis.upenn.edu/~myl/languagelog/archives/002629.html.
Liberman M. More on UM and UH. 2014. Retrieved June 17, 2017, from http://languagelog.ldc.upenn.edu/nll/?p=13713.
Loucks TMJ, De Nil LF & Sasisekaran J. Jaw-phonatory coordination in chronic developmental stuttering. Journal of Communication Disorders. 2007;40(3):257–272. https://doi.org/10.1016/j.jcomdis.2006.06.016.
Loomes R, Hull L, Mandy WPL. What is the male-to-female ratio in autism spectrum disorder? A systematic review and meta-analysis. J Am Acad Child Adolesc Psychiatry. 2017;56(6):466–74. https://doi.org/10.1016/j.jaac.2017.03.013
Lord C, Rutter M, DiLavore PS, Risi S, Gotham K, Bishop SL. Autism diagnostic observation schedule, second edition (ADOS-2). Torrance: Western Psychological Services; 2012.
MacFarlane H, Gorman K, Ingham R, Presmanes Hill A, Papadakis K, Kiss G, van Santen J. Quantitative analysis of disfluency in children with autism spectrum disorder or language impairment. PLoS One. 2017;12(3):e0173936. https://doi.org/10.1371/journal.pone.0173936
Max L & Gracco VL. Coordination of Oral and Laryngeal Movements in the Perceptually Fluent Speech of Adults Who Stutter. Journal of Speech, Language, and Hearing Research 2005;48(3):524–542. https://doi.org/10.1044/1092-4388(2005/036).
Messinger DS, Young GS, Webb SJ, Ozonoff S, Bryson SE, Carter A, et al. Early sex differences are not autism-specific: a Baby Siblings Research Consortium (BSRC) study. Mol Autism. 2015;6(1):32. https://doi.org/10.1186/s13229-015-0027-y
Panico J, Healey EC, Brouwer K, Susca M. Listener perceptions of stuttering across two presentation modes: a quantitative and qualitative approach. J Fluen Disord. 2005;30(1):65–85. https://doi.org/10.1016/j.jfludis.2005.01.003
Rutter M, Bailey A, Lord C. SCQ: The Social Communication Questionnaire. Los Angeles: Western Psychological Services; 2003. Retrieved from https://www.wpspublish.com/store/Images/Downloads/Product/SCQ_Manual_Chapter_1.pdf.
Rutter M, Le Couteur A, Lord C. Autism diagnostic interview-revised. Los Angeles: Western Psychological Services; 2003.
Rynkiewicz A, Schuller B, Marchi E, Piana S, Camurri A, Lassalle A, Baron-Cohen S. An investigation of the ‘female camouflage effect’ in autism using a computerized ADOS-2 and a test of sex/gender differences. Molecular Autism. 2016;7(1) https://doi.org/10.1186/s13229-016-0073-0
Sandin S, Lichtenstein P, Kuja-Halkola R, Larsson H, Hultman C, Reichenberg A. The familial risk of autism. JAMA. 2014;311(17):1770–7. https://doi.org/10.1001/jama.2014.4144
Scaler Scott K, Tetnowski JA, Flaitz JR, Yaruss JS. Preliminary study of disfluency in school-aged children with autism: disfluency in autism. Int J Lang Commun Disord. 2014;49(1):75–89. https://doi.org/10.1111/1460-6984.12048
Shriberg LD, Paul R, McSweeny JL, Klin A, Cohen DJ, Volkmar FR. Speech and prosody characteristics of adolescents and adults with high-functioning autism and Asperger syndrome. J Speech Lang Hear Res. 2001;44(5):1097–115.
Sparrow SS, Cicchetti D, & Balla DA. Vineland Adaptive Behavior Scales, Second Edition (VinelandTM-II). Pearson. 2005.
Starkweather WC. Fluency and stuttering (Vol. xiv). Englewood Cliffs: Prentice-Hall, Inc.; 1987.
Suh J, Eigsti I-M, Naigles L, Barton M, Kelley E, Fein D. Narrative Performance of optimal outcome children and adolescents with a history of an autism spectrum disorder (ASD). J Autism Dev Disord. 2014;44(7):1681–94. https://doi.org/10.1007/s10803-014-2042-9
Smith, A., Goffman, L., Sasisekaran, J., & Weber-Fox, C. (2012). Language and motor abilities of preschool children who stutter: Evidence from behavioral and kinematic indices of nonword repetition performance. Journal of Fluency Disorders, 37(4), 344–358. https://doi.org/10.1016/j.jfludis.2012.06.001.
Smits-Bandstra S & De Nil L. Speech skill learning of persons who stutter and fluent speakers under single and dual task conditions. - PubMed - NCBI. Clinical Linguistics & Phonetics. 2009;23(1):38–57. https://doi.org/10.1080/02699200802394914.
Webster, WG. Principles of human brain organization related to lateralization of language and speech motor functions in normal speakers and stutterers. In: Hulstijn W, Peters HFM, van Lieshout PHHM, editors. Speech production: Motor control, brain research and fluency disorders. Amsterdam, The Netherlands: Elsevier; 1997. pp. 119–139.
WHO. Glossary of terms and tools. (n.d.). Retrieved July 5, 2016, from http://www.who.int/entity/gender-equity-rights/knowledge/glossary/en/index.html
Wieling M, Grieve J, Bouma G, Fruehwald J, Coleman J, Liberman M. Variation and change in the use of hesitation markers in Germanic languages. Lang Dyn Change. 2016;6(2):199–234.
Yuan J, Liberman M. Investigating consonant reduction in Mandarin Chinese with improved forced alignment. In INTERSPEECH. 2015. pp. 2675–8. Retrieved from https://www.ldc.upenn.edu/sites/www.ldc.upenn.edu/files/interspeech2015-mandarin-consonant-reduction.pdf
We gratefully acknowledge the children and families that participated in this research. We thank Katie Lowe, Tiffany Ryan, Joy Payton, and Ayana King Pointer from the Center for Autism Research, as well as all clinicians, staff, and students at CAR and LDC, particularly Caitlin Cieri.
This study was supported by an Autism Science Foundation Postdoctoral Fellowship to JPM; NICHD U54HD086984, NIMH RC1MH08879, Pfizer, and RWJ to RTS; NIMH (K23MH086111; PI: B.E. Yerys, R21MH092615; PI: B.E. Yerys), by the CHOP Clinical Translational Core and a New Program Development Award (to B.E. Yerys) through both the Intellectual and Developmental Disabilities Research Center funded by the National Institute of Child and Human Development (P30HD026979; PI: M. Yudkoff), and by a grant from the Philadelphia Foundation.
Availability of data and materials
The datasets generated and/or analyzed during the current study are not publicly available due to embedded protected health information in conversations, but are available from the corresponding author upon reasonable request.
Ethics approval and consent to participate
The Institutional Review Board of the Children’s Hospital of Philadelphia provided approval and oversight for this study. All participants provided consent (parental consent for participants under age 18) and assent when possible.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.