Diagnostic validity of Autism Diagnostic Observation Schedule, second edition (K-ADOS-2) in the Korean population

Kim, So Yoon; Oh, Miae; Bong, Guiyoung; Song, Da-Yea; Yoon, Nan-He; Kim, Joo Hyun; Yoo, Hee Jeong

doi:10.1186/s13229-022-00506-5

Research
Open access
Published: 30 June 2022

So Yoon Kim¹^na1,
Miae Oh²^na1,
Guiyoung Bong³,
Da-Yea Song³,
Nan-He Yoon⁴,
Joo Hyun Kim³ &
…
Hee Jeong Yoo ORCID: orcid.org/0000-0003-0521-2718^3,5

Molecular Autism volume 13, Article number: 30 (2022)

Abstract

Background

Although the Korean version of the Autism Diagnostic Observation Schedule-2 (K-ADOS-2) is widely used to diagnose autism spectrum disorder (ASD) in South Korea, no previous study has examined the validity and reliability of all modules of the K-ADOS-2 across a wide age range, particularly in older children, adolescents, and adults.

Method

Data from 2158 participants were included (mean age = 79.7 months; 73.6% male): 1473 participants with ASD and 685 participants without ASD (Toddler Module, n = 289; Module 1, n = 642; Module 2, n = 574; Module 3, n = 411; Module 4, n = 242). Participants completed a battery of tests, including the K-ADOS or K-ADOS-2 and other existing diagnostic instruments. Sensitivity, specificity, area under the receiver operating characteristic (ROC) curve, positive predictive value (PPV), negative predictive value (NPV), Cohen’s kappa (k), and agreement with existing diagnostic instruments were computed. Cronbach’s α values were also calculated.

Results

All developmental cells of the K-ADOS-2 showed sufficient ranges of sensitivity, 85.4–100.0%; specificity, 80.4–96.8%; area under the ROC curve, .90–.97; PPV, 77.8–99.3%; NPV, 80.6–100.0%; and k values, .83–.92. The kappa agreements of developmental cells with existing diagnostic instruments ranged from .20 to .90. Cronbach’s α values ranged from .82 to .91 across all developmental cells.

Limitation

The best-estimate clinical diagnoses made in this study were not independent of the K-ADOS-2 scores. Some modules did not include balanced numbers of participants in terms of gender and diagnostic status.

Conclusion

The K-ADOS-2 is a valid and reliable instrument in diagnosing ASD in South Korea. Future studies exploring the effectiveness of the K-ADOS-2 in capturing restricted, repetitive behaviors and differentiating ASD from other developmental disabilities are needed.

Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by social communication difficulties and the presence of repetitive and stereotyped behaviors, interests, and activities (RRBs) [1]. Due to the heterogeneity in symptom presentation of ASD, the clinical diagnosis is most valid and reliable when made using comprehensive diagnostic instruments [2, 3] such as Autism Diagnostic Observation Schedule (ADOS; [4]) or ADOS-2 [5] and Autism Diagnostic Interview-Revised (ADI-R; [6]) [7, 8].

The ADOS-2 is a semi-structured, standardized observational instrument designed to assess and diagnose ASD across all ages. Initially developed in 1989 [9], the ADOS has been updated into the ADOS-2 to improve the accuracy and versatility of the assessment. The ADOS-2 revised classification algorithms, amended protocols of administration, included the additional module for toddlers between 12 and 30 months, and created new criteria for comparison scores, which allow the examination of ASD symptom severity across different modules [10]. The ADOS-2 classification of ASD requires an individual’s score to meet or exceed the algorithm threshold for the two domains: social affect and RRB. The schedule consists of five developmentally sequenced modules, each of which has a different combination of activities based on developmental age and expressive language skills. The wide usage of the ADOS-2 may be attributed to its ability to gather information from a set of structured activities, capture autistic behaviors during interactive activities, and account for the wide developmental levels and ages [11]. The ADOS-2 comprises social activities called “presses,” implemented to provide stimulating and standard contexts in which social communication behaviors and interactions are likely to appear [12].

The ADOS-2 has become more internationally accessible, driven by increased ASD awareness as well as the efforts to administer the ADOS-2 in different countries [13]. Currently, the ADOS-2 has been translated into more than 20 languages [14], and the clinical validity of the ADOS-2 has been well-established in various international samples [15]. Previous studies have underscored that ASD diagnostic instruments developed in Western countries can be properly translated and adapted in non-Western countries [13, 16,17,18]. Adapting the diagnostic tools that were originally developed for different cultures requires a re-examination of reliability and validity [7, 19]. Since culture influences the language, play materials, and social norms concerning developmentally appropriate behaviors, it can consequently affect how people in specific cultural contexts evaluate the appropriateness and severity of autistic symptoms [20, 21]. However, the majority of studies investigating the validity of translated versions of the ADOS-2 have been conducted in Western, English-speaking countries, such as the United States (US) [22], Canada [23], and the United Kingdom [24]. Only a handful of studies have examined the validity of the ADOS-2 in non-Western populations (e.g., in Chinese [16], Indian [17], and South Korean [13, 18]).

The Korean versions of the ADOS/ADOS-2 (i.e., K-ADOS/K-ADOS-2) have been used in South Korea for more than a decade [25]. To date, only two studies have partially validated the K-ADOS/K-ADOS-2 in South Korea. Kim et al. [18] conducted a study including 292 school students (aged 7–14 years old) to show that Module 3 of the K-ADOS had sufficient specificity and sensitivity. More recently, after the ADOS-2 was translated into Korean [26], Lee et al. [13] evaluated the validity of the Toddler Module and Modules 1 and 2 of the K-ADOS-2 on 143 South Korean toddlers and preschoolers. They found that the modules had adequate sensitivity, specificity, and internal consistency with respect to age. However, these previous studies were limited by their small sample size and relatively narrow age range of participants; therefore, research on the use and applicability of the K-ADOS-2, particularly on older children, adolescents, and adults, is still limited.

Further, researchers have emphasized the importance of establishing the utility of diagnostic instruments in differentiating ASD from other disabilities because ASD is often accompanied by and shows behavioral overlap with many neurodevelopmental and behavioral disorders (e.g., intellectual disabilities, anxiety disorders, and attention-deficit/hyperactivity disorders [ADHD]) [27,28,29,30], complicating the diagnostic process. Lee et al. [13] showed that the sensitivity and specificity of the K-ADOS-2 in distinguishing children and toddlers with ASD from those without ASD who had other developmental delays or language delays ranged from 94 to 100% and from 82 to 100%, respectively. Yet, no other studies have examined the clinical validity of the K-ADOS-2 in distinguishing ASD from other developmental disabilities (OD), notably in Modules 3 and 4. It is particularly important to examine the diagnostic accuracy of the K-ADOS-2 in differentiating ASD from OD in adolescent and adult populations because the diagnostic process is considered more complicated due to increased comorbidities [31, 32]. Indeed, Langmann et al. [29] reported that the diagnostic validity of Module 4 in distinguishing ASD from other clinical samples (e.g., personality disorders, behavioral and emotional disorders, anxiety and/or compulsive disorders) was low for older adults and individuals with high cognitive and verbal ability, suggesting the need for further research.

Therefore, the purpose of this study was to expand on previous findings [13, 18], examine the psychometric properties, and establish the diagnostic validity of the K-ADOS-2 across all modules (i.e., Toddler Module and Modules 1–4) with a larger number of participants. Specifically, we aimed to investigate (1) the diagnostic validity of all modules of the K-ADOS-2 algorithms, (2) its agreement with existing ASD diagnostic instruments, and (3) the reliability of all modules of the K-ADOS-2 to examine whether it can be validly and reliably applied to the South Korean population across all ages. Additionally, we preliminarily explored if the K-ADOS-2 could be used to differentiate ASD from OD.

Methods

Participants

This study is a secondary analysis of pooled data with research samples collected from 2008 to 2017 from several projects aimed at identifying ASD biomarkers, conducting randomized controlled trials of social skills training, and developing an early ASD screening instrument. All the participants were enrolled via patient referrals from child and adolescent psychiatric, pediatric and child rehabilitation departments, and communities such as local clinics and daycare centers, recruitment posters on online/offline bulletin boards of public institutions, and online parenting communities. Participants from the social skills training programs consisted of participants with ASD; participants recruited for identification of ASD biomarkers and development of the early ASD screening instrument included both participants with ASD and without ASD. The examiners were blinded to the diagnostic characteristics of the participants, and clinical best-estimate diagnoses were determined by experienced clinicians, including two licensed child psychiatrists. One institution was in charge of recruiting participants and conducting all evaluations for all projects.

A total of 2158 participants were included in this study (mean age [standard deviation] = 79.7 [64.0] months; age range = 12–393 months; 1588 males; Toddler Module, n = 289; Module 1,^{Footnote 1} n = 642; Module 2, n = 574; Module 3, n = 411; Module 4, n = 242; 1473 participants with ASD, 685 participants without ASD, and 123 participants with OD). Participants with OD consisted of participants who were diagnosed as not having ASD based on clinical best-estimate diagnosis and obtained scores lower than 80 in either the full-scale intelligence quotients (FSIQ) or Korean Vineland Adaptive Behavior Scales, Second Edition (K-VABS; [33]) and therefore were considered as a subgroup of participants without ASD.

We aimed to categorize the OD group to represent individuals with potential intellectual disabilities or developmental delays. Although we were not able to confirm the clinical diagnostic status of the OD group, we wanted to, at least preliminarily, examine if the ADOS-2 can be used to differentiate individuals with ASD from individuals with at least some developmental problems in terms of adaptive skills and intellectual functioning. Diagnostic criteria of intellectual disability include deficits in intellectual and adaptive functioning observed during the developmental period [1], and, therefore, we used the FSIQs and K-VABS scores to identify individuals who may have an intellectual disability. We included participants with IQ scores lower than 80 to include those who have borderline intellectual functioning (i.e., individuals who function on the border between intellectual disability and normal intellectual functioning; [34]). Because the construct of adaptive behavior captures whether an individual has the conceptual, social, and practical skills expected of their age, development, and culture [35,36,37], we used the VABS score as a proxy for potential developmental delay.^{Footnote 2}
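The OD grouping rule described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `is_od`, its parameters, and the treatment of missing scores (a missing score simply cannot trigger the rule) are assumptions made here for clarity.

```python
from typing import Optional

# Threshold stated in the text: scores lower than 80 on either measure.
OD_THRESHOLD = 80


def is_od(has_asd: bool, fsiq: Optional[float], vabs: Optional[float]) -> bool:
    """Return True if a participant would fall into the OD subgroup.

    has_asd: clinical best-estimate diagnosis (True = ASD).
    fsiq, vabs: FSIQ and K-VABS composite scores; None when unavailable
    (hypothetical convention, not described in the source).
    """
    if has_asd:
        # OD is defined only within the non-ASD group.
        return False
    available = [score for score in (fsiq, vabs) if score is not None]
    # "Either" measure below 80 is enough to qualify.
    return any(score < OD_THRESHOLD for score in available)
```

Under this convention, a non-ASD participant with no available FSIQ or K-VABS score is never labeled OD, which mirrors the caveat that some participants might have been categorized as OD had all scores been available.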

Diagnostic procedures are presented in the Procedures section. Detailed characteristics of the total participants and participants by module are included in Tables 1 and 2. Information on participant characteristics for each developmental cell of the Toddler Module, Module 1, and Module 2 is available in Additional file 1: Table S1. Detailed characteristics of the OD participants are available in Additional file 1: Table S2.

Table 1 Participant characteristics

Table 2 Participant characteristics by module

Procedures

Participants and their parents completed a battery of tests during their one-time visit, including the K-ADOS or K-ADOS-2, ADI-R, the Korean version of Childhood Autism Rating Scale (K-CARS), Korean Vineland Social Maturity Scale (K-SMS), and cognitive tests measuring FSIQs. Questionnaires, such as the Social Responsiveness Scale-2 (SRS-2), Social Communication Questionnaire (SCQ), and K-VABS, were mailed and filled out prior to the visit. The K-ADOS or K-ADOS-2 and ADI-R were administered by research-reliable professionals or research assistants who worked alongside them in the same laboratory on a daily basis and were trained prior to the actual administration. The scales were administered only after an adequate level of inter-rater reliability with the research-reliable professionals (> 80%) was reached. All administrations of the K-ADOS or K-ADOS-2 and ADI-R were videotaped and double-checked by these professionals to confirm the quality and reliability.

Subsequently, two board-certified psychiatrists made the best-estimate clinical diagnoses of ASD or non-ASD based on the DSM-5 criteria [1]. The clinical best-estimate diagnosis was made according to the information gathered collectively from all tests administered, including the K-ADOS/K-ADOS-2, ADI-R, SCQ, SRS-2, K-CARS, K-SMS, K-VABS, IQ assessments, and observed clinical impressions. The study was approved by the Institutional Review Board (IRB) of Seoul National University Bundang Hospital (IRB no. B-2110–716-102).

Measures

Autism Diagnostic Observation Schedule and Autism Diagnostic Observation Schedule-2 (ADOS and ADOS-2 [4, 5])

This study used the Korean translated versions of the ADOS/ADOS-2, approved by its publisher Western Psychological Services. Assessments conducted prior to July 2017, when the ADOS-2 was published in Korea, used the original K-ADOS. The results from the K-ADOS were rescored based on the K-ADOS-2 algorithm for this study. The modules range from the Toddler Module, for children aged 30 months and younger, to Module 4, for verbally fluent older adolescents and adults. The diagnostic algorithms for the Toddler Module and Modules 1 and 2 are further subdivided into developmental cells based on age/language. The algorithm for the Toddler Module is divided into two developmental cells: toddlers aged 12–20 months or nonverbal toddlers aged 21–30 months (12–20/NV21–30) and toddlers aged 21–30 months with some words (21–30SW). The algorithm of Module 1 is divided into two developmental cells based on expressive language level: no words (NW) and some words (SW). The algorithm of Module 2 is divided into two developmental cells based on age groups: < 5 years and ≥ 5 years.
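The developmental-cell subdivision described above amounts to a small lookup on module, age, and expressive language. The sketch below is illustrative only: the function name, the module codes ("T", "1" to "4"), the `has_words` flag, and the ASCII cell labels are assumptions introduced here, and the choice of module itself (which depends on clinical judgment of language level) is taken as given.

```python
from typing import Optional


def developmental_cell(module: str,
                       age_months: Optional[int] = None,
                       has_words: Optional[bool] = None) -> str:
    """Map a module plus age/language information to a developmental cell."""
    if module == "T":
        if age_months is None or has_words is None:
            raise ValueError("Toddler Module needs age and language level")
        if not 12 <= age_months <= 30:
            raise ValueError("Toddler Module covers 12-30 months")
        # 12-20 months at any language level, or nonverbal at 21-30 months.
        if age_months <= 20 or not has_words:
            return "12-20/NV21-30"
        return "21-30SW"
    if module == "1":
        if has_words is None:
            raise ValueError("Module 1 needs the language level")
        return "SW" if has_words else "NW"
    if module == "2":
        if age_months is None:
            raise ValueError("Module 2 needs the age")
        # Split at 5 years (60 months).
        return "<5 years" if age_months < 60 else ">=5 years"
    if module in ("3", "4"):
        # Modules 3 and 4 are not subdivided.
        return "Module " + module
    raise ValueError("unknown module: " + module)
```

For example, `developmental_cell("T", age_months=25, has_words=False)` lands in the 12-20/NV21-30 cell because the toddler is nonverbal.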

All modules provide two cutoff points in the classification algorithms. For Modules 1 through 4, there is a higher cutoff for stringent classification (i.e., autism) and a lower cutoff for more inclusive classification (i.e., autism spectrum disorder; ASD). For Module 4, we applied the revised algorithm from Hus and Lord [38]. The Toddler Module also has a higher cutoff for stringent classification (moderate–severe concern) and a lower cutoff for more inclusive classification (mild–moderate concern), which were specified in Esler et al. [39]. Alternatively, Luyster et al. [40] provided a single research cutoff point for the Toddler Module and explained that a single cutoff needs to be applied in the Toddler Module due to the relative lack of diagnostic stability in younger children. In this study, we primarily relied on the results calculated based on the ASD cutoff for Modules 1–4 and Luyster et al.’s [40] cutoff point for the Toddler Module to make the decisions regarding validity. The diagnostic validity of the Toddler Module calculated based on Esler et al.’s [39] cutoff point system is presented in Additional file 1: Table S3.

Autism Diagnostic Interview-Revised (ADI-R [6])

The ADI-R is a semi-structured caregivers’ interview used to diagnose or evaluate the core symptoms of ASD. Each item is scored and converted on a scale of 0, 1, and 2, with higher scores indicating a greater number of and/or clear symptoms of ASD. The ADI-R includes 93 items describing four diagnostic domains: social interaction, communication, RRBs, and abnormality of development evident at or before 36 months. Each domain has a diagnostic criterion, but individuals must exceed all four cutoff scores to be classified as ASD. While the majority of the algorithm score consists of parents’ descriptions of a child’s behaviors between the ages of 4–5 years, some items ask whether the behavior has ever been present during the child's lifetime. For children under 4 years of age, ratings on current behaviors are used. The Korean translation of the ADI-R [25], approved by its publisher Western Psychological Services, was used in this study.

Social Communication Questionnaire [41]

The SCQ is a caregiver-report screening instrument for ASD designed to evaluate an individual’s behavior in three domains: social interaction, language and communication, and RRB. The SCQ includes 40 items to be rated as either “yes” or “no.” It consists of two forms: the Lifetime Form, which focuses on an individual’s developmental history, and the Current Form, which inspects an individual’s behaviors over the past three months. The total score in the Lifetime Form is used to determine if an individual is likely to have ASD, and whether a more extended diagnostic evaluation needs to be undertaken. In this study, we used a cutoff score of 10, for children under 47 months of age, and 12, for children over 48 months, based on a standardization study conducted in Korea [42].
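The age-dependent SCQ cutoff can be written as a small helper. This is a sketch under stated assumptions: the text gives cutoffs for children under 47 months and over 48 months, so placing the boundary at 48 months is an assumption, as is treating a total score equal to the cutoff as screen-positive. The function names are hypothetical.

```python
def scq_cutoff(age_months: int) -> int:
    """Return the SCQ cutoff for a given age in months.

    Assumption: the boundary between the two cutoffs is placed at 48 months.
    """
    return 10 if age_months < 48 else 12


def scq_screen_positive(total_score: int, age_months: int) -> bool:
    """Assumption: a score at or above the cutoff counts as screen-positive."""
    return total_score >= scq_cutoff(age_months)
```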

Social Responsiveness Scale-2 (SRS-2 [43])

The SRS-2 is a 65-item parent-report questionnaire that assesses the severity of ASD-related symptoms on a 4-point scale, with higher total scores reflecting more severe ASD symptomatology. It consists of five subscales: social awareness, social cognition, social communication, social motivation, and autistic mannerisms. The SRS-2 has been used extensively in the ASD literature as a diagnostic measure [44] and is reported to have good internal consistency and concurrent, discriminant validity [45]. Chun et al. [46] demonstrated adequate levels of sensitivity and specificity of the Korean translated version of the SRS-2. A cutoff T-score of 65 was applied regardless of gender in the preschool form of the SRS-2, and cutoff T-scores of 70 and 63 were used for female and male participants, respectively, for the school-age and adult forms of the SRS-2 because these values are widely used across clinical settings in South Korea.
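The form- and sex-specific SRS-2 cutoffs described above can be captured as follows. This is only an illustrative sketch: the function names and the string labels for forms and sexes are hypothetical, and treating a T-score equal to the cutoff as positive is an assumption not stated in the text.

```python
def srs2_cutoff(form: str, sex: str) -> int:
    """Return the SRS-2 T-score cutoff for a form and participant sex."""
    if form == "preschool":
        # A single cutoff is applied regardless of gender.
        return 65
    if form in ("school-age", "adult"):
        if sex == "female":
            return 70
        if sex == "male":
            return 63
        raise ValueError("unknown sex label: " + sex)
    raise ValueError("unknown form: " + form)


def srs2_positive(t_score: float, form: str, sex: str) -> bool:
    """Assumption: a T-score at or above the cutoff counts as positive."""
    return t_score >= srs2_cutoff(form, sex)
```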

Korean version of the Childhood Autism Rating Scale (K-CARS [47])

The CARS [48] is a clinician-rated scale developed to screen for ASD. Consisting of 15 items rating the presence and severity of symptoms associated with ASD, the CARS is scored from 1 (no impairment observed or reported) to 4 (severe impairment). There is no consensus on the cutoff score of the K-CARS; Shin and Kim [49] suggested a cutoff score of 28, while others recommend 24 [50]. Therefore, we utilized both cutoff scores in this study.

Full-Scale Intelligence Quotients (FSIQ)

The following instruments were used to calculate FSIQ in this study: the Wechsler Preschool and Primary Scale of Intelligence (WPPSI) [51] for children aged 2 years and 6 months to 6 years, Wechsler Intelligence Scale for Children (WISC) [52] for children aged 6–16 years, and Wechsler Adult Intelligence Scale (WAIS) [53] for individuals over 16 years of age. These instruments utilize chronological age standardization with a mean of 100 and a standard deviation of 15.

Korean version of the Vineland Adaptive Behavior Scale, second edition (K-VABS [33, 54])

The VABS is a parent or other caregiver’s rating of a person’s adaptive functioning and social self-sufficiency from birth to adulthood. The VABS consists of five domains: communication, daily living skills, socialization, motor skills, and maladaptive behavior. It is scored on a 0–2 rating scale, with a higher score representing skills used more frequently. The five domains together yield a total adaptive behavior composite score. The normative mean of the composite score is 100, with a standard deviation of 15. We used the Korean version of the parent/caregiver rating form of VABS, which was highly correlated with the survey interview form of VABS and showed sufficient validity among Koreans [55].

Korean Vineland Social Maturity Scale (K-SMS [56])

The K-SMS is a clinician-rated instrument that assesses social and adaptive maturity. Originally developed using the Doll’s Vineland Social Maturity Scale [57], the K-SMS includes 89 items grouped by behavioral milestones that are expected at each age. It consists of eight subdomains (communication, general self-help, locomotion, occupation, self-direction, self-help eating, self-help dressing, and socialization skills) and provides a global social age and social quotient.

Nonverbal mental age

Data were collected from multiple studies aiming to fulfill different objectives; the age range of participants recruited for each study and, consequently, the scales used to assess the nonverbal mental age of participants varied across studies. Depending on the type and age range of the studies, we used the Beery-Buktenica Developmental Test of Visual-Motor Integration (VMI) or Leiter International Performance Scale in addition to the nonverbal subscale of WPPSI or WISC. The Beery-Buktenica Developmental Test measures the ability of an individual to integrate their visual perception and motor coordination [58]. The Leiter International Performance Scale assesses nonverbal performance intelligence and cognitive abilities [59]. Many participants were not able to participate in these assessments of nonverbal mental age due to lack of cooperation, and, additionally, some could not participate because they did not meet the minimum age range for participation. For instance, we could not collect the information about the nonverbal mental age of participants in the Toddler Module. However, we present the information on the nonverbal mental age of participants in Module 1, analyzed using the collected data, since Gotham et al. [22] reported that the specificity was low when Module 1 was applied to children with nonverbal mental age lower than 15 months.

We identified the nonverbal mental age of 30 participants in Module 1, calculated based on the WPPSI or WISC scores, and, of these 30 participants, none of the participants in Module 1 had a nonverbal mental age lower than 15 months. We also identified the VMI scores from 74 participants of the participants in Module 1, and the developmental age calculated based on the VMI scores of all 74 participants exceeded 35 months (mean developmental age = 43.4 months, SD = 10.4). Additionally, we identified the Leiter International Performance Scale of 169 participants in Module 1, and five participants with ASD from Module 1 had a nonverbal mental age lower than 15 months. We conducted sensitivity tests of the entire analysis on Module 1, Module 1 SW, and Module 1 NW after eliminating these five participants, and eliminating these participants resulted in very minimal changes in analyses.

Statistical analyses

Initially, we computed a set of independent t tests comparing the age, FSIQ, and scores from K-ADOS-2, ADI-R, K-CARS, SCQ, and SRS-2 of participants with ASD and those without ASD. Calibrated severity scores (CSS; i.e., a severity metric that takes age and language level into account) [60] were used to compare the K-ADOS-2 scores.

To address the first aim, the sensitivity, specificity, PPV, NPV, and Cohen’s kappa (k) between ASD and non-ASD were calculated to check for consistency between the best-estimate clinical diagnosis and diagnosis based on ASD cutoff for K-ADOS-2 Modules 1–4 and Luyster et al.’s [40] cutoff point for Toddler Module. This analysis was conducted on all modules combined, each module (including Toddler Module and Modules 1, 2, 3, and 4) individually, and each developmental cell (12–20/NV21–30 and 21–30 SW in Toddler Module, NW and SW in Module 1, and under and over 5 years of age in Module 2). We also computed the area under the receiver operating characteristic (ROC) curve of all items by developmental cell to explore if all items included in the algorithm have sufficient diagnostic accuracy according to the area under the curve (AUC).
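The accuracy indices named above all derive from a 2 × 2 table that crosses the best-estimate clinical diagnosis with the K-ADOS-2 classification. The following is a minimal sketch of those standard formulas, not the SPSS procedure used in the study; the function name and the dictionary keys are illustrative, and the counts in the usage note are made up.

```python
def diagnostic_metrics(tp: int, fn: int, fp: int, tn: int) -> dict:
    """Compute accuracy indices from a 2 x 2 table.

    tp: ASD by clinical diagnosis and above the K-ADOS-2 cutoff
    fn: ASD by clinical diagnosis but below the cutoff
    fp: non-ASD by clinical diagnosis but above the cutoff
    tn: non-ASD by clinical diagnosis and below the cutoff
    """
    n = tp + fn + fp + tn
    observed = (tp + tn) / n
    # Chance agreement from the marginal totals of both classifications.
    expected = ((tp + fn) * (tp + fp) + (fp + tn) * (fn + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed - expected) / (1 - expected),
    }
```

With hypothetical counts `diagnostic_metrics(tp=90, fn=10, fp=5, tn=95)`, sensitivity is 0.90, specificity is 0.95, and kappa is 0.85. The sketch does not guard against empty rows or columns, which would cause a division by zero.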

To investigate the second aim, we computed Pearson’s r correlation coefficients between the total scores of K-ADOS-2 and those of existing ASD diagnostic instruments (i.e., ADI-R, K-CARS, SCQ, and SRS-2) for all modules combined, each module individually, and each developmental cell. Additionally, k values were calculated between the diagnosis based on the K-ADOS-2 ASD cutoff (and Luyster et al.’s [40] cutoff point for Toddler Module) and the diagnosis based on the existing ASD diagnostic instruments. The k values were interpreted based on McHugh’s [61] criteria (0–0.2, none; 0.21–0.39, minimal; 0.40–0.59, weak; 0.60–0.79, moderate; 0.8–0.9, strong; above 0.9, almost perfect). For the third aim, Cronbach’s α values for the algorithm items and values after an item was removed were computed to examine the internal consistency of each developmental cell.
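The interpretation bands listed above translate directly into a lookup. The sketch below follows the bands as quoted in the text; the function name is hypothetical, and assigning values that fall between the printed bands (for example, between 0.20 and 0.21) to the next band up is an assumption.

```python
def interpret_kappa(k: float) -> str:
    """Label a kappa value using the bands quoted in the text."""
    if k <= 0.20:
        return "none"
    if k < 0.40:
        return "minimal"
    if k < 0.60:
        return "weak"
    if k < 0.80:
        return "moderate"
    if k <= 0.90:
        return "strong"
    return "almost perfect"
```

For instance, `interpret_kappa(0.83)` returns `"strong"` and `interpret_kappa(0.92)` returns `"almost perfect"`.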

Finally, we calculated the sensitivity, specificity, PPV, NPV, and k values to examine how accurately the K-ADOS-2 ASD cutoff can distinguish ASD from OD for all modules combined, each module individually, and each developmental cell. We did not compare the diagnostic validity between OD and the remaining participants without ASD (i.e., participants who were not diagnosed with ASD and did not have FSIQ or VABS scores lower than 80) because this sample included a few participants for whom we did not have all FSIQ and VABS scores and therefore would have been categorized as OD if all relevant information was available.

All analyses except for the calculation of Cronbach’s α values were repeated using the Autism cutoff for Modules 1–4 and moderate–severe concern for the Toddler Module. All statistical analyses were performed using Excel and SPSS Statistics (version 23.0; IBM Corp., Armonk, NY, USA).

Results

There were statistically significant inter-group differences between ASD and non-ASD in all algorithm scores of the K-ADOS-2, ADI-R, K-CARS, SCQ, and SRS-2 (p < 0.01) in the composite K-ADOS-2 and across all developmental cells, except in the ADI-R Communication domain in the 12–20/NV21–30 developmental cell (Tables 1, 2 and Additional file 1: Table S1).

All developmental cells of the K-ADOS-2 showed sufficient ranges of sensitivity, 85.4–100.0%; specificity, 80.4–96.8%; area under the ROC curve, 0.90–0.97; PPV, 77.8–99.3%; NPV, 80.6–100.0%; and k values, 0.83–0.92.^{Footnote 3} Detailed results of the sensitivity, specificity, AUC, PPV, NPV, and k values between ASD versus non-ASD by module and developmental cell are presented in Table 3.

Table 3 Sensitivity, specificity, AUC, PPV, NPV, and Cohen’s kappa between ASD and non-ASD based on ASD cutoff criteria

The AUC values of the majority of algorithm items in each developmental cell exceeded 0.70 (range = 0.70–0.93). The list of algorithm items with AUC values lower than 0.70 is presented in Table 4 by developmental cell. Across all developmental cells, the AUCs of the Hand and Finger and Other Complex Mannerisms item were consistently lower than 0.70, and all items with AUC lower than 0.70 were from the RRB algorithm.
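The item-level screening reported above (flagging algorithm items whose AUC falls below 0.70) can be illustrated with a rank-based AUC, which equals the probability that a randomly chosen participant with ASD scores higher on the item than a randomly chosen participant without ASD, with ties counted as one half. This is a sketch of that standard definition, not the SPSS routine used in the study; the function names, item names, and scores are hypothetical.

```python
def auc(asd_scores: list, non_asd_scores: list) -> float:
    """Rank-based AUC: P(ASD score > non-ASD score) + 0.5 * P(tie)."""
    wins = 0.0
    for a in asd_scores:
        for b in non_asd_scores:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(asd_scores) * len(non_asd_scores))


def low_auc_items(item_scores: dict, threshold: float = 0.70) -> list:
    """Return names of items whose AUC is below the threshold.

    item_scores maps an item name to a pair
    (scores of participants with ASD, scores of participants without ASD).
    """
    return [name for name, (asd, non_asd) in item_scores.items()
            if auc(asd, non_asd) < threshold]
```

The pairwise loop is quadratic in sample size, which is fine for illustration but slow for large groups; a sort-based rank computation gives the same value.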

Table 4 Algorithm items with AUC values lower than .7

The total scores of the K-ADOS-2 were significantly and positively correlated with the ADI-R, SCQ, SRS-2, and K-CARS scores across all modules and developmental cells. Pearson’s r correlations were 0.60–0.90 for Toddler Module (12–20/NV21–30), 0.57–0.90 for Toddler Module (21–30SW), 0.45–0.80 for Module 1 (NW), 0.54–0.78 for Module 1 (SW), 0.66–0.88 for Module 2 (< 5 yo), 0.55–0.68 for Module 2 (≥ 5 yo), 0.52–0.82 for Module 3, and 0.47–0.84 for Module 4. The kappa agreements between the K-ADOS-2 modules and existing diagnostic instruments were 0.48–0.85 for Toddler Module (12–20/NV21–30), 0.47–0.90 for Toddler Module (21–30SW), 0.35–0.82 for Module 1 (NW), 0.38–0.64 for Module 1 (SW), 0.54–0.72 for Module 2 (< 5 yo), 0.20–0.42 for Module 2 (≥ 5 yo), 0.33–0.73 for Module 3, and 0.25–0.57 for Module 4, suggesting weak-to-strong agreement. Detailed results of Pearson’s correlations and kappa values with existing diagnostic instruments by module and developmental cell are presented in Table 5.

Table 5 Agreement with existing instrument based on ASD cutoff criteria

All modules and developmental cells had high internal consistencies, with α values ranging from 0.82 to 0.91. Removing any single item produced no-to-minimal changes (that is, a change of less than 0.03 in α values). The complete results of the reliability analysis are presented in Table 6.
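The reliability figures above rest on Cronbach's α and on α recomputed with each item removed in turn. The sketch below uses the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total score); it is an illustration rather than the SPSS output used in the study, and the function names and example scores are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows: list) -> float:
    """Cronbach's alpha for rows of item scores (one row per participant)."""
    k = len(rows[0])
    # Sample variance of each item across participants.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of the participants' total scores.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def alpha_if_item_deleted(rows: list) -> list:
    """Alpha recomputed after dropping each item in turn (needs >= 3 items)."""
    k = len(rows[0])
    return [cronbach_alpha([row[:j] + row[j + 1:] for row in rows])
            for j in range(k)]
```

Comparing each entry of `alpha_if_item_deleted(rows)` with `cronbach_alpha(rows)` gives the per-item change that the text summarizes as less than 0.03.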

Table 6 Results of reliability analysis

There were no significant differences in participants’ age between the OD and ASD groups except in Module 2 (OD, M = 50.2 months; ASD, M = 66.3 months; p = 0.01). The IQ scores of the OD and ASD groups differed significantly only in Module 4 (OD, M = 73.0; ASD, M = 97.5; p = 0.0001). There were statistically significant group differences between OD and ASD in all algorithm scores of the K-ADOS-2 and ADI-R (ps < 0.05) across all developmental cells (Additional file 1: Table S2). When using the ASD cutoff to distinguish OD from ASD, all modules and developmental cells of the K-ADOS-2 had sufficient sensitivity, specificity, AUC, PPV, and NPV except for NPV in the Toddler Module and Module 1. Sensitivity across the developmental cells ranged from 85.4 to 100.0%; specificity, 66.7–94.7%; AUC, 0.83–0.97; PPV, 93.3–99.5%; and NPV, 50.0–100% (Table 7). The k values with the final diagnosis ranged between 0.53 and 0.92, suggesting moderate-to-almost perfect agreement based on McHugh’s [61] criteria.

Table 7 Sensitivity, specificity, AUC, PPV, NPV, and Cohen’s kappa between ASD and OD based on ASD cutoff criteria

Additional file 1: Tables S4 and S5 present the sensitivity, specificity, AUC, PPV, NPV, and k values between ASD and non-ASD and those between ASD and OD, respectively, calculated based on the Autism cutoff score for Modules 1–4 and moderate–severe concern range for Toddler Module. Additional file 1: Table S6 presents the agreement with existing diagnostic instruments, calculated based on the Autism cutoff score for Modules 1–4 and moderate–severe concern range for Toddler Module.

Discussion

This study showed that the K-ADOS-2 has excellent diagnostic validity in distinguishing individuals with ASD from those without ASD with sufficiently high sensitivity, specificity, AUC, PPV, and NPV across a wide age group. Moreover, all modules and developmental cells of the K-ADOS-2 demonstrated sufficient reliability. These findings provide additional evidence that the ADOS-2 can be adapted for various cultural settings [7, 13, 16, 62]. This suggests that although there can be cultural differences in the interpretation of severity and appropriateness of autistic behaviors [20, 21], the behavior patterns that need to be considered when diagnosing ASD may not differ across cultures.

Further, compared to previous adaptation studies conducted in other countries, such as the Netherlands [62] and Poland [7], the K-ADOS-2 exhibited higher sensitivity and specificity values. Because Lee et al. [13] also highlighted the importance of highly trained, research-reliable clinicians in establishing strong validity and specificity for a measure, we postulate that the positive result from this study may be due to the strict, reliable administration and coding process, in which research-reliable professionals double-checked all K-ADOS-2 administrations.

Examination of the AUC for each item indicated that all algorithm items in the social affect domain had an acceptable ability to distinguish ASD from non-ASD. Meanwhile, the Hand and Finger and Other Complex Mannerisms item showed consistently low AUC across all developmental cells, and the items with an AUC lower than 0.7 were all from the RRB algorithm. Similarly, previous studies have suggested that social communication items tend to distinguish individuals with ASD from those without ASD more accurately than RRB items [63, 64, 65]. Given the brevity of the observation period during the K-ADOS-2 and the variability in the frequency and types of RRBs across observational contexts (i.e., clinic vs. home) [66], it is possible that clinicians do not have sufficient opportunity to observe these types of RRBs during the K-ADOS-2. We therefore suggest complementing the results of the K-ADOS-2 with other diagnostic instruments, such as the ADI-R, that rely on longer-term observation by parents or teachers, particularly when assessing RRBs.
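
An item-level AUC can be read as the probability that a randomly chosen participant with ASD receives a higher item code than a randomly chosen participant without ASD, with ties counted as one half. A minimal sketch under that definition follows; the function name `auc` and all item codes are invented for illustration and do not reproduce the study data.

```python
def auc(asd_scores, comparison_scores):
    """AUC as the share of ASD/comparison pairs in which the ASD score is
    higher, counting tied pairs as one half (Mann-Whitney formulation)."""
    wins = 0.0
    for a in asd_scores:
        for c in comparison_scores:
            if a > c:
                wins += 1.0
            elif a == c:
                wins += 0.5
    return wins / (len(asd_scores) * len(comparison_scores))


# Hypothetical item codes on a 0-2 scale (assumption, not study data).
social_item_asd = [2, 2, 1, 2, 1, 2, 1, 2]
social_item_cmp = [0, 0, 1, 0, 0, 1, 0, 0]
rrb_item_asd = [0, 0, 1, 0, 2, 0, 0, 1]
rrb_item_cmp = [0, 0, 0, 0, 1, 0, 0, 0]

social_auc = auc(social_item_asd, social_item_cmp)
rrb_auc = auc(rrb_item_asd, rrb_item_cmp)
print(f"social affect item AUC: {social_auc:.2f}")
print(f"RRB item AUC: {rrb_auc:.2f}")
```

In this toy data set the social affect item yields an AUC of about 0.95, whereas the RRB item, which is coded 0 for most participants in both groups, yields about 0.63. An item that is rarely observed in either group produces many tied pairs, which pulls the AUC toward 0.5.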

The K-ADOS-2 scores were correlated with the ADI-R, SRS-2, SCQ, and K-CARS scores across all developmental cells and modules, suggesting sufficient concurrent validity. Interestingly, the Pearson’s r coefficients between the K-ADOS-2 and both the ADI-R and K-CARS tended to be greater than those between the K-ADOS-2 and the SCQ and SRS-2. This pattern could be explained by the inherent shortcomings of parent-report questionnaires (i.e., the SRS-2 and SCQ). Caregivers may have responded to the questions based on their own interpretations without an accurate understanding of the concepts captured in each question [67]. Caregivers’ beliefs, characteristics, acceptance, and awareness of ASD may also have influenced how they interpreted their child’s behaviors [68, 69].

It is noteworthy that kappa agreements between diagnoses made by the K-ADOS-2 and those made by the SCQ, SRS-2, and K-CARS were weak in some modules and developmental cells. In particular, the kappa agreements between the K-ADOS-2 and K-CARS were low in the ≥ 5 years developmental cell of Module 2 and in Modules 3 and 4. However, considering that the Pearson’s correlations between them were strong and significant, we postulate that this discrepancy may signal the need for more studies adjusting cutoff scores on the K-CARS in the Korean population, especially for verbally fluent children, adolescents, and adults with ASD. Indeed, due to the lack of consistency in the K-CARS cutoff score used in Korea, we applied two cutoff scores (i.e., [21, 41]) [49, 50] to calculate agreement with K-ADOS-2 scores.
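
To illustrate how a strong correlation can coexist with weak categorical agreement, the sketch below pairs two sets of hypothetical total scores that are almost perfectly correlated and dichotomizes the second at two different cutoffs. All scores and cutoff values are invented for illustration; they are not K-ADOS-2 or K-CARS data, and the cutoffs are not those cited in [49, 50].

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cohen_kappa(a, b):
    """Cohen's kappa for two lists of binary classifications."""
    n = len(a)
    observed = sum(u == v for u, v in zip(a, b)) / n
    p_a, p_b = sum(a) / n, sum(b) / n
    expected = p_a * p_b + (1 - p_a) * (1 - p_b)
    return (observed - expected) / (1 - expected)


# Hypothetical paired totals for 12 participants (assumption).
instrument_a = [2, 4, 5, 7, 8, 9, 10, 12, 14, 15, 17, 19]
instrument_b = [16, 18, 19, 22, 24, 25, 27, 28, 31, 33, 36, 38]

a_positive = [score >= 8 for score in instrument_a]  # hypothetical cutoff

r = pearson_r(instrument_a, instrument_b)
kappa_high_cutoff = cohen_kappa(a_positive, [s >= 30 for s in instrument_b])
kappa_low_cutoff = cohen_kappa(a_positive, [s >= 24 for s in instrument_b])

print(f"r = {r:.3f}")
print(f"kappa with cutoff 30 = {kappa_high_cutoff:.2f}")
print(f"kappa with cutoff 24 = {kappa_low_cutoff:.2f}")
```

In this stylized example the correlation is close to 1, but kappa is 0.40 when the second instrument is dichotomized at 30 and 1.00 when it is dichotomized at 24. The scores themselves are unchanged; only the placement of the cutoff differs.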

Similar to the findings from the previous K-ADOS-2 adaptation study of the Toddler Module and Modules 1 and 2 [13], applying an autism (i.e., higher) cutoff lowered the sensitivity and specificity compared to using an ASD (i.e., lower) cutoff. However, previous validation studies of the ADOS-2 conducted in Western countries such as the US [70] and Sweden [71] have reported more balanced specificities and sensitivities when applying an autism cutoff, suggesting that sample variability may impact the diagnostic validity of the ADOS-2 [13].

In our preliminary examination of the K-ADOS-2’s validity in differentiating ASD from OD, we found promising results: sensitivity, specificity, AUC, PPV, and Cohen’s kappa were satisfactory for all developmental cells. However, the NPV values of the Toddler Module and Module 1, particularly the 12–20/NV21–30 algorithm of the Toddler Module and the NW algorithm of Module 1, were relatively low. This suggests that children with developmental difficulties, especially those who do not use words to communicate, should be examined with additional diagnostic instruments even if the K-ADOS-2 identifies them as non-ASD. Notably, however, it is unclear whether some of the participants categorized as having OD in this study had a formal developmental disability diagnosis. Different patterns could have emerged if we had included individuals with a confirmed diagnosis of non-ASD developmental disabilities (e.g., intellectual disabilities) as a separate clinical control group, and future studies should investigate this possibility.
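
As a general property, predictive values depend not only on sensitivity and specificity but also on the proportion of participants with ASD in the sample being compared. The sketch below applies the standard relationship between NPV, sensitivity, specificity, and base rate; the function name and the input values are hypothetical and are not estimates from this study.

```python
def npv(sensitivity: float, specificity: float, base_rate: float) -> float:
    """Negative predictive value for a given proportion of true cases.

    base_rate is the share of the sample that truly has the condition.
    """
    true_negatives = specificity * (1 - base_rate)
    false_negatives = (1 - sensitivity) * base_rate
    return true_negatives / (true_negatives + false_negatives)


# Same hypothetical accuracy, two hypothetical sample compositions.
npv_balanced = npv(sensitivity=0.90, specificity=0.80, base_rate=0.50)
npv_mostly_asd = npv(sensitivity=0.90, specificity=0.80, base_rate=0.90)

print(f"NPV when half of the sample has ASD: {npv_balanced:.2f}")
print(f"NPV when 90% of the sample has ASD: {npv_mostly_asd:.2f}")
```

With these assumed inputs, NPV falls from about 0.89 to about 0.47 as the share of participants with ASD rises from 50% to 90%, even though sensitivity and specificity are held constant.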

Limitations

This study has several limitations, which suggest promising avenues for future studies. First, while the best-estimate clinical diagnosis was based on the combination of direct observation, caregiver reports, and other psychological assessments, the final diagnosis was not independent of the K-ADOS-2 scores. To establish its validity more accurately, we suggest that separate institutions independently implement the standard diagnostic procedures (which may or may not also include the K-ADOS-2). Second, the ratios of ASD-to-non-ASD and male-to-female participants were unbalanced for some modules and developmental cells in our sample. For instance, in Module 4, 93.2% of the 192 participants with ASD were male, while 38.0% of the 50 participants without ASD were male. We recommend that future iterations of the study recruit a balanced number of participants in terms of diagnostic status and gender.

As more studies are reporting sex differences in symptom presentation (e.g., fewer RRBs in female individuals), which may be contributing to sex biases in diagnostic tools and practices [72], future studies should examine whether there are sex differences in the validity of, and the symptom presentations captured by, the K-ADOS-2. Third, we calculated the nonverbal mental age of some participants using the VMI and the Leiter International Performance Scale. However, we did not collect the nonverbal mental age of all participants because this study is a secondary analysis of pooled data and many participants could not be assessed properly owing to limited cooperation or functional level. We retrieved available data to examine the patterns of nonverbal mental ages of participants in the Toddler Module and Module 1, but a few participants with nonverbal mental ages lower than 15 and 12 months may have been included in the analyses of Module 1 and the Toddler Module, respectively. Fourth, the discriminant validity of the instrument (ASD vs. OD) should be interpreted with caution because only a small number of participants with OD were included in each developmental cell and module, and participants without ASD may have been over-represented in this study, particularly in Modules 1–4.

Fifth, our information about the OD group was limited. We did not conduct additional or follow-up assessments to verify whether participants categorized as having OD actually had a clinical diagnosis of developmental disabilities or had comorbid disorders such as ADHD, anxiety, or obsessive–compulsive disorder (OCD). Due to missing FSIQ and VABS data, some participants without ASD who might otherwise have been categorized as OD were not categorized as such. Sixth, parents who responded to the VABS may not have answered the questions accurately, perhaps due to misinterpretation of the items, and we did not verify the accuracy of the VABS, which was used to categorize OD, by triangulating the results with other instruments measuring adaptive skills (e.g., the survey interview form of the VABS). Future studies should utilize a larger and more balanced sample, including participants with a confirmed diagnosis of developmental delays or intellectual disabilities or with frequently occurring comorbid disorders (e.g., ADHD or OCD), to confirm the validity of the K-ADOS-2 in differentiating ASD from OD.

Conclusions

This study demonstrates that the K-ADOS-2 is a valid and reliable instrument for diagnosing ASD based on its sensitivity, specificity, AUC, PPV, NPV, k value, Cronbach’s alpha, and moderate agreement with existing ASD diagnostic instruments. To our knowledge, this study is the first to examine the validity and reliability of all modules and developmental cells of the K-ADOS-2. We recommend that future studies compare K-ADOS-2 scores with best-estimate clinical diagnoses made using independent administration of standard diagnostic procedures and include balanced numbers of participants in terms of gender and diagnostic status. Further, we suggest the need for studies recruiting larger samples and participants with formal diagnoses of developmental disabilities.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available because we do not have permission from the IRB to share the de-identified participant information or make it available online and did not obtain consent from the participants to do so. However, the datasets are available from the corresponding author on reasonable request.

Notes

We included the data from participants who were younger than 30 months when they received Module 1 (n = 10) if these data were collected prior to the publication of the Toddler Module in Korean.
We conducted a set of independent t-tests comparing the available Child Behavior Checklist (CBCL) subscale scores of participants with OD and participants without ASD who were not categorized as OD (i.e., typically developing; TD) to provide information on potential comorbid conditions of the OD group. These analyses included 71 participants with OD (57% of all participants with OD) and 229 TD participants (40.0% of TD participants). Participants with OD scored significantly higher than TD participants on all Syndrome Subscales except the Social Problems subscale (all ps < .05).
We conducted a set of sensitivity tests excluding the participants categorized as OD, and the changes in the values of sensitivity, specificity, AUC, PPV, NPV, k, and α were minimal (i.e., all changes in sensitivity, specificity, PPV, and NPV were in the tenths place, and all changes in AUC, k, and α were in the hundredths place).

References

American Psychiatric Association. Diagnostic and statistical manual of mental disorders (DSM-5). Washington, DC: Author; 2013.
Guthrie W, Swineford LB, Nottke C, Wetherby AM. Early diagnosis of autism spectrum disorder: stability and change in clinical diagnosis and symptom presentation. J Child Psychol Psychiatry. 2013;54(5):582–90.
Kim SH, Lord C. Combining information from multiple sources for the diagnosis of autism spectrum disorders for toddlers and young preschoolers from 12 to 47 months of age. J Child Psychol Psychiatry. 2012;53(2):143–51.
Lord C, Rutter M, DiLavore PC, Risi S. Autism diagnostic observation schedule (ADOS). Los Angeles, CA: Western Psychological Services; 1999.
Lord C, Rutter M, DiLavore PC, Risi S, Gotham K, Bishop SL. Autism diagnostic observation schedule, (ADOS-2) modules 1–4. Los Angeles, CA: Western Psychological Services; 2012.
Rutter M, Le Couteur A, Lord C. Autism diagnostic interview-revised (ADI-R). Los Angeles, CA: Western Psychological Services; 2003.
Chojnicka I, Pisula E. Adaptation and validation of the ADOS-2, Polish version. Front Psychol. 2017;8:1916.
Falkmer T, Anderson K, Falkmer M, Horlin C. Diagnostic procedures in autism spectrum disorders: a systematic literature review. Eur Child Adolesc Psychiatry. 2013;22(6):329–40.
Lord C, Rutter M, Goode S, Heemsbergen J, Jordan H, Mawhood L, et al. Autism diagnostic observation schedule: a standardized observation of communicative and social behavior. J Autism Dev Disord. 1989;19(2):185–212.
Dorlack TP, Myers OB, Kodituwakku PW. A comparative analysis of the ADOS-G and ADOS-2 algorithms: preliminary findings. J Autism Dev Disord. 2018;48(6):2078–89.
Akshoomoff N, Corsello C, Schmidt H. The role of the autism diagnostic observation schedule in the assessment of autism spectrum disorders in school and community settings. Calif School Psychol. 2006;11:7–19.
Lord CLR, Gotham K, Guthrie W. Autism diagnostic observation schedule, second edition (ADOS-2) manual. Torrance, CA: Western Psychological Services; 2012.
Lee KS, Chung SJ, Thomas HR, Park J, Kim SH. Exploring diagnostic validity of the autism diagnostic observation schedule-2 in South Korean toddlers and preschoolers. Autism Res. 2019;12(9):1356–66.
Western Psychological Services. Published translations 2018. Available from: http://www.wpspublish.com/app/OtherServices/PublishedTranslations.aspx
Hong JS, Singh V, Kalb L, Ashkar A, Landa R. Replication study of ADOS-2 toddler module cut-off scores for autism spectrum disorder classification. Autism Res. 2021;14(6):1284–95.
Sun X, Allison C, Auyeung B, Zhang Z, Matthews FE, Baron-Cohen S, et al. Validation of existing diagnosis of autism in mainland China using standardised diagnostic instruments. Autism. 2015;19(8):1010–7.
Rudra A, Banerjee S, Singhal N, Barua M, Mukerji S, Chakrabarti B. Translation and usability of autism screening and diagnostic tools for autism spectrum conditions in India. Autism Res. 2014;7(5):598–607.
Kim SH, Kim YS, Koh Y-J, Lim E-C, Kim S-J, Leventhal BL. Often asked but rarely answered: Can Asians meet DSM-5/ICD-10 autism spectrum disorder criteria? J Child Adolesc Psychopharmacol. 2016;26(9):835–42.
Hambleton RK, Merenda PF, Spielberger CD. Adapting educational and psychological tests for cross-cultural assessment. Mahwah, NJ: Lawrence Erlbaum Associates, Inc; 2009.
Matson JL, Matheis M, Burns CO, Esposito G, Venuti P, Pisula E, et al. Examining cross-cultural differences in autism spectrum disorder: a multinational comparison from Greece, Italy, Japan, Poland, and the United States. Eur Psychiatry. 2017;42:70–6.
Pacífico MC, de Paula CS, Namur VS, Lowenthal R, Bosa CA, Teixeira M. Preliminary evidence of the validity process of the Autism Diagnostic Observation Schedule (ADOS): translation, cross-cultural adaptation and semantic equivalence of the Brazilian Portuguese version. Trends Psychiatry Psychother. 2019;41(3):218–26.
Gotham K, Risi S, Pickles A, Lord C. The autism diagnostic observation schedule: revised algorithms for improved diagnostic validity. J Autism Dev Disord. 2007;37(4):613–27.
Risi S, Lord C, Gotham K, Corsello C, Chrysler C, Szatmari P, et al. Combining information from multiple sources in the diagnosis of autism spectrum disorders. J Am Acad Child Adolesc Psychiatry. 2006;45(9):1094–103.
Le Couteur A, Haden G, Hammal D, McConachie H. Diagnosing autism spectrum disorders in pre-school children using two standardised assessment instruments: the ADI-R and the ADOS. J Autism Dev Disord. 2008;38(2):362–72.
Yoo HJ, Kwak Y. Korean version of autism diagnostic observation schedule (ADOS). Seoul, Korea: Hakjisa; 2007.
Yoo HJ, Bong GY, Kwak YS, Lee MS, Cho SH, Kim BN. Korean autism diagnostic observation schedule-2 (K-ADOS-2). Seoul, Korea: Hakjisa; 2018.
Collin L, Bindra J, Raju M, Gillberg C, Minnis H. Facial emotion recognition in child psychiatry: a systematic review. Res Dev Disabil. 2013;34(5):1505–20.
Gjevik E, Eldevik S, Fjæran-Granum T, Sponheim E. Kiddie-SADS reveals high rates of DSM-IV disorders in children and adolescents with autism spectrum disorders. J Autism Dev Disord. 2011;41(6):761–9.
Langmann A, Becker J, Poustka L, Becker K, Kamp-Becker I. Diagnostic utility of the autism diagnostic observation schedule in a clinical sample of adolescents and adults. Res Autism Spectrum Disorders. 2017;34:34–43.
Sappok T, Diefenbacher A, Budczies J, Schade C, Grubich C, Bergmann T, et al. Diagnosing autism in a clinical sample of adults with intellectual disabilities: how useful are the ADOS and the ADI-R? Res Dev Disabil. 2013;34(5):1642–55.
Lombardo MV, Baron-Cohen S. The role of the self in mindblindness in autism. Conscious Cogn. 2011;20(1):130–40.
Lombardo MV, Barnes JL, Wheelwright SJ, Baron-Cohen S. Self-referential cognition and empathy in autism. PLOS ONE. 2007;2(9):e883.
Volkmar F. Vineland adaptive behavior scales. 2nd ed. New York, NY: Springer; 2013.
Wieland J, Zitman FG. It is time to bring borderline intellectual functioning back into the main fold of classification systems. BJPsych Bull. 2016;40(4):204–6.
Bruininks RH, Thurlow M, Gilman CJ. Adaptive behavior and mental retardation. J Spec Educ. 1987;21(1):69–88.
Schalock RLB-DS, Bradley VJ, Buntinx WHE, Coulter DL, Craig EM, et al. Intellectual disability: definition, classification, and systems of supports. 11th ed. Washington, DC: American Association on Intellectual and Developmental Disabilities; 2010.
Tassé MJ, Schalock RL, Balboni G, Bersani H Jr, Borthwick-Duffy SA, Spreat S, et al. The construct of adaptive behavior: its conceptualization, measurement, and use in the field of intellectual disability. Am J Intellect Dev Disabil. 2012;117(4):291–303.
Hus V, Lord C. The autism diagnostic observation schedule, module 4: revised algorithm and standardized severity scores. J Autism Dev Disord. 2014;44(8):1996–2012.
Esler AN, Bal VH, Guthrie W, Wetherby A, Ellis Weismer S, Lord C. The autism diagnostic observation schedule, toddler module: standardized severity scores. J Autism Dev Disord. 2015;45(9):2704–20.
Luyster R, Gotham K, Guthrie W, Coffing M, Petrak R, Pierce K, et al. The autism diagnostic observation schedule-toddler module: a new module of a standardized diagnostic measure for autism spectrum disorders. J Autism Dev Disord. 2009;39(9):1305–20.
Rutter M, Bailey A, Lord C. The social communication questionnaire: Manual. Los Angeles, CA: Western Psychological Services; 2003.
Kim JH, Sunwoo HJ, Park SB, Noh DH, Jung Y, Cho SC, et al. A validation study of the Korean version of social communication questionnaire. J Korean Acad Child Adolesc Psychiatry. 2015;26:197–208.
Constantino JN, Gruber CP. Social Responsiveness Scale (SRS-2). 2nd ed. Torrance, CA: Western Psychological Services; 2012.
Bölte S, Poustka F, Constantino JN. Assessing autistic traits: cross-cultural validation of the social responsiveness scale (SRS). Autism Res. 2008;1(6):354–63.
Constantino JN, Gruber CP. Social Responsiveness Scale (SRS). Los Angeles, CA: Western Psychological Services; 2005.
Chun J, Bong G, Han JH, Oh M, Yoo HJ. Validation of social responsiveness scale for Korean preschool children with autism. Psychiatry Investig. 2021;18(9):831–40.
Kim T, Park R. Korean version of childhood autism rating scale. Seoul: Special Education; 1995.
Schopler E, Reichler RJ, Renner BR. The childhood autism rating scale (CARS). Los Angeles, CA: Western Psychological Services; 1988.
Shin MS, Kim YH. Standardization study for the Korean version of the Childhood Autism Rating Scale: reliability, validity and cut-off score. Korean J Clin Psychol. 1998;17:1–15.
Kwon HJ, Yoo HJ, Kim JH, Noh DH, Sunwoo HJ, Jeon YS, et al. Re-adjusting the cut-off score of the Korean version of the Childhood Autism Rating Scale for high-functioning individuals with autism spectrum disorder. Psychiatry Clin Neurosci. 2017;71(10):725–32.
Wechsler D. Wechsler preschool and primary scale of intelligence—third edition. San Antonio, TX: Psychological Corporation; 2002.
Wechsler D. Wechsler intelligence scale for children-Fourth Edition. San Antonio, TX: Psychological Corporation; 2003.
Wechsler D. Wechsler adult intelligence scale-revised (WAIS-R). New York: Psychological Corporation; 1981.
Hwang S, Kim JH, Hong S. Korean Vineland adaptive behavior scales-II. Daegu: Korea Psychology Co; 2014.
Na YAHS, Hong SH, Kim JH. A preliminary study for the standardization of the Korean version of Vineland Adaptive Behavior Scales revised: a comparison between survey interview forms and parent/caregiver rating forms. Korean J Clin Psychol. 2015;34(2):375–90.
Kim S, Kim O. Korean Vineland Social Maturity Scale. Seoul, Korea: Chung-Ang Jeokseong Press; 2002.
Doll EA. The measurement of social competence: a manual for the Vineland Social Maturity Scale: Educational Test Bureau Educational Publishers; 1953.
Beery K. The Beery-Buktenica developmental test of visual-motor integration: VMI with supplemental developmental tests of visual perception and motor coordination: administration, scoring and teaching manual. Parsippany, NJ: Modern Curriculum Press; 1997.
Leiter RG. Leiter international performance scale, instruction manual. Chicago: Stoelting; 1980.
Gotham K, Pickles A, Lord C. Standardizing ADOS scores for a measure of severity in autism spectrum disorders. J Autism Dev Disord. 2009;39(5):693–705.
McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb). 2012;22(3):276–82.
Oosterling I, Roos S, de Bildt A, Rommelse N, de Jonge M, Visser J, et al. Improved diagnostic validity of the ADOS revised algorithms: a replication study in an independent sample. J Autism Dev Disord. 2010;40(6):689–703.
Berument SK, Rutter M, Lord C, Pickles A, Bailey A. Autism screening questionnaire: diagnostic validity. Br J Psychiatry. 1999;175:444–51.
Dow D, Day TN, Kutta TJ, Nottke C, Wetherby AM. Screening for autism spectrum disorder in a naturalistic home setting using the systematic observation of red flags (SORF) at 18–24 months. Autism Res. 2020;13(1):122–33.
Nilsson Jobs E, Bölte S, Falck-Ytter T. Preschool staff spot social communication difficulties, but not restricted and repetitive behaviors in young autistic children. J Autism Dev Disord. 2019;49(5):1928–36.
Stronach S, Wetherby AM. Examining restricted and repetitive behaviors in young children with autism spectrum disorder during two observational contexts. Autism. 2014;18(2):127–36.
Volkmar FR, Lord C, Bailey A, Schultz RT, Klin A. Autism and pervasive developmental disorders. J Child Psychol Psychiatry. 2004;45(1):135–70.
Kim SY, Kim YA, Song D-Y, Bong G, Kim J-M, Kim JH, et al. State and trait anxiety of adolescents with autism spectrum disorders. Psychiatry Investig. 2021;18(3):257–65.
Oh M, Song DY, Bong G, Yoon NH, Kim SY, Kim JH, et al. Validating the autism diagnostic interview-revised in the Korean population. Psychiatry Investig. 2021;18(3):196–204.
Molloy CA, Murray DS, Akers R, Mitchell T, Manning-Courtney P. Use of the Autism Diagnostic Observation Schedule (ADOS) in a clinical setting. Autism. 2011;15(2):143–62.
Zander E, Sturm H, Bölte S. The added value of the combined use of the autism diagnostic interview-revised and the Autism Diagnostic Observation Schedule: diagnostic validity in a clinical Swedish sample of toddlers and young preschoolers. Autism. 2015;19(2):187–99.
Kreiser NL, White SW. ASD in females: Are we overstating the gender difference in diagnosis? Clin Child Fam Psychol Rev. 2014;17(1):67–84.


Acknowledgements

Not applicable.

Funding

This work was supported by the Original Technology Research Program for Brain Science of the NRF, funded by the Korean government, MSIT (NRF-2017M3C7A1027467) and MIST (NRF-2021M3E5D9021878).

Author information

So Yoon Kim and Miae Oh share first authorship.

Authors and Affiliations

Teacher Education, Duksung Women’s University, Seoul, South Korea
So Yoon Kim
Department of Psychiatry, Kyung Hee University Hospital, Seoul, South Korea
Miae Oh
Department of Psychiatry, Seoul National University Bundang Hospital, Seoul National University College of Medicine, 300 Gumi-ro, Bundang-gu, Seongnam, Gyeonggi, 463-707, South Korea
Guiyoung Bong, Da-Yea Song, Joo Hyun Kim & Hee Jeong Yoo
Division of Social Welfare and Health Administration, Wonkwang University, Iksan, South Korea
Nan-He Yoon
Seoul National University College of Medicine, Seoul, South Korea
Hee Jeong Yoo

Contributions

HJY contributed to conceptualization, supervision, and funding acquisition. GB, NHY, JHK, and DS were involved in data curation. SYK and MO contributed to formal analysis and investigation. SYK, MO, and HJY were involved in methodology. GB and JHK contributed to project administration. SYK, MO, and DS were involved in writing. All authors approved the final version of the submitted manuscript.

Corresponding author

Correspondence to Hee Jeong Yoo.

Ethics declarations

Ethics approval and consent to participate

The study procedure including informed consent, recruitment, and participation procedures was approved by the Institutional Review Board (IRB) of Seoul National University Bundang Hospital (IRB no. B-2110-716-102).

Consent for publication

We did not include any individual person’s data in any form.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Table S1. Participant Characteristics by Developmental Cell. Table S2. Characteristics of Participants with Other Developmental (OD) Disabilities. Table S3. Sensitivity, Specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), AUC, and Cohen’s Kappa value of Toddler Module based on the Mild-Moderate Concern Range of Esler et al. (2015). Table S4. Sensitivity, Specificity, AUC, PPV, NPV, and Cohen’s Kappa Between ASD and non-ASD Based on Autism Cut-off Criteria for Modules 1-4 and Moderate-Severe Concern Range for Toddler Module. Table S5. Sensitivity, Specificity, AUC, PPV, NPV, and Cohen’s Kappa Between ASD and OD Based on Autism Cut-off Criteria for Modules 1-4 and Moderate-Severe Concern Range for Toddler Module. Table S6. Agreement with Existing Instrument Based on Autism Cut-off Criteria for Modules 1-4 and Moderate-Severe Concern Range for Toddler Module.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.


About this article

Cite this article

Kim, S.Y., Oh, M., Bong, G. et al. Diagnostic validity of Autism Diagnostic Observation Schedule, second edition (K-ADOS-2) in the Korean population. Molecular Autism 13, 30 (2022). https://doi.org/10.1186/s13229-022-00506-5


Received: 31 January 2022
Accepted: 31 May 2022
Published: 30 June 2022
DOI: https://doi.org/10.1186/s13229-022-00506-5
