1 Department of Computer Science, Maharishi International University, 1000 North Fourth St., Fairfield, Iowa 52557, USA
2 Department of Special Education and Counseling, California State University, Los Angeles ,State University Drive Los Angeles, CA 90032, USA
3 Department of Information and Communication Technology, Islamic University, Kushtia-7003, Bangladesh
4 Department of Computer Science and Engineering, Daffodil International University, Birulia, Savar, Dhaka-1216
Autism Spectrum Disorder (ASD) is a neurodevelopment disorder that impacts 1 in 36 children in the United States. Therefore, early identification plays a pivotal role. Nonetheless, such populations mentioned may take late and wrong screenings, thus further amplifying disparities. This study aims to create a screening model based on AI with multisource data: EHR, developmental diagnostic checklists, and wearing device behavioral data to improve the early diagnosis of ASD. Based on a cohort of more than 20,000 children, this work showcases how AI can enhance the diagnostics of conditions and lessen the time to diagnosis and bias in children’s care (Alzakari et al.,2024). By integrating both sources, the overall accuracy was 91%, thus making the model better than single-source models, and there are encouraging prospects for the large-scale deployment of the proposed system in developing/least developed countries. Potential solutions and recommendations regarding AI applications in pediatric healthcare, as well as several ethical considerations and upcoming issues concerning scalability, are also examined.
DOI: https://doi.org/10.63471/praihi24003 @ 2024 C5K Research Publishing
ASD is a complex developmental disorder in which people have difficulties in social interaction, communication, and play or behavioral rigidity. ASD is on the rise, especially in the developed world, and according to the CDC, approximately one in every thirty-six children are impacted (Hodges et al., 2020). Timely detection is vital in offering proper solutions that enable children to attain developmental milestones like oral and motor development, interaction skills, and educational performance. However, the diagnosis of ASD is still cumbersome, especially in the underresourced areas where access to services specialty within ASD is a challenge. ASD affects every person differently, from mild to severe, thus the different symptoms. Currently, the diagnostic approaches typically include observing the child’s behavior and using tests scored per the existing checklists, which are the Modified Checklist for Autism in Toddlers (MCHAT) and Autism Diagnostic Observation Schedule (ADOS) (Zheng et al., 2024). These tools, as much as they are helpful in ideal environments, are very tedious to use, are very time-consuming, and need professionals to administer them. The symptoms of the condition are diverse, and affected individuals may not receive accurate diagnoses from doctors due to differences in experience and approach; this is because children in ethnic minority and low-income families are likely to be diagnosed late because of other barriers to access to care.
The current state of ASD diagnosis presents several systemic and procedural challenges:
The use of Artificial Intelligence (AI) provides a novel approach to the problems encountered in conventional ASD screeners. Big data analysis using AI is best suited to compare it with other data sets and make predictions for diagnostic purposes, which can complement traditional diagnostic models (Shahamiri & Thabtah, 2020). Then, the development of AI tools that can better capture a child’s IDP can be improved through the use of a range of data sources, leading to earlier identification of ASD (Wankhede et al., 2024).
Perhaps one of the biggest strengths of using AI-driven ASD screening tools is that they can help fill these existing gaps in access to healthcare services. Children from rural areas, low-income families, and children from minority groups have insufficient access to diagnosis and care (Song et al., 2019). Geographical access, financial barriers, and cultural attitudes about neurodevelopmental disorders are such barriers. AI tools can mitigate these challenges through:
The main aim of this current study is to design and test an AI-based screening tool for early diagnosis of ASD while promoting equity. This involves:
Importantly, this study yields findings that have implications for the increasing application of artificial intelligence (AI) in the diagnosis of Autism Spectrum Disorder (ASD). This is why the study demonstrates how combining information from EHRs, behavioral metrics data, and data from wearable devices makes for better AI models. Such developments are critical in capturing the heterogeneity of ASD, which can present in very different ways in different people. Furthermore, the study is meant to satisfy the demands of the present and future generations, as well as of the underserved population, by filling the gap with cheap and effective screening instruments. Some children from lowresource backgrounds face many challenges to early and accurate ASD (Alzakari et al., 2025). As we explore the power of AI in the diagnostic process in this research, we see that it also offers a path to rectify such disparities so that early intervention is available to all. The work also proposes guidelines for the development of ethical artificial intelligence (AI), including openness, de-bias, and data protection. This shows the need to make sure that the AI models being utilized do not contain any built-in biases that will effectively negate the minority. The study of algorithmic fairness and strong data protection measures opens the way to safe AI applications that will be trusted by clinicians, families, and the community.
The study also adhered to national regulations, such as the Health Insurance Portability and Accountability Act (HIPPA) in the USA and the General Data Protection Regulation (GDPR) in the EU, beyond these privacy measures. However, these frameworks provided the legal and ethical requirements for processing this type of health data sensitively.
The research team wanted to make sure that the training data set was balanced by gender, socioeconomic status, ethnicity, and geographic location because they understood that algorithms themselves could also be unfair to some groups of children. They did this to prevent the model from predicting that some people are less capable than others because the model has yet to see many samples of this type of person. Furthermore, small-scale fairness checks were performed at regular intervals during the study to assess the model’s effectiveness and to monitor bias that could arise in reallife scenarios. The focus of the study is building an ethically designed and inclusive AI diagnostic tool, which this approach demonstrates.
Finally, the study was based on Wearable Devices, which allowed the collection of behavioral and physiological indices in real time. These were sleep, social engagement, and movement, and all of them were monitored through smartwatches and other wearable technologies. Wearable devices enabled the collection of data on the child’s daily activities and physiological changes in the child to indicate that there was an ASD before it could be diagnosed in a clinical setting.
To harmonize the diverse data sources, preprocessing steps included:
A multiple model integrated the results of each singlesource model with wearable device data to achieve the best prediction. This approach adopted stacking as the ensemble learning strategy to aggregate the decisions from multiple models into one decision-making system.
The dataset was divided into a training set (70%), a validation set (15%), and a test set (15%). Crossvalidation made the model valid for other cohorts as well.
Model performance was assessed using the following metrics:
The single-source models (EHR, behavioral, wearable) were compared to the integrated model in order to determine whether data fusion provides any benefit.
This research was conducted with a strict emphasis on ethical considerations, specifically data privacy, equity, and fairness in algorithms. In order to prevent the exposure of patients’ health information, all data collected in the present study were de-identified so as to eliminate the possibility of re-identification of patients. Access to the data was restricted to only personnel who were allowed to work on it, and the records were encrypted at all times. The study also adhered to national regulations such as the Health Insurance Portability and Accountability Act (HIPPA) in the USA and the General Data Protection Regulation (GDPR) in the EU, beyond these privacy measures. However, these frameworks provided the legal and ethical requirements for processing this type of health data sensitively. The research team wanted to make sure that the training data set was balanced by gender, socioeconomic status, ethnicity, and geographic location because they understood that algorithms themselves could also be unfair to some groups of children. They did this to prevent the model from predicting that some people are less capable than others because the model has yet to see many samples of this type of people. Furthermore, small-scale fairness checks were performed at regular intervals during the study to assess the model’s effectiveness and to monitor bias that could arise in reallife scenarios. The focus of the study is building an ethically designed and inclusive AI diagnostic tool, which this approach demonstrates.
The integrated model was found to have better results in identifying ASD than the single-source models. Performance metrics across all models are summarized below:
Table 1. A comparison of the performance of singlesource and integrated models.
The integrated model gave the highest accuracy of 91% and the AUC-ROC of 0.93, which is the best indication of the model’s capacity to detect genuine ASDs with few false positives
Fig. 1. A Comparison of Model Performance Metrics
A Line Chart showing the AUC-ROC values for each model is represented below
Fig. 2. AUC-ROC Across Models
The integrated model used various modes of data which gave a broad view of ASD risk factors.
A heatmap of feature importance highlights the top contributors across data types as shown in figure 3 below;
Fig. 3. Heat Map showing Feature Importance
Key observations include:
The EHR model was relatively accurate (82% accuracy, 0.84 AUC-ROC) using demographic and medical history data. However, it was not able to track behavior in detail due to the absence of real-time tracking.
The behavioral model was somewhat better than the EHR model, obtaining an accuracy of 86 percent and 0.88 AUC-ROC. Informative tools such as M-CHAT proved useful for determining social and communication difficulties but were not linked to physiological or previous data.
The proposed integrated model incorporated EHR, behavioral, and wearable data to create a holistic diagnostic model. Researchers found that its capacity to identify temporal structures (e.g., daily social interactions) and cross-sectional characteristics (e.g., developmental history) boosted predictive efficacy.
The integrated model also highlighted the possibility of deploying the model to underserved communities. By leveraging wearable devices and accessible behavioral assessments, the model addressed key barriers to care, including:
However, the key question is that to implement these solutions, several technological challenges must be solved, including the availability of affordable wearable devices and the ability of local healthcare providers to interpret AI outputs.
with some issues of deployment that need to be considered. One of the main challenges is that of the infrastructure demands that are made by such systems. The model, which could be complex and computationally intensive to implement, must be adapted to be implemented in primary care clinics or CHCs, where resources may be limited. This implies that the technology has to be developed to run on lowerend machines, and at the same time, the system should still be efficient in producing the right results. Also, capacity building is a very important activity. Specialists implementing AI in less developed healthcare facilities may lack sufficient experience to analyze AI’s elaborate results. Hence, meaningful clinician and healthcare worker training on how to use and integrate these AI tools into practice is critical for the model. Two more aspects influence scalability: explainability. Though the integrated model had the best predictive accuracy, its high dimensionality may limit its clinical use since clinicians may not have a good background in AI. In such cases, clinicians may not be willing to put their trust in a model that they cannot understand how it arrived at a decision. XAI techniques should be embedded into the model. Such methods, for example, feature attribution tools, can assist clinicians in understanding how the model reaches such conclusions and gain confidence in the AI’s advice. Explaining the reasoning of the AI would not only improve the understandability of the model but also the adoption of the model among physicians, which in turn would improve the implementation of the model into clinical practice.
However, the study also has some drawbacks that need to be taken into account. The first major limitation is that the dataset is mainly derived from urban healthcare centers. This shift may reduce the effectiveness of the generalization of the model to rural or low-resource areas where the population may be different in health care access, demography, and socio-economic status. As to the generalizability, it is proposed, in the subsequent research, to include data from as many environments as possible to ensure that the model works for everyone. In addition, getting real-time data from wearable devices to integrate into the system was also a concern. On several occasions, these devices generated partial or inconsistent data, and the imputation of missing data was essential to maintain the model’s accuracy. The issue of data quality and consistency in real-time monitoring is another important direction for further development, as these devices are critical to the success of the model, which monitors the child’s behavior at any given time.
ASD continues to be a major public health concern in the United States is still present, and early identification is crucial. This study showed how AI tools could revolutionize the screening of ASD and reduce healthcare inequalities. The proposed AI model that combines the EHR data and behavioral assessment and data from the wearable device showed high diagnostic accuracy (91%) and AUC-ROC (0.93) and outperformed the models that used a single data source.
The integrated model offers a cost-efficient model of service delivery in underserved settings, where there is often a scarcity of specialized diagnostic services. Using the real-time behavior and physiological data from the wearable devices, the model ensures early detection of high-risk cases to ensure that the children receive early interventions that improve their development. However, to enhance implementation success, some challenges need to be considered. Data privacy and algorithm fairness issues can be mitigated only by strong regulation, diverse data, and fairness checks. Scalability also presents many challenges that are especially difficult in LMICs because infrastructure and technology constraints need to be addressed. To stimulate the uptake of the concept across the board, simpler models and capacity-building projects are vital.
Further studies should aim at refining the model for particular populations to decrease the gap in the diagnostic even more, at improving the methods of realtime data feed integration to enable constant risk assessment, and at increasing the model interpretability with the help of Explainable AI (XAI). With regard to these aspects, it is possible to improve the use of AI techniques in the identification of ASD and the provision of equal opportunities for children’s healthcare for all communities. The present work is a valuable discussion of the opportunities for improving public health with the help of AI, especially in cases of neurodevelopmental disorders such as ASD. Therefore, leveraging gaps in diagnostic access and equity, AI is leading the future of healthcare and its efficient execution.
Alzakari, S. A., Allinjawi, A., Aldrees, A., Zamzami, N., Umer, M., Innab, N., & Ashraf, I. (2025). Early detection of autism spectrum disorder using explainable AI and optimized teaching strategies. Journal of Neuroscience Methods, 413, 110315. https://doi.org/https://doi.org/10.1016/j.jneumeth.2024.110315
Hodges, H., Fealko, C., & Soares, N. (2020). Autism spectrum disorder: definition, epidemiology, causes, and clinical evaluation. Transl Pediatr, 9(Suppl 1), S55-s65. https://doi.org/10.21037/tp.2019.09.09
Lenart, A., & Pasternak, J. (2023). Resources, Problems and Challenges of Autism Spectrum Disorder Diagnosis and Support System in Poland. J Autism Dev Disord, 53(4), 1629-1641. https://doi.org/10.1007/s10803-021-05142-1
Okoye, C., Obialo-Ibeawuchi, C. M., Obajeun, O. A., Sarwar, S., Tawfik, C., Waleed, M. S., Wasim, A. U., Mohamoud, I., Afolayan, A. Y., & Mbaezue, R. N. (2023). Early Diagnosis of Autism Spectrum Disorder: A Review and Analysis of the Risks and Benefits. Cureus, 15(8), e43226. https://doi.org/10.7759/cureus.43226
Shahamiri, S. R., & Thabtah, F. (2020). Autism AI: a New Autism Screening System Based on Artificial Intelligence. Cognitive Computation, 12(4), 766-777. https://doi.org/10.1007/s12559-020-09743-3
Song, D. Y., Kim, S. Y., Bong, G., Kim, J. M., & Yoo, H. J. (2019). The Use of Artificial Intelligence in Screening and Diagnosis of Autism Spectrum Disorder: A Literature Review. Soa Chongsonyon Chongsin Uihak, 30(4), 145- 152. https://doi.org/10.5765/jkacap.190027
Wankhede, N., Kale, M., Shukla, M., Nathiya, D., R, R., Kaur, P., Goyanka, B., Rahangdale, S., Taksande, B., Upaganlawar, A., Khalid, M., Chigurupati, S., Umekar, M., Kopalli, S. R., & Koppula, S. (2024). Leveraging AI for the diagnosis and treatment of autism spectrum disorder: Current trends and future prospects. Asian J Psychiatr, 101, 104241. https://doi.org/10.1016/j.ajp.2024.104241
Woolfenden, S., Sarkozy, V., Ridley, G., & Williams, K. (2012). A systematic review of the diagnostic stability of Autism Spectrum Disorder. Research in Autism Spectrum Disorders, 6(1), 345-354. https://doi.org/https://doi.org/10.1016/j.rasd.2011.06.008
Zheng, R. M., Chan, S. P., Law, E. C., Chong, S. C., & Aishworiya, R. (2024). Validity and feasibility of using the Modified Checklist for Autism in Toddlers, Revised with Follow-Up (M-CHAT-R/F) in primary care clinics in Singapore. Autism, 28(7), 1758-1771. https://doi.org/10.1177/13623613231205748