Abstract
Atrial fibrillation (AF) is the most common sustained arrhythmia worldwide and a major contributor to stroke and cardiovascular morbidity. Conventional diagnostics, including electrocardiograms (ECG) and ambulatory monitors, are limited by accessibility and short monitoring windows, particularly in asymptomatic or paroxysmal AF. Recently, artificial intelligence (AI)-driven technologies and wearable devices have emerged as promising tools for early AF detection.BACKGROUNDAtrial fibrillation (AF) is the most common sustained arrhythmia worldwide and a major contributor to stroke and cardiovascular morbidity. Conventional diagnostics, including electrocardiograms (ECG) and ambulatory monitors, are limited by accessibility and short monitoring windows, particularly in asymptomatic or paroxysmal AF. Recently, artificial intelligence (AI)-driven technologies and wearable devices have emerged as promising tools for early AF detection.This systematic review evaluates the diagnostic performance of AI-enabled and wearable technologies for AF detection in clinical and real-world settings.OBJECTIVEThis systematic review evaluates the diagnostic performance of AI-enabled and wearable technologies for AF detection in clinical and real-world settings.A comprehensive search of PubMed, Scopus, and Web of Science identified studies reporting sensitivity, specificity, predictive values, and AUC of digital AF detection tools. Platforms included smartphone apps, smartwatches, photoplethysmography (PPG), single- and multi-lead ECGs, and machine learning algorithms. Data extraction and quality assessment used QUADAS-2. Random-effects meta-analyses and subgroup analyses synthesized findings and explored heterogeneity.METHODSA comprehensive search of PubMed, Scopus, and Web of Science identified studies reporting sensitivity, specificity, predictive values, and AUC of digital AF detection tools. Platforms included smartphone apps, smartwatches, photoplethysmography (PPG), single- and multi-lead ECGs, and machine learning algorithms. Data extraction and quality assessment used QUADAS-2. Random-effects meta-analyses and subgroup analyses synthesized findings and explored heterogeneity.Twenty-four studies met inclusion criteria. High-performing tools demonstrated sensitivity and specificity ≥94% and AUC ≥0.95, while consumer-grade devices, such as the Apple Heart Study, showed lower specificity (46%) and positive predictive value (7.6%), reflecting frequent false positives. Heterogeneity arose from device type, signal method (PPG vs. ECG), algorithm design, and population characteristics. Tools integrating explainable AI and multi-modal data generally outperformed simpler models.RESULTSTwenty-four studies met inclusion criteria. High-performing tools demonstrated sensitivity and specificity ≥94% and AUC ≥0.95, while consumer-grade devices, such as the Apple Heart Study, showed lower specificity (46%) and positive predictive value (7.6%), reflecting frequent false positives. Heterogeneity arose from device type, signal method (PPG vs. ECG), algorithm design, and population characteristics. Tools integrating explainable AI and multi-modal data generally outperformed simpler models.AI-enhanced and wearable technologies show strong potential for accurate AF detection under specific conditions. Performance variability, especially in consumer-grade devices, underscores the need for external validation, algorithm refinement, and clinical integration. Future research should assess real-world effectiveness, cost-efficiency, explainability, and long-term outcomes to support broader adoption.CONCLUSIONSAI-enhanced and wearable technologies show strong potential for accurate AF detection under specific conditions. Performance variability, especially in consumer-grade devices, underscores the need for external validation, algorithm refinement, and clinical integration. Future research should assess real-world effectiveness, cost-efficiency, explainability, and long-term outcomes to support broader adoption.