Logo image
Evaluating AI-Enabled Technologies for Atrial Fibrillation Detection: A Systematic Review of Diagnostic Performance
Journal article   Peer reviewed

Evaluating AI-Enabled Technologies for Atrial Fibrillation Detection: A Systematic Review of Diagnostic Performance

Maddison Weber, Mason Klisares, May Li-Jedras and Amjad Kabach
Critical pathways in cardiology
05/05/2026
PMID: 42082350

Abstract

Atrial fibrillation (AF) is the most common sustained arrhythmia worldwide and a major contributor to stroke and cardiovascular morbidity. Conventional diagnostics, including electrocardiograms (ECG) and ambulatory monitors, are limited by accessibility and short monitoring windows, particularly in asymptomatic or paroxysmal AF. Recently, artificial intelligence (AI)-driven technologies and wearable devices have emerged as promising tools for early AF detection.BACKGROUNDAtrial fibrillation (AF) is the most common sustained arrhythmia worldwide and a major contributor to stroke and cardiovascular morbidity. Conventional diagnostics, including electrocardiograms (ECG) and ambulatory monitors, are limited by accessibility and short monitoring windows, particularly in asymptomatic or paroxysmal AF. Recently, artificial intelligence (AI)-driven technologies and wearable devices have emerged as promising tools for early AF detection.This systematic review evaluates the diagnostic performance of AI-enabled and wearable technologies for AF detection in clinical and real-world settings.OBJECTIVEThis systematic review evaluates the diagnostic performance of AI-enabled and wearable technologies for AF detection in clinical and real-world settings.A comprehensive search of PubMed, Scopus, and Web of Science identified studies reporting sensitivity, specificity, predictive values, and AUC of digital AF detection tools. Platforms included smartphone apps, smartwatches, photoplethysmography (PPG), single- and multi-lead ECGs, and machine learning algorithms. Data extraction and quality assessment used QUADAS-2. Random-effects meta-analyses and subgroup analyses synthesized findings and explored heterogeneity.METHODSA comprehensive search of PubMed, Scopus, and Web of Science identified studies reporting sensitivity, specificity, predictive values, and AUC of digital AF detection tools. Platforms included smartphone apps, smartwatches, photoplethysmography (PPG), single- and multi-lead ECGs, and machine learning algorithms. Data extraction and quality assessment used QUADAS-2. Random-effects meta-analyses and subgroup analyses synthesized findings and explored heterogeneity.Twenty-four studies met inclusion criteria. High-performing tools demonstrated sensitivity and specificity ≥94% and AUC ≥0.95, while consumer-grade devices, such as the Apple Heart Study, showed lower specificity (46%) and positive predictive value (7.6%), reflecting frequent false positives. Heterogeneity arose from device type, signal method (PPG vs. ECG), algorithm design, and population characteristics. Tools integrating explainable AI and multi-modal data generally outperformed simpler models.RESULTSTwenty-four studies met inclusion criteria. High-performing tools demonstrated sensitivity and specificity ≥94% and AUC ≥0.95, while consumer-grade devices, such as the Apple Heart Study, showed lower specificity (46%) and positive predictive value (7.6%), reflecting frequent false positives. Heterogeneity arose from device type, signal method (PPG vs. ECG), algorithm design, and population characteristics. Tools integrating explainable AI and multi-modal data generally outperformed simpler models.AI-enhanced and wearable technologies show strong potential for accurate AF detection under specific conditions. Performance variability, especially in consumer-grade devices, underscores the need for external validation, algorithm refinement, and clinical integration. Future research should assess real-world effectiveness, cost-efficiency, explainability, and long-term outcomes to support broader adoption.CONCLUSIONSAI-enhanced and wearable technologies show strong potential for accurate AF detection under specific conditions. Performance variability, especially in consumer-grade devices, underscores the need for external validation, algorithm refinement, and clinical integration. Future research should assess real-world effectiveness, cost-efficiency, explainability, and long-term outcomes to support broader adoption.

Metrics

1 Record Views

Details

Logo image