Oura Ring Gen 4 sensor data, not clinical measurementsN=1 case study, not validated for clinical decisionsHEV diagnosed Mar 18; Day 109 post-ruxolitinibMore
Consumer wearable data can support exploratory review only. The HEV diagnosis, temporally confounded with treatment start, remains a material confounder.

Anomaly Pattern Comparison

Module 5: Anomaly Signature Analysis
EXECUTIVE SUMMARY

Executive Summary

Patient 1 ANOMALY DAYS
In range
7/ 96
7.3% of days flagged
Patient 2 ANOMALY DAYS
In range
46/ 594
7.7% of days flagged
Patient 3 ANOMALY DAYS
In range
3/ 23
13.0% of days flagged
FINGERPRINT SIMILARITY
Watch
0.10cosine
1.0 = identical signatures
PCA AXIS SIMILARITY
Info
-0.43cosine
Dominant failure mode alignment

Patient 1 and Patient 2 show distinct anomaly signatures (cosine=0.10). Patient 1 has 7 anomaly days vs P2's 46. PCA axis similarity is -0.43, suggesting different dominant failure modes.

DETECTION HENRIK

Patient 1 (post-HSCT): Anomaly Detection Results

TOTAL DAYS
96
ENSEMBLE ANOMALIES
In range
7
Z-SCORE METHOD
5
5.2%
ISOLATION FOREST
10
10.4%
PERCENTILE METHOD
13
13.5%

Top 10 Anomaly Days

DateScoreAgreementTop Deviating MetricsCluster
2026-04-081.1043/3Active Cal (z=+5.0), Steps (z=+4.3), Activity Score (z=+2.9)-1
2026-04-070.9753/3Steps (z=+3.5), Active Cal (z=+2.7), Activity Score (z=+2.5)-1
2026-04-120.8392/3HRV Average (z=+3.5), Activity Score (z=+2.4), HRV Balance (z=+1.9)-1
2026-02-090.8153/3Readiness (z=-4.4), HR Average (z=+4.3), HR Lowest (z=+3.4)-1
2026-04-100.7933/3HRV Average (z=+3.5), Activity Score (z=+3.2), HR Lowest (z=-2.4)-1
2026-04-060.7783/3Steps (z=+4.1), Active Cal (z=+2.6), Readiness (z=-2.0)-1
2026-04-110.5952/3Activity Score (z=+3.1), Breath Rate (z=+2.2), HR Average (z=-1.3)-1
DETECTION MITCH

Patient 2 (post-Stroke): Anomaly Detection Results

TOTAL DAYS
594
ENSEMBLE ANOMALIES
In range
46
Z-SCORE METHOD
30
5.1%
ISOLATION FOREST
60
10.1%
PERCENTILE METHOD
81
13.6%

Top 10 Anomaly Days

DateScoreAgreementTop Deviating MetricsCluster
2022-05-041.6703/3Temp Delta (z=+21.1), Readiness (z=-6.0), Efficiency (z=-4.7)-1
2021-12-211.4093/3Efficiency (z=-6.2), Sleep Score (z=-5.5), Total Sleep (z=-5.1)-1
2021-09-181.2553/3Readiness (z=-4.4), HRV Balance (z=-4.1), HR Lowest (z=+4.1)-1
2021-12-201.1653/3Sleep Score (z=-5.1), Efficiency (z=-5.0), Total Sleep (z=-4.8)-1
2026-03-281.1653/3Sleep Score (z=-4.9), Total Sleep (z=-4.6), Efficiency (z=-4.4)-1
2021-11-170.9753/3HR Lowest (z=+4.6), HR Average (z=+3.5), Readiness (z=-3.5)-1
2021-10-240.9523/3Readiness (z=-3.2), HR Lowest (z=+2.6), Sleep Balance (z=-2.4)-1
2021-09-170.9503/3Breath Rate (z=-4.7), Readiness (z=-3.5), HR Lowest (z=+2.8)-1
2021-11-210.9473/3HRV Balance (z=-4.4), Readiness (z=-3.1), HR Average (z=+2.6)-1
2021-11-200.8333/3HRV Balance (z=-3.2), Steps (z=+2.5), Readiness (z=-2.2)-1
DETECTION WENCHE

Patient 3 (Healthy Control): Anomaly Detection Results

TOTAL DAYS
23
ENSEMBLE ANOMALIES
In range
3
Z-SCORE METHOD
1
4.3%
ISOLATION FOREST
3
13.0%
PERCENTILE METHOD
9
39.1%

Top 10 Anomaly Days

DateScoreAgreementTop Deviating MetricsCluster
2026-04-090.9163/3HR Lowest (z=+3.1), Readiness (z=-2.8), HR Average (z=+2.6)N/A
2026-04-070.7822/3Activity Score (z=-2.4), Efficiency (z=-2.0), Sleep Score (z=-1.7)N/A
2026-04-030.7342/3Deep Sleep (z=+2.2), HRV Average (z=+2.2), HR Average (z=-1.5)N/A
ANOMALY TIMELINE

Anomaly Timeline

FINGERPRINT COMPARISON

Anomaly Fingerprint Comparison

CLUSTER ANALYSIS

Cluster Analysis

METRIC COMPARISON

Metric Severity Comparison

CLINICAL IMPLICATIONS

Clinical Implications

Anomaly Signature Differences

Patient 1 (post-HSCT) typical bad day: Activity Score (z=+2.41), Steps (z=+1.67), Active Cal (z=+1.41), Readiness (z=-0.98), HRV Average (z=+0.76)

Patient 2 (post-Stroke) typical bad day: Readiness (z=-1.40), Total Sleep (z=-1.01), HR Lowest (z=+0.94), Sleep Score (z=-0.94), HR Average (z=+0.86)

Patient 3 (Healthy Control) typical bad day: Deep Sleep (z=+1.56), HR Lowest (z=+1.07), Breath Rate (z=+1.04), Active Cal (z=-0.69), Readiness (z=-0.64)

Cluster Patterns

Patient 3 (Healthy Control): Insufficient anomaly days for clustering.

Severity Comparison

Anomaly severity distributions are significantly different (Mann-Whitney U, p=p=0.049).

Key Takeaways

  • The patients have distinct anomaly signatures, reflecting their different underlying conditions (HSCT vs stroke recovery).
  • Z-score normalization removes raw-value bias, enabling meaningful comparison of 'how bad is bad' relative to each patient's baseline.
  • Ensemble methods reduce false positives: only days flagged by multiple methods are considered truly anomalous.
METHODOLOGY

Methodology & Limitations

Anomaly Detection Pipeline

  1. Data Collection: 17 metrics from Oura Ring across sleep, readiness, and activity domains.
  2. Z-Score Normalization: Each metric normalized per patient (z = (x - mean) / std). This removes raw-value differences (e.g., HRV 9ms vs 43ms) and focuses on relative deviation.
  3. Three Detection Methods:
    • Z-Score Threshold: Day anomalous if 3+ metrics exceed |z| > 2.0 or composite magnitude > 2.0
    • Isolation Forest: Unsupervised tree-based method (contamination=0.1, n_estimators=200)
    • Percentile-Based: Flag values outside 5th/95th percentile, anomalous if 3+ metrics extreme
  4. Ensemble: Day is anomaly if 2+ of 3 methods agree. Weighted score: z-score (0.35) + Isolation Forest (0.40) + percentile (0.25).
  5. Fingerprinting: Mean z-vector of anomaly days reveals the 'typical bad day' signature.
  6. PCA: Reduces dimensionality; PC1 loadings reveal the dominant failure mode.
  7. DBSCAN Clustering: Groups anomaly days by similarity (eps=2.0, min_samples=2). Clusters labeled by dominant metric group.

Limitations

  • Patient 1 has ~81 days of data vs P2's ~518 days. Small sample size limits clustering reliability.
  • Oura Ring consumer-grade sensors; not clinical-grade measurement.
  • Temperature delta may show seasonal drift in P2's longer dataset.
  • Isolation Forest contamination rate (10%) is assumed, not empirically derived.
  • Cosine similarity treats all metrics equally; clinical weighting may differ.