Oura Ring Gen 4 sensor data, not clinical measurementsN=1 case study, not validated for clinical decisionsHEV diagnosed Mar 18; Day 111 post-ruxolitinibMore
Consumer wearable data can support exploratory review only. The HEV diagnosis, temporally confounded with treatment start, remains a material confounder.

Causal Inference: Ruxolitinib

Data period: 2026-01-08 to 2026-07-02 (178 days) | Total runtime: 0.5s
Four complementary causal analysis methods explore whether Oura biometrics shifted after ruxolitinib (10 mg BID, started 2026-03-16) on Oura Ring biometrics. Data period: 2026-01-08 to 2026-07-02 (178 days).
Pre-intervention
Info
67days
Post-intervention
111days
Raw p<0.05
Info
0/0
FDR-significant
None
0/0
Lowest raw p
Not significant
p=1.000
N/A | q=1.0000
Methods used
Info
4
CI + PCMCI+ + TE + Mediation
INDIV

0. Individual Metric Treatment Response

Method: Non-parametric Mann-Whitney U test compares pre- vs post-ruxolitinib distributions for each core biometric. Effect sizes reported as Cohen's d with bootstrap 95% CI (2000 iterations). These direct statistical tests provide intuitive per-metric significance before the multivariate causal methods below.
HRV (RMSSD)
Significant
p<0.001
d=+1.75 (large)
Lowest HR
Significant
p<0.001
d=-1.82 (large)
Average HR
Significant
p<0.001
d=-1.74 (large)
Sleep Efficiency
Significant
p<0.001
d=+0.66 (medium)
Metric Pre-Rux Mean Post-Rux Mean Change Cohen's d Mann-Whitney 95% CI (diff)
HRV (RMSSD)10.00 ms25.59 ms+15.59 ms+1.75 (large)Sig p<0.001[+13.46, +17.89]
Lowest HR76.72 bpm63.19 bpm-13.52 bpm-1.82 (large)Sig p<0.001[-15.60, -11.36]
Average HR85.17 bpm72.14 bpm-13.03 bpm-1.74 (large)Sig p<0.001[-15.33, -10.80]
Sleep Efficiency78.62 %81.92 %+3.30 %+0.66 (medium)Sig p<0.001[+1.94, +4.71]
Note: These per-metric tests complement the Bayesian CausalImpact analysis below. For full changepoint detection and multi-patient comparison, see the Comparative Treatment Response report.
CONFOUNDER

0b. Confounder Analysis: Beta-Blocker Separation

Method: Three-period analysis separating the post-treatment window into Jakavi-only (23 days) and Jakavi + beta-blocker (85 days). Mann-Whitney U tests isolate each drug's contribution. This addresses the key confounder question: does Ruxolitinib's effect stand independently of the beta-blocker added later?
Treatment Timeline
Pre-treatment: 67 days
Jakavi only: 2026-03-16 to 2026-04-07 (23 days)
Jakavi + BB: 2026-04-08 to present (85 days)
Test 1: Isolated Ruxolitinib Effect
Pre-treatment vs Jakavi-only period (beta-blocker confounder eliminated)
MetricPre MeanJakavi-only MeanChangeCohen's dp-value
HRV (RMSSD)10.0 ms (n=67)10.9 ms (n=22)+0.9 ms+0.40 (small)p=0.076
Lowest HR76.7 bpm (n=64)72.9 bpm (n=19)-3.8 bpm-0.77 (medium)p=0.0093
Average HR85.2 bpm (n=64)81.4 bpm (n=19)-3.7 bpm-0.66 (medium)p=0.0119
Sleep Efficiency78.6 % (n=64)79.9 % (n=19)+1.3 %+0.36 (small)p=0.102
Test 2: Marginal Beta-Blocker Effect
Jakavi-only vs Jakavi + beta-blocker (what BB adds on top)
MetricJakavi-only MeanJakavi+BB MeanChangep-value
HRV (RMSSD)10.9 ms (n=22)29.3 ms (n=87)+18.4 msp=0.0000
Lowest HR72.9 bpm (n=19)61.0 bpm (n=84)-11.9 bpmp=0.0000
Average HR81.4 bpm (n=19)70.0 bpm (n=84)-11.4 bpmp=0.0000
Sleep Efficiency79.9 % (n=19)82.4 % (n=84)+2.4 %p=0.0034
Key Findings
  • Ruxolitinib alone produces a statistically significant reduction in heart rate, independent of beta-blocker.
  • HRV improvement during Jakavi-only period does not reach significance โ€” the HRV signal strengthens after beta-blocker addition.
  • Beta-blocker addition produces a significant further HRV increase on top of Jakavi.
CI

1. CausalImpact - Bayesian Structural Time Series Analysis

Method: Bayesian Structural Time Series (BSTS) models pre-intervention dynamics and generates a counterfactual prediction for the post-period. The difference between actual and counterfactual estimates the causal effect, with full posterior uncertainty. MCMC: 5,000 iterations, weekly seasonal component (nseasons=7). Benjamini-Hochberg FDR correction for multiple testing.

FAILED CausalImpact package not installed. Install with: pip install pycausalimpact

STATPOWER

1a. Statistical Power &amp; Interpretation

Purpose: All 11 metrics sorted by statistical significance, with Benjamini-Hochberg corrected q-values for multiple testing.

No CausalImpact results available for statistical power analysis.

PLACEBO

1b. Placebo tests (intervention date falsification)

Method: CausalImpact is run with 3 random placebo dates in the pre-period on the 3 most significant metrics. Placebo dates should NOT show significant effects.

CausalImpact failed, skipping placebo

PCMCI

2. Granger Causality Network (PCMCI+)

Method: PCMCI+ (tigramite) tests for time-lagged causal relationships between biometric variables using partial correlation. Tau_max = 7 days.

Full period (0 days): No significant causal links found.

Pre-ruxolitinib (0 days): No significant causal links found.

TE

3. Transfer Entropy

Method: Transfer entropy quantifies directional information flow between biometric streams. Comparison of TE matrices for pre- and full period reveals changes in information coupling after ruxolitinib start.

Full period (177 days)

SourceTargetTE (bits)Net TE
REMpctRMSSD1.0323+0.2503
DeepDurRMSSD1.0208+0.3173
TempDevRMSSD1.0208+0.2483
REMpctAvgHR1.0193+0.2100
TotalSleepAvgHR1.0193+0.2416
DeepDurAvgHR1.0193+0.3158
TotalSleepRMSSD1.0093+0.2545
REMdurAvgHR1.0078+0.1645

Pre-ruxolitinib (67 days)

SourceTargetTE (bits)Net TE
REMdurRespRate0.5564+0.1891
TotalSleepRespRate0.5564+0.3064
DeepDurRespRate0.5564+0.2439
RMSSDmaxRespRate0.5564+0.3689
LowestHRRespRate0.5564+0.2127
AvgHRRespRate0.5564+0.1071
SpO2RespRate0.5564+0.1132
REMpctRespRate0.5252+0.0835

Change in information flow

Largest increase: DeepDur -> RMSSDmax (+0.7513 bits)

Largest decrease: REMdur -> REMdur (+0.0000 bits)

MEDIATION

4. Intervention Response Decomposition

Method: Linear mediation analysis (Baron-Kenny) decomposes total ruxolitinib effect into four mediating pathways. Bootstrap (2000 iterations) for confidence intervals.
+9.6
Total effect (readiness score)
64.8 -> 74.4
Pre -> Post average
p<0.001
Raw p-value Significant
PathwayMediator (pre->post)a (T->M) b (M->Y)Indirect effect [95% CI] % mediatedp-value
Direct cardiac
Ruxolitinib -> HR change -> Readiness
92.57 -> 82.90-0.905-0.578+0.5232
[+0.3571, +0.7215]
52.2%0.0000 Sig
Autonomic
Ruxolitinib -> HRV change -> Readiness
10.00 -> 25.88+1.357+0.598+0.8116
[+0.5930, +1.0329]
80.5%0.0000 Sig
Sleep-mediated
Ruxolitinib -> Sleep efficiency -> Readiness
78.62 -> 81.92+0.635+0.326+0.2066
[+0.0937, +0.3490]
19.8%0.0000 Sig
Inflammatory
Ruxolitinib -> Temperature deviation -> Readiness
0.06 -> 0.05-0.034-0.275+0.0094
[-0.0664, +0.1002]
0.9%0.8030 NS
CLINICAL

5. Clinical Interpretation

Executive summary

N/A
Strongest signal: p=1.000
0/0
Metrics trending in expected direction
111
Post-intervention days (Day 14 target)
  • N/A: strongest hypothesis-generating raw p-value signal (p=1.0000, q=1.0000). It does not survive FDR correction, so confirmation still depends on more post-treatment follow-up.
  • CausalImpact: 0 of 0 biometric streams show significant causal change (p < 0.05). After Benjamini-Hochberg FDR correction: 0 of 0 remain significant (q < 0.05).
  • Placebo validation: N/A - ?/? placebo tests reached significance. This keeps the result vulnerable to false positives rather than establishing a confirmed intervention effect.
  • PCMCI+: 0 significant time-lagged causal links identified in the biometric network
  • Mediation analysis: 3 of 4 mediating pathways show significant indirect effect

Limitations

  • Short post-period (111 days): All results are preliminary. Minimum 14-21 days of post-intervention data recommended for robust causal inference.
  • Confounders: Linear methods cannot capture non-linear interactions. Seasonal variation, activity level, and other medications are not controlled for.
  • Wearable data: Oura Ring is not a medical device. Measurements have inherent noise that can affect causal estimates.
  • Single patient: N=1 study without control group. Causality cannot be definitively established, but Bayesian posterior probability of effect provides a strength measure.
  • HEV diagnosis: HEV was diagnosed 2026-03-18 (2 days after ruxolitinib start). Hepatitis may confound biometric changes.

Recommendations

  1. Repeat analysis after 2-3 weeks of ruxolitinib treatment for robust causal inference
  2. Add HEV-related biomarkers (ALT, bilirubin) as time-varying covariates
  3. Consider synthetic control method when longer time series are available
  4. Combine with clinical endpoints (GVHD scoring, ferritin) for multimodal analysis