Aqrab is an AI-powered methodology critique engine for clinical researchers. It analyzes study protocols and detects specific biases like immortal time bias, healthy user confounding, and missing sensitivity analyses — providing named issues with actionable fixes.

How is Aqrab different from ChatGPT for research?

Generic AI gives generic advice like "consider confounding." Aqrab identifies specific biases by name (immortal time bias, competing risks, positivity violations), explains the mechanism, and provides concrete fixes. Built by researchers from Mount Sinai, Johns Hopkins, and King Abdulaziz University with 20+ years of causal inference expertise.

Yes — Aqrab offers 3 free critiques with no credit card required. The Pro plan at $99/month includes unlimited critiques, RAG-powered literature analysis, and exportable reports.

What types of studies can Aqrab critique?

Aqrab critiques observational studies (cohort, case-control, cross-sectional), clinical trials (RCTs, adaptive designs), and epidemiological analyses using methods like propensity score matching, instrumental variables, difference-in-differences, and target trial emulation.

Target Trial Emulation: A Practical Guide for Clinical Researchers

Every observational study that claims causal effects is — whether the authors know it or not — trying to emulate a randomized trial. Target Trial Emulation (TTE) makes that implicit logic explicit, disciplined, and auditable. Instead of asking "what statistical method should I use?", you start by asking: "what RCT would I run if I could?"

Why Target Trial Emulation?

Most observational studies fail not because of bad statistics but because of bad design. Immortal time bias, prevalent user bias, selection bias from conditioning on future events — these are design problems dressed up as data analysis.

Hernán and Robins formalized TTE as a framework that forces you to specify exactly what trial you're emulating before touching the data. The result: many of the biases that plague observational research become visible — and fixable — at the design stage.

The Core Principle:

If your observational analysis doesn't correspond to a well-defined randomized trial, your causal claims are meaningless — no matter how sophisticated the statistics.

The Seven Protocol Components

Every target trial emulation must specify seven components, mirroring the protocol of the hypothetical RCT you wish existed:

Component	What You Specify	Common Mistake
1. Eligibility	Who would be enrolled	Using future information
2. Treatment strategies	What interventions to compare	Vague or overlapping definitions
3. Treatment assignment	How assignment happens	Ignoring confounding at baseline
4. Time zero	When follow-up starts	Misaligned eligibility, assignment, and follow-up
5. Outcome	What you measure and when	Outcome windows that vary by group
6. Causal contrast	ITT, per-protocol, or as-treated	Per-protocol without IP weighting
7. Analysis plan	Statistical approach	Choosing methods before specifying the trial

This isn't bureaucratic overhead — it's the entire analytical strategy. Get these seven components right and the "methods" section almost writes itself.

Time Zero: Where Most Studies Go Wrong

Time zero is the single most important concept in TTE. In a real trial, three things happen simultaneously on enrollment day:

Eligibility is confirmed
Treatment is assigned
Follow-up begins

In observational data, these three events are often misaligned. A patient might meet eligibility criteria in January, start treatment in March, and have "follow-up" begin whenever the researcher decides. This misalignment is the root cause of immortal time bias, prevalent user bias, and a host of other design flaws.

⚠️ The Immortal Time Trap:

If you define the "treated" group as people who eventually received treatment, then the time between eligibility and treatment start is immortal — they had to survive to receive it. This creates a guaranteed bias in favor of treatment. TTE eliminates this by aligning treatment assignment with time zero.

Example: Statins and Cancer Risk

A classic example: you want to know if statins reduce cancer risk. A naive approach identifies "statin users" and "non-users" from pharmacy records and compares cancer rates. But someone classified as a "statin user" might have started statins 5 years after baseline — meaning they survived 5 years cancer-free just to enter the treatment group.

The TTE approach: at each time point where a patient could initiate statins, ask "does this person meet eligibility criteria right now?" If yes, they enter the emulated trial at that time zero. Treatment strategy is "initiate statins now" vs. "don't initiate (yet)." Follow-up begins immediately.

Cloning, Censoring, and Weighting

In practice, TTE often uses a technique called sequential trial emulation (or "cloning"). At each eligible time point:

Create a clone of the patient assigned to the treatment strategy
Create another clone assigned to the comparator strategy
Censor a clone when their observed behavior deviates from the assigned strategy
Use inverse probability of censoring weights to adjust for informative censoring

This might sound strange — cloning patients? — but it's the observational equivalent of randomization at each eligibility window. The key insight: every patient could have been assigned to either strategy at time zero, just like in a trial.

Why Cloning Works:

At time zero, before treatment assignment, the clones are identical (same covariates, same history). They only diverge when one "follows" the treatment strategy and the other doesn't. Censoring handles the divergence; weighting handles the selection bias from censoring.

Intention-to-Treat vs. Per-Protocol

Just like in real RCTs, TTE distinguishes between:

Intention-to-treat (ITT): Compare groups based on initial assignment, regardless of adherence. In TTE, this is the simplest — assign at time zero, follow everyone, no censoring needed.
Per-protocol: Compare groups who actually followed their assigned strategy. This requires censoring non-adherent individuals and using IP weights to handle the selection bias that censoring introduces.
As-treated: Compare based on actual treatment received. Prone to confounding — generally not recommended without careful adjustment.

The per-protocol analysis is where TTE shines. In standard observational studies, "per-protocol" analyses are a minefield of biases. TTE provides a principled framework: censor when behavior deviates from protocol, weight to adjust for informative censoring.

Sustained Treatment Strategies

Many clinical questions involve sustained strategies: "initiate and continue treatment for at least 2 years" vs. "never initiate." These are harder to emulate because adherence can change over time.

The approach: at each follow-up visit, check if the patient is still following their assigned strategy. If not, censor them at that point. Then model the probability of remaining uncensored (adherent) and apply inverse probability weights.

Grace Periods: A Practical Necessity

Real-world treatment isn't perfectly continuous. Patients miss doses, refill prescriptions late, or take drug holidays. TTE protocols should specify grace periods — allowable gaps before treatment is considered discontinued. A 30-day gap for chronic medications is common. Without grace periods, you'll censor too aggressively and lose statistical power (and generalizability).

When TTE Works Best — and When It Doesn't

Ideal Settings

Large administrative/claims databases with clear treatment initiation dates (MIMIC-IV, Medicare, CPRD)
Questions where an RCT is infeasible — too expensive, too slow, or ethically impossible
Treatment strategy questions — when to start, which drug to use, how long to continue
Policy evaluation — new guidelines, formulary changes, screening programs

Poor Settings

Cross-sectional data — TTE needs longitudinal follow-up
Unmeasured confounders you can't address — TTE doesn't magically solve unmeasured confounding (use IV methods or sensitivity analysis)
Ill-defined interventions — "healthy lifestyle" is not a treatment strategy; "initiate metformin within 30 days of T2DM diagnosis" is
Sparse data near time zero — if very few patients meet eligibility at any given window, power will be limited

Step-by-Step Implementation

Specify the target trial protocol

Write out all seven components as if submitting a trial registration. Be specific about eligibility windows, treatment definitions, and outcome measurement.

Map protocol to available data

For each protocol component, identify the data elements. Document where the emulation deviates from the ideal trial (it always will).

Define time zero and eligibility windows

Identify all time points where patients could enter the emulated trial. Ensure eligibility, assignment, and follow-up start align at each window.

Apply the clone-censor-weight approach

Clone patients into treatment arms at each eligible time zero. Censor when behavior deviates from assigned strategy. Fit censoring models and apply IP weights.

Estimate and validate

Estimate treatment effects using weighted survival analysis or pooled logistic regression. Validate with sensitivity analyses: vary eligibility criteria, grace periods, and weight models.

Common Pitfalls

❌ Misaligning time zero with eligibility

If eligibility is assessed in 2015 but follow-up starts in 2017, you've introduced immortal time bias — the very thing TTE is designed to prevent.

❌ Using future information for eligibility

"Patients who eventually received Drug X" is not an eligibility criterion. Eligibility must be based solely on information available at time zero.

❌ Ignoring treatment strategy precision

"Treatment vs. no treatment" is almost never enough. Specify: which drug, what dose, within what window, sustained for how long, with what grace period.

❌ Per-protocol without IP weighting

Censoring non-adherent patients introduces selection bias. Without inverse probability of censoring weights, your per-protocol analysis is biased by definition.

❌ Skipping the protocol table

If you can't fill out all seven components in a table, your study isn't ready to analyze. The protocol IS the design.

Reporting Checklist

Before submitting your manuscript, verify:

☐Target trial protocol table with all seven components — both ideal trial and emulation
☐Explicit definition of time zero and how eligibility/assignment/follow-up align
☐Treatment strategies defined precisely (drug, dose, duration, grace periods)
☐Deviations between target trial and emulation documented and justified
☐Causal contrast specified (ITT, per-protocol, or both)
☐If per-protocol: censoring mechanism and IP weight model described
☐Sensitivity analyses: varied eligibility criteria, grace periods, weight truncation
☐Discussion of unmeasured confounders not addressable by the design
☐DAG or causal diagram showing assumed relationships

TTE vs. Other Causal Methods

Method	Addresses	Requires
PSM/IPW	Measured confounding	No unmeasured confounders
IV	Unmeasured confounding	Valid instrument
DID	Group-level confounding	Parallel trends
RDD	Confounding near threshold	Sharp cutoff
TTE	Design-level biases (immortal time, prevalent user, selection)	Well-defined treatment strategies + longitudinal data

TTE is a design framework, not a statistical method. You can use PSM, IPW, or other methods within a TTE design.

Key References

Hernán MA, Robins JM. Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. Am J Epidemiol. 2016;183(8):758-764.
Hernán MA, Sauer BC, Hernández-Díaz S, et al. Specifying a target trial prevents immortal time bias and other self-inflicted injuries in observational analyses. J Clin Epidemiol. 2016;79:70-75.
Danaei G, Rodríguez LAG, Cantero OF, et al. Observational data for comparative effectiveness research: an emulation of randomised trials of statins and primary prevention of coronary heart disease. Stat Methods Med Res. 2013;22(1):70-96.
Dickerman BA, García-Albéniz X, Logan RW, et al. Avoidable flaws in observational analyses: an application to statins and cancer. Nat Med. 2019;25(10):1601-1606.
Hernán MA, Robins JM. Causal Inference: What If. Boca Raton: Chapman & Hall/CRC, 2020. Free online

Ready to critique your study design?

Aqrab checks your methodology for design flaws — including the ones TTE was built to prevent.

Try Aqrab Free →