Clinical Trials: Phases, Endpoints, and What the Data Means

Clinical trials run in three phases: Phase I tests safety in 20-80 patients, Phase II signals efficacy, Phase III confirms in 1,000+ for approval.

March 3, 2026

12 min read

3 interview questions

By Alexis Lentati

Introduction
The Three Phases
Phase I: Safety and Dosing (6-12 months, $4-10M)
Phase II: Efficacy Signal (1-3 years, $10-50M)
Phase III: Confirmation (2-4 years, $50-150M+ per trial)
Endpoints: What Gets Measured
Endpoint Selection Strategy
Key Statistical Concepts
Modern Trial Designs

Introduction
The Three Phases
Phase I: Safety and Dosing (6-12 months, $4-10M)
Phase II: Efficacy Signal (1-3 years, $10-50M)
Phase III: Confirmation (2-4 years, $50-150M+ per trial)
Endpoints: What Gets Measured
Endpoint Selection Strategy
Key Statistical Concepts
Modern Trial Designs

Introduction

Clinical trial data is the raw material of biotech valuation. Every value inflection point, every M&A decision, and every probability assumption in an rNPV model ultimately traces back to clinical trial results. Healthcare bankers do not need to be clinical scientists, but they must understand the structure of clinical development, the hierarchy of endpoints, the key statistical concepts that determine whether data supports approval, and how trial design choices affect timelines and costs. This article provides the complete framework that healthcare bankers use to evaluate clinical data and translate it into financial analysis.

The Three Phases

Clinical development follows a phased structure mandated by the FDA. Each phase increases in size, cost, and rigor, progressively building the evidence package required for approval. The total development timeline from first-in-human dosing to approval averages 8-12 years, though expedited pathways can compress this significantly for drugs addressing serious conditions with unmet medical need.

Phase I: Safety and Dosing (6-12 months, $4-10M)

Phase I trials enroll 20-80 healthy volunteers (or patients, in oncology where healthy volunteer dosing is unethical) to establish the drug's safety profile, appropriate dosing range, and pharmacokinetic properties (how the body absorbs, distributes, metabolizes, and excretes the drug).

The primary objectives of Phase I are: identifying the maximum tolerated dose (MTD), characterizing side effects and their dose-dependence, understanding the drug's pharmacokinetic profile (half-life, bioavailability, metabolism pathway), and selecting the dose for Phase II testing.

Phase I trials rarely fail on safety alone: approximately 60-65% of drugs advance from Phase I to Phase II. Failures at this stage are typically due to unacceptable toxicity at doses needed for efficacy, unfavorable pharmacokinetic profiles (drug is metabolized too quickly, does not achieve adequate tissue concentrations), or unexpected drug interactions.

Maximum Tolerated Dose (MTD) and Recommended Phase II Dose (RP2D): The MTD is the highest dose level at which a drug can be administered without causing unacceptable side effects (dose-limiting toxicities). The RP2D is the dose selected for Phase II testing, which may be at or below the MTD. Dose selection is critical because it determines the efficacy signal in Phase II: too low and the drug may appear ineffective; too high and toxicity may obscure the benefit. In oncology, where tolerability thresholds are higher due to disease severity, dose escalation studies are particularly important. Modern dose escalation uses Bayesian statistical methods (like the Bayesian Optimal Interval design) rather than the traditional 3+3 approach, allowing more efficient and safer dose finding.

Financial implications for Phase I. The low cost ($4-10 million) and relatively short timeline make Phase I a period of modest cash burn. For pre-revenue biotechs, Phase I is funded by venture capital or early public market proceeds. The primary financial risk is not the Phase I cost itself but the binary outcome: if Phase I reveals safety issues, the entire program's value (potentially hundreds of millions in future development costs avoided) is eliminated, and the company's stock can decline 30-50%.

Phase II: Efficacy Signal (1-3 years, $10-50M)

Phase II is the most important phase for healthcare bankers because it is where the fundamental question gets answered: does the drug work in patients? Phase II trials enroll 100-300 patients with the target disease and measure preliminary efficacy while continuing to monitor safety.

Phase II has the highest attrition rate of any phase: only 30-35% of drugs advance from Phase II to Phase III. This high failure rate reflects the biological reality that promising preclinical and Phase I data frequently does not translate to efficacy in larger patient populations. The reasons for Phase II failure include: insufficient efficacy (the drug works but not well enough to justify further development), safety signals that emerge in larger patient populations, inability to identify a responsive patient subgroup, and biomarker hypotheses that do not hold up in clinical testing.

Phase II as the Valuation Fulcrum

Phase II proof-of-concept data is the single most important data readout in the biotech lifecycle. Positive Phase II data can increase a company's value 2-5x virtually overnight. Negative Phase II data can destroy 60-90% of value. For healthcare bankers, Phase II data determines: (1) whether a pipeline asset should be valued at a pre-PoC or post-PoC probability of success, (2) whether the company is an attractive M&A target (acquirers overwhelmingly prefer post-Phase II assets where clinical risk is substantially de-risked), (3) whether the company can raise capital on favorable terms in the biotech capital markets, and (4) what peak sales assumptions are credible for the rNPV model. A Phase II readout is not just a clinical event; it is the single most significant financial event in a biotech's life.

Phase II is sometimes divided into Phase IIa (dose-finding and initial efficacy signal, often 50-100 patients) and Phase IIb (larger, more definitive efficacy assessment, often 200-400 patients with randomization against a comparator), though not all development programs make this distinction. Phase IIb trials that are randomized and well-powered can sometimes serve as the basis for accelerated approval, blurring the boundary between Phase II and Phase III.

Phase III: Confirmation (2-4 years, $50-150M+ per trial)

Phase III trials are large, randomized, controlled studies designed to provide definitive evidence of efficacy and safety for regulatory submission. They enroll 1,000-5,000+ patients across multiple clinical sites (often 100-300 sites across 20-40 countries for global programs), are typically double-blinded and controlled (patients are randomized to receive the drug or a comparator), and must hit pre-specified primary endpoints with statistical significance.

Phase III trials are the most expensive phase of clinical development, often costing $50-150 million per trial. Many programs require two Phase III trials for regulatory approval (the FDA's "substantial evidence" standard generally requires replication), doubling the cost and timeline. The Phase III success rate is approximately 55-65%, reflecting the stringent statistical requirements for approval.

Why Phase III trials fail despite positive Phase II data:

Phase II was underpowered: Small Phase II sample sizes can produce false positive results that do not replicate in larger Phase III populations
Endpoint mismatch: Phase II may use a surrogate endpoint (tumor response rate) while Phase III requires a harder endpoint (overall survival), and the surrogate may not predict the primary endpoint
Population heterogeneity: The more diverse Phase III population may include patient subgroups that respond poorly, diluting the overall treatment effect
Comparator change: Phase II may compare against placebo while Phase III compares against a standard of care treatment, raising the bar for demonstrating superiority

Endpoints: What Gets Measured

The endpoint hierarchy determines what data the trial measures and which measurements the FDA uses to make approval decisions. The choice of endpoint has profound financial implications because it determines trial size, duration, and cost.

Endpoint Type	Definition	Examples	Financial Relevance
Primary endpoint	The main outcome the trial is designed to measure. Must achieve statistical significance for the trial to "succeed"	Overall survival (OS), progression-free survival (PFS), overall response rate (ORR)	Determines trial success or failure; drives probability of approval
Secondary endpoints	Additional outcomes that support the primary endpoint. Meaningful but not sufficient alone for approval	Duration of response, quality of life, disease-free survival	Supports labeling claims, competitive positioning, payer coverage
Exploratory endpoints	Hypothesis-generating measurements not powered for statistical significance	Biomarker correlations, subgroup analyses	May identify future development opportunities or patient selection strategies

Primary Endpoint Miss = Trial Failure

If a trial misses its primary endpoint (does not achieve statistical significance on the pre-specified primary outcome measure), the trial has failed for regulatory purposes, even if secondary endpoints show positive results. FDA approval requires demonstration of efficacy on the primary endpoint. A common mistake among candidates is assuming that "mixed results" (missed primary, positive secondaries) can still support approval. In the vast majority of cases, they cannot. The primary endpoint determines the trial's success or failure, and everything else is supplementary. There are rare exceptions where the totality of evidence across endpoints supports approval, but these are the exception, not the rule, and typically require an FDA advisory committee discussion.

Endpoint Selection Strategy

The choice of primary endpoint involves strategic tradeoffs that directly affect the financial model:

Overall survival (OS) is the gold standard endpoint: it measures whether patients live longer. OS is unambiguous, universally accepted, and supports the strongest labeling claims. However, OS trials require very large patient populations (to detect statistically significant differences) and long follow-up periods (often 3-5+ years in oncology), making them the most expensive and time-consuming trial design. OS trials in adjuvant oncology settings can cost $200-400 million and take 5-7 years.

Progression-free survival (PFS) measures the time before disease worsens. PFS events occur more frequently than death, so PFS trials require fewer patients and shorter follow-up than OS trials, reducing cost and timeline. PFS is widely accepted as a primary endpoint in many oncology settings, though debate continues about whether PFS gains always translate to OS benefits.

Objective response rate (ORR) measures the percentage of patients whose tumors shrink by a defined amount. ORR trials can be single-arm (no comparator needed), require the fewest patients (often 100-200), and are the fastest to complete. ORR can support accelerated approval but typically requires a confirmatory trial with a harder endpoint.

Endpoint Choice and Trial Economics: A Practical Comparison

A biotech developing an oncology drug must choose its Phase III primary endpoint:

Endpoint	Patients Needed	Duration	Cost	Regulatory Confidence
Overall survival	1,500-2,000	4-5 years	$150-200M	Highest (full approval)
Progression-free survival	600-800	2-3 years	$75-100M	High (full approval in most settings)
Objective response rate	100-200	12-18 months	$15-30M	Moderate (accelerated approval)

The financial impact is enormous. A company with $200 million in cash can fund one OS trial or two PFS trials or four ORR trials. The endpoint choice determines how many indications the company can pursue simultaneously, how quickly it can generate data, and how much dilution shareholders face from additional capital raises.

Key Statistical Concepts

Healthcare bankers need to understand three statistical concepts that determine whether clinical data is approvable:

P-Value: The probability that the observed treatment effect (or a more extreme effect) would occur by chance if the drug had no real benefit. In clinical trials, the conventional threshold for statistical significance is p < 0.05, meaning there is less than a 5% probability the observed result is due to chance. A p-value of 0.001 provides much stronger evidence than p = 0.04. When evaluating trial results, the first question is always: did the primary endpoint achieve p < 0.05? If not, the trial missed its primary endpoint regardless of how encouraging the numerical results may appear. Context matters: a p-value of 0.051 in a trial with a strong secondary endpoint package may still generate discussion at an FDA advisory committee, but it is formally a miss.

Hazard ratio (HR). Used in time-to-event analyses (survival, progression-free survival), the hazard ratio compares the rate of events (death, disease progression) in the treatment group versus the control group. HR < 1.0 means the treatment reduces the event rate. An HR of 0.65 means the treatment reduces the risk of the event by 35%. Lower HRs indicate stronger treatment effects. In oncology, an HR of 0.70-0.80 is considered clinically meaningful for most indications, while an HR below 0.60 is considered a strong result.

Confidence interval (CI). The range within which the true treatment effect likely falls. A 95% CI that does not cross 1.0 (for hazard ratios) or 0 (for absolute differences) indicates statistical significance. A narrow CI indicates precise estimation; a wide CI indicates uncertainty. The CI provides more information than the p-value alone because it shows both the magnitude and precision of the treatment effect. A drug with HR 0.70 (95% CI: 0.55-0.90) has a well-characterized benefit, while one with HR 0.70 (95% CI: 0.45-1.10) might not be statistically significant despite the same point estimate.

Reading an Oncology Data Readout: Step by Step

A Phase III lung cancer trial reports: "Median OS of 22.4 months vs. 14.8 months (HR 0.68, 95% CI 0.55-0.84, p < 0.001)."

How a healthcare banker interprets this:

1. Survival benefit: The drug extended median survival by 7.6 months (22.4 vs. 14.8), a clinically meaningful improvement in lung cancer

2. Hazard ratio: 0.68 means the drug reduced the risk of death by 32% (1 - 0.68), a strong treatment effect

3. Statistical significance: p < 0.001 far exceeds the p < 0.05 threshold, indicating very strong statistical evidence

4. Precision: The 95% CI (0.55-0.84) does not cross 1.0, confirming statistical significance, and the narrow range indicates a precise estimate

5. Financial translation: This data supports a high probability of approval (85-90%), justifies robust peak sales assumptions based on the survival benefit, and would likely result in a stock price increase of 20-50% if not already priced in

Compare this to a borderline readout: "Median PFS of 5.4 months vs. 4.2 months (HR 0.82, 95% CI 0.67-1.01, p = 0.06)." This trial technically missed its primary endpoint (p > 0.05), the CI crosses 1.0, and the absolute benefit of 1.2 months is modest. Despite showing a numerical trend, this is a negative trial for regulatory purposes.

Modern Trial Designs

Traditional clinical trials follow a linear path (Phase I, then Phase II, then Phase III). Modern designs compress timelines and improve efficiency, which directly affects biotech cash burn, time to revenue, and valuation:

Adaptive designs allow modifications to the trial (sample size, dose selection, patient population, randomization ratio) based on interim data, without compromising the trial's statistical integrity. Adaptive designs can reduce development time and cost by eliminating unpromising doses or populations earlier. A dose-adaptive trial might start with three dose levels, drop the lowest dose at an interim analysis based on insufficient efficacy, and reallocate patients to the remaining doses, completing enrollment faster with a higher probability of demonstrating efficacy at the optimal dose.

Basket trials test a single drug across multiple cancer types that share a common biomarker or genetic mutation (e.g., testing a BRAF inhibitor in melanoma, colorectal cancer, thyroid cancer, and non-small cell lung cancer simultaneously). This allows simultaneous evaluation in multiple indications, accelerating the identification of responsive patient populations and potentially supporting multiple approval indications from a single trial.

Umbrella trials test multiple drugs against different biomarker-defined subgroups within a single disease. All patients with lung cancer, for example, are screened for multiple biomarkers (EGFR, ALK, ROS1, KRAS, PD-L1) and assigned to the appropriate treatment arm based on their tumor's molecular profile. Umbrella trials create efficiency by sharing a common screening infrastructure across multiple experimental treatments.

Seamless Phase II/III designs combine Phase II and Phase III into a single trial, using an adaptive framework where Phase II patients are enrolled first, an interim analysis determines whether to proceed, and additional patients are enrolled to complete the Phase III portion. If the interim analysis is positive, Phase II patients can be included in the Phase III analysis, saving 12-18 months vs. sequential Phase II and Phase III. This design is increasingly common for breakthrough therapy designated programs where speed is critical.

Clinical Trial Knowledge in Interviews

Interviewers do not expect you to design clinical trials. They expect you to interpret trial results and connect them to valuation. If given a data readout, follow a structured approach: (1) Did the primary endpoint achieve statistical significance (p < 0.05)? (2) What is the magnitude of the treatment effect (hazard ratio, response rate difference)? Is it clinically meaningful? (3) Is the confidence interval tight or wide? (4) How does this data change the probability of success and peak sales assumptions in the rNPV model? This analytical framework connects clinical data to financial analysis, which is exactly what healthcare bankers do. You should also understand the financial implications of endpoint choice: "An ORR-based accelerated approval strategy costs $15-30 million and takes 18 months, while a full OS trial costs $150-200 million and takes 4-5 years. This choice determines the company's capital needs and time to revenue."

The next article covers surrogate endpoints and accelerated approval, which create a unique valuation dynamic where drugs can generate revenue while confirmatory evidence is still being collected.

Interview Questions

Interview Question #1Easy

Walk me through the phases of a clinical trial.

Clinical trials proceed through four phases:

Phase I (Safety). 20-100 healthy volunteers. Tests whether the drug is safe at various doses, identifies side effects, and establishes the recommended dose. Duration: 6-12 months. Success rate: ~65%. Cost: $1-5M typically.

Phase II (Efficacy). 100-300 patients with the target disease. Tests whether the drug actually works (efficacy) at the established dose. This is the highest-risk phase, with ~30-35% success rate. Generates the first evidence of therapeutic benefit. Duration: 1-2 years. Cost: $10-40M.

Phase III (Confirmation). 1,000-5,000+ patients across multiple clinical sites. Confirms efficacy and safety at scale, collects the data package needed for regulatory submission. Often randomized, double-blind, controlled trials. Success rate: ~55-60%. Duration: 2-4 years. Cost: $50-300M+, the most expensive phase.

Phase IV (Post-marketing). Conducted after FDA approval. Monitors long-term safety in a broader patient population, explores new indications, and fulfills post-marketing commitments required by the FDA.

Total development cost from preclinical through approval averages $1-2 billion per approved drug, inclusive of the cost of failed programs.

Interview Question #2Medium

What is the difference between a primary endpoint and a secondary endpoint, and why does it matter?

A primary endpoint is the main outcome measure that determines whether a clinical trial succeeds or fails. The trial is statistically powered to detect a meaningful difference on this endpoint. Examples: overall survival (OS) in oncology, HbA1c reduction in diabetes, ACR response rate in rheumatoid arthritis.

A secondary endpoint provides supporting evidence but does not independently determine trial success. Examples: progression-free survival (PFS), quality of life, duration of response.

Why it matters:

1. FDA decision. The FDA bases approval primarily on whether the primary endpoint was met with statistical significance. Meeting secondary but missing primary endpoints usually means the trial failed.

2. Labeling. The primary endpoint data drives what claims can be included in the drug label, which directly affects commercial positioning.

3. Valuation impact. Analysts model the probability of success based on the primary endpoint. A trial that meets its primary endpoint de-risks the asset significantly. A trial that misses the primary but hits secondary endpoints creates ambiguity that the market discounts.

4. Endpoint choice signals. Companies choosing surrogate primary endpoints (e.g., PFS instead of OS) may get faster approval but may face FDA requirements for confirmatory trials on harder endpoints later.

Interview Question #3Medium

What is a surrogate endpoint and when would the FDA accept it?

A surrogate endpoint is a biomarker or intermediate clinical measure used in place of the "gold standard" clinical endpoint (like overall survival) to allow faster evaluation of drug efficacy. Examples: tumor shrinkage (objective response rate) as a surrogate for overall survival in oncology, HbA1c for long-term diabetic complications, viral load suppression for HIV clinical outcomes.

The FDA accepts surrogate endpoints under two main pathways:

1. Accelerated Approval. Grants approval based on a surrogate endpoint that is "reasonably likely to predict clinical benefit." The drug reaches market faster, but the manufacturer must conduct post-marketing confirmatory trials to verify the clinical benefit. If confirmatory trials fail, the FDA can withdraw approval.

2. Established surrogates. Some surrogates have been validated through decades of use and are accepted as primary endpoints in standard (non-accelerated) approval. Example: blood pressure reduction for antihypertensive drugs, LDL cholesterol reduction for statins.

The trade-off for valuation: surrogate endpoints enable faster approval (better NPV due to earlier revenue) but create post-approval risk. If confirmatory trials fail, the drug faces withdrawal. A drug approved on a surrogate endpoint should carry some residual regulatory risk in valuation models.

The Biotech Lifecycle and Value Inflection Points Surrogate Endpoints, Accelerated Approval, and Confirmatory Trials

Explore More

Guides

Financial Sponsors Group (FSG) in Investment Banking

What the Financial Sponsors Group does: how FSG covers private equity firms, why it differs from industry coverage, the leveraged finance link, and FSG careers.

May 30, 2026

3-Statement Financial Model: How to Build One

Accounting

Technical

3-Statement Financial Model: How to Build One

Learn how to build an integrated 3-statement financial model from scratch. Step-by-step walkthrough of income statement, balance sheet, and cash flow projections.

February 22, 2026

What Is a Fairness Opinion in Investment Banking?

Technical

M&A

What Is a Fairness Opinion in Investment Banking?

How fairness opinions work in M&A: when they're required, how banks prepare them, what they cost, and why interviewers ask about them.

October 27, 2025

Ready to Transform Your Interview Prep?

Join 3,000+ students preparing smarter

Download on App Store

Free Interview PDF Guide

Join 3,000+ students who have downloaded this resource