T-Test Calculator: p-Value, CI, Welch & Paired Tests

t-test Calculator

One-sample • Two-sample • Paired • p-value + CI

Settings

🧪 Test type

CI %

CI uses the same df as the selected test. Default: 95%.

🎯 One-sample hypothesis

μ₀

Tests whether the sample mean differs from μ₀.

Quick summary

Enter data to compute the t-test.

⚖️ Two-sample assumption Assume equal variances (pooled)

Unchecked = Welch’s t-test (recommended).

Quick summary

Enter data for both groups.

🔗 Paired test

Uses differences (A − B) and runs a one-sample t-test on them with μ₀ = 0.

Quick summary

Enter paired data (same length).

Data

📌 Sample / Group A

Tip: paste from Excel/Sheets; newlines are fine.

Results

t statistic

—

Degrees of freedom (df)

—

p-value

—

Effect (mean diff)

—

n (A / B)

—

Updating…

Action

Hard reset: cancels pending updates and clears everything on the first click.

Use this T Test Calculator to run one-sample, two-sample, Welch, pooled, or paired t-tests and get the t statistic, degrees of freedom, p-value, confidence interval, mean difference, group statistics, and a test conclusion. It is useful for statistics homework, research planning, classroom examples, spreadsheet checking, and quick hypothesis-test review from pasted data.

Reviewed by: AjaxCalculators Editorial Team
Last updated: May 2, 2026
Method source: Standard one-sample, two-sample, Welch, pooled, and paired t-test formulas with matching confidence-interval and p-value logic
Editorial standards: AjaxCalculators Editorial Policy

What This T Test Calculator Calculates

This calculator performs a full t-test workflow for common mean-comparison problems. Depending on the selected mode, it can report:

t statistic: the standardized test statistic
Degrees of freedom (df): the value used with the t distribution
p-value: the probability-based measure used in hypothesis testing
Confidence interval (CI): a range of plausible values for the mean or mean difference
Effect / mean difference: the observed difference being tested
Standard error (SE): the estimated sampling variability of the mean or difference
Group statistics: sample size, mean, and standard deviation when available
Test conclusion: a plain-language summary based on the selected significance level

It supports three main test types:

One-sample t-test: compare one sample mean with a hypothesized value
Two-sample t-test: compare two independent sample means
Paired t-test: compare matched observations such as before-and-after data

For two-sample mode, it supports both:

Welch’s t-test: for unequal variances or unequal sample sizes
Pooled t-test: when equal variances are assumed

What a T Test Means

A t-test is a statistical method used to compare a sample mean or mean difference against a null hypothesis. It asks whether the observed difference is large relative to the estimated random variation in the data.

A t-test can help answer questions such as:

Is one sample mean different from a known or hypothesized value?
Are two independent group means different?
Did a matched group change from before to after?
Is the observed difference large enough to be unlikely under the null hypothesis?

The calculator can provide a conclusion such as whether the result is statistically significant at the chosen alpha level, but the practical meaning still depends on the size of the effect, data quality, study design, and real-world context.

How the T Test Calculator Works

1) One-Sample t-Test

A one-sample t-test compares a sample mean with a hypothesized population mean.

t = (x̄ − μ₀) / (s / √n)

Where:

x̄ = sample mean
μ₀ = hypothesized population mean
s = sample standard deviation
n = sample size
s / √n = standard error of the mean

The degrees of freedom are:

df = n − 1

2) Two-Sample Welch t-Test

Welch’s t-test compares two independent sample means without assuming equal variances.

t = (x̄₁ − x̄₂) / √(s₁²/n₁ + s₂²/n₂)

Where:

x̄₁ = mean of group 1
x̄₂ = mean of group 2
s₁ = sample standard deviation of group 1
s₂ = sample standard deviation of group 2
n₁ = sample size of group 1
n₂ = sample size of group 2

The degrees of freedom are estimated with the Welch–Satterthwaite formula. Welch’s method is generally a safer default when sample sizes or variances may differ.

3) Two-Sample Pooled t-Test

A pooled t-test compares two independent means while assuming equal population variances.

t = (x̄₁ − x̄₂) / [s_p × √(1/n₁ + 1/n₂)]

The pooled variance is:

s_p² = [(n₁ − 1)s₁² + (n₂ − 1)s₂²] / (n₁ + n₂ − 2)

For the pooled test:

df = n₁ + n₂ − 2

The pooled method should be used only when the equal-variance assumption is appropriate. If that assumption is doubtful, Welch’s method is usually preferable.

4) Paired t-Test

A paired t-test is used when the observations are matched, such as before-and-after measurements from the same subjects or matched pairs.

First, the calculator computes paired differences:

D_i = A_i − B_i

Then it performs a one-sample t-test on the differences:

t = (d̄ − μ₀) / (s_D / √n)

Where:

d̄ = mean of the paired differences
μ₀ = hypothesized mean difference, often 0
s_D = sample standard deviation of the differences
n = number of matched pairs

For the usual paired t-test:

df = n − 1

5) p-Value and Confidence Interval

After the t statistic and degrees of freedom are computed, the calculator uses the selected alternative hypothesis to determine the p-value.

The alternative hypothesis may be:

Two-tailed: the mean or mean difference is not equal to the null value
Right-tailed: the mean or mean difference is greater than the null value
Left-tailed: the mean or mean difference is less than the null value

The confidence interval uses the same test setup, standard error, degrees of freedom, and confidence level so that the interval matches the selected t-test method.

Formula Summary

Test Type	Main Formula	Degrees of Freedom	Best Used When
One-sample t-test	t = (x̄ − μ₀) / (s / √n)	n − 1	One sample mean is compared with a hypothesized value
Welch two-sample t-test	t = (x̄₁ − x̄₂) / √(s₁²/n₁ + s₂²/n₂)	Welch–Satterthwaite approximation	Two independent groups are compared without assuming equal variances
Pooled two-sample t-test	t = (x̄₁ − x̄₂) / [s_p√(1/n₁ + 1/n₂)]	n₁ + n₂ − 2	Two independent groups are compared and equal variances are reasonable
Paired t-test	t = (d̄ − μ₀) / (s_D / √n)	n − 1	Matched pairs or before-and-after observations are compared

Choosing the Correct T Test

The correct t-test depends on your research question and data structure.

Your Data Question	Use This Test	Example
Is one sample mean different from a known or hypothesized value?	One-sample t-test	Compare average score with 75
Are two unrelated group means different?	Welch two-sample t-test	Compare two separate classes or treatment groups
Are two unrelated group means different when equal variances are justified?	Pooled two-sample t-test	Textbook equal-variance comparison
Did the same subjects change from before to after?	Paired t-test	Compare pre-test and post-test scores from the same people
Are the values ordinal, heavily skewed, or dominated by outliers?	May need another method	Consider nonparametric or model-based analysis

Welch vs Pooled Two-Sample t-Test

Two-sample t-tests require careful handling of variance assumptions.

Feature	Welch t-Test	Pooled t-Test
Equal variance assumption	Not required	Required
Unequal sample sizes	Generally handles better	Can be sensitive when variances also differ
Degrees of freedom	Approximate Welch–Satterthwaite df	n₁ + n₂ − 2
Recommended use	Often safer default for independent groups	Use only when equal variances are reasonable

If you are unsure whether the two groups have equal variances, Welch’s test is usually the more cautious option.

Alternative Hypothesis and Tail Direction

The alternative hypothesis controls how the p-value is calculated and how the result should be interpreted.

Alternative Type	Meaning	Use When
Two-tailed	The mean or difference is not equal to the null value	Differences in either direction matter
Right-tailed	The mean or difference is greater than the null value	Only a positive increase supports the alternative
Left-tailed	The mean or difference is less than the null value	Only a negative decrease supports the alternative

Choose the tail direction before looking at the result. Changing from two-tailed to one-tailed after seeing the data can create misleading conclusions.

p-Value, Alpha, and Statistical Significance

The p-value measures how unusual the observed test statistic would be if the null hypothesis were true, in the direction specified by the alternative hypothesis.

The alpha level, often written as α, is the significance threshold you choose before testing.

Comparison	Common Conclusion	Important Reminder
p-value ≤ α	Statistically significant at the selected alpha level	Reject the null hypothesis in the test framework
p-value > α	Not statistically significant at the selected alpha level	Fail to reject the null hypothesis; this does not prove no effect

Statistical significance does not automatically mean the effect is large, important, causal, or practically useful.

Confidence Interval Interpretation

A confidence interval gives a range of plausible values for the true mean or mean difference under the selected model.

Confidence Interval Pattern	Possible Interpretation	Important Note
CI does not include the null value	Often aligns with statistical significance for a matching two-tailed test	Depends on the selected confidence level and test setup
CI includes the null value	The data may not show a statistically clear difference at that level	Does not prove the groups are identical
Narrow CI	More precise estimate	Precision can improve with larger sample size or lower variation
Wide CI	Less precise estimate	May reflect small samples or high variability

The confidence interval is often more informative than the p-value alone because it shows both direction and approximate size of the effect.

Worked Example: One-Sample t-Test

Suppose a sample has:

Sample mean: 12
Hypothesized mean: 10
Sample standard deviation: 4
Sample size: 16

Step 1: Find the standard error
SE = s / √n

Step 2: Substitute the values
SE = 4 / √16

Step 3: Calculate the standard error
SE = 4 / 4 = 1

Step 4: Compute the t statistic
t = (x̄ − μ₀) / SE

Step 5: Substitute the values
t = (12 − 10) / 1

Step 6: Calculate
t = 2.00

Step 7: Find degrees of freedom
df = n − 1 = 16 − 1 = 15

Result: The one-sample t statistic is 2.00 with 15 degrees of freedom. The p-value and confidence interval are then calculated from the t distribution with 15 df.

Worked Example: Two-Sample Welch t-Test

Suppose two independent groups have:

Group A: mean = 20, SD = 5, n = 25
Group B: mean = 16, SD = 6, n = 20

Step 1: Find the mean difference
x̄₁ − x̄₂ = 20 − 16 = 4

Step 2: Compute the standard error
SE = √(s₁²/n₁ + s₂²/n₂)

Step 3: Substitute the values
SE = √(5²/25 + 6²/20)

Step 4: Calculate
SE = √(25/25 + 36/20)
SE = √(1 + 1.8)
SE = √2.8 ≈ 1.673

Step 5: Compute the t statistic
t = 4 / 1.673 ≈ 2.39

Result: The Welch t statistic is approximately 2.39. The calculator then uses Welch degrees of freedom to compute the p-value and confidence interval.

Worked Example: Pooled Two-Sample t-Test

Suppose two independent groups have equal-variance support:

Group A: mean = 15, SD = 4, n = 18
Group B: mean = 12, SD = 5, n = 16

Step 1: Find the mean difference
15 − 12 = 3

Step 2: Calculate pooled variance
s_p² = [(18 − 1)4² + (16 − 1)5²] / (18 + 16 − 2)

Step 3: Simplify
s_p² = [(17 × 16) + (15 × 25)] / 32
s_p² = (272 + 375) / 32
s_p² = 647 / 32 ≈ 20.219

Step 4: Find pooled standard deviation
s_p = √20.219 ≈ 4.496

Step 5: Find standard error
SE = s_p × √(1/n₁ + 1/n₂)
SE = 4.496 × √(1/18 + 1/16)
SE ≈ 1.545

Step 6: Compute t
t = 3 / 1.545 ≈ 1.94

Step 7: Find degrees of freedom
df = 18 + 16 − 2 = 32

Result: The pooled two-sample t statistic is approximately 1.94 with 32 degrees of freedom.

Worked Example: Paired t-Test

Suppose five matched before-and-after differences are:

Differences: 3, 2, 4, 1, 5
Hypothesized mean difference: 0

Step 1: Find the mean difference
d̄ = (3 + 2 + 4 + 1 + 5) / 5 = 3

Step 2: Find the sample standard deviation of differences
s_D ≈ 1.581

Step 3: Find the standard error
SE = s_D / √n
SE = 1.581 / √5 ≈ 0.707

Step 4: Compute the paired t statistic
t = (d̄ − μ₀) / SE
t = (3 − 0) / 0.707 ≈ 4.24

Step 5: Find degrees of freedom
df = n − 1 = 5 − 1 = 4

Result: The paired t statistic is approximately 4.24 with 4 degrees of freedom.

How to Use This T Test Calculator

Select the correct test type: one-sample, two-sample, or paired.
Choose the alternative hypothesis: two-tailed, right-tailed, or left-tailed.
Set the confidence level if the calculator provides a confidence-interval option.
Paste or enter the sample data for Group A, and Group B when required.
For one-sample mode, enter the hypothesized mean μ₀.
For two-sample mode, use Welch’s test unless you have a good reason to assume equal variances.
For paired mode, make sure both lists are matched, in the same order, and have the same number of observations.
Click Calculate if the tool requires it.
Review the t statistic, df, p-value, confidence interval, mean difference, and conclusion.
Interpret the result together with assumptions, effect size, and context.

How to Interpret the Result

The calculator gives several outputs that work together.

Output	Meaning	How to Use It
t statistic	Standardized distance from the null value	Used with df to find the p-value
Degrees of freedom	Controls the shape of the t distribution	Needed for p-values and confidence intervals
p-value	How unusual the result is under the null hypothesis	Compared with alpha to assess statistical significance
Confidence interval	Range of plausible values for the effect	Shows direction, size, and precision
Mean difference	Raw observed effect before standardization	Helps assess practical meaning
Conclusion	Summary based on alpha and p-value	Useful shorthand, but not the whole interpretation

A statistically significant result means the data are unlikely under the null hypothesis according to the selected test and alpha level. It does not automatically mean the result is large, important, causal, unbiased, or practically meaningful.

Positive and Negative t Values

The sign of the t statistic depends on the order of subtraction.

t Value	General Meaning	Example
Positive t	The observed mean or difference is above the null value	Group A mean is higher than Group B if using A − B
Negative t	The observed mean or difference is below the null value	Group A mean is lower than Group B if using A − B
t near 0	The observed difference is small relative to standard error	The data are close to the null value in standardized terms

For two-tailed tests, the absolute size of t matters. For one-tailed tests, both the sign and the selected direction matter.

Common Assumptions by Test Type

Every t-test depends on assumptions. These assumptions matter more when samples are small or data are unusual.

Test Type	Key Assumptions	Common Warning Signs
One-sample t-test	Independent observations; meaningful mean; data suitable for t-based inference	Strong skew, severe outliers, non-independent observations
Welch two-sample t-test	Independent groups; independent observations within groups; means are meaningful	Matched data entered as independent, severe outliers, poor sampling design
Pooled two-sample t-test	Independent groups plus reasonable equal-variance assumption	Very different standard deviations or unequal sample sizes
Paired t-test	Matched pairs; differences are analyzed; pairs are independent of other pairs	Unequal list lengths, unmatched observations, outliers in differences

When assumptions are questionable, consider visual checks, robust methods, nonparametric tests, transformation, or statistical guidance.

T Test vs t Statistic Calculator

A t statistic calculator usually computes only the t value and related quantities. A t test calculator goes further by adding p-values, confidence intervals, and a statistical conclusion.

Tool	Typical Output	Best Use
t Statistic Calculator	t statistic, df, standard error, mean difference	Checking the core test statistic
T Test Calculator	t statistic, df, p-value, confidence interval, conclusion	Completing a fuller hypothesis-test workflow

Use this T Test Calculator when you need both the calculation and the statistical interpretation outputs.

When This Calculator Is Useful

This calculator is useful when you need a quick t-test result from sample data or summary statistics.

Run a one-sample t-test against a known reference value
Compare two independent groups with Welch or pooled assumptions
Analyze before-and-after or matched-pair data
Check statistics homework or classroom examples
Compare spreadsheet work with calculator output
Get p-values and confidence intervals quickly
Review whether a result is statistically significant at a chosen alpha level
Prepare results before writing a statistical interpretation

When You May Need More Than This Calculator

A t-test calculator may not be enough when the data or decision is complex.

Use full statistical software or qualified statistical guidance when you need:

formal assumption checks
data visualization and outlier review
effect sizes such as Cohen’s d
power analysis or sample-size planning
nonparametric alternatives
regression, ANOVA, mixed models, or repeated-measures models
multiple-comparison correction
survey-weighted or clustered-data analysis
research publication reporting
medical, financial, legal, policy, or high-stakes decisions

Common Mistakes to Avoid

Choosing the wrong test type: paired data should not be analyzed as independent samples.
Using pooled t automatically: pooled t requires a reasonable equal-variance assumption.
Changing the tail direction after seeing the result: this can make conclusions misleading.
Confusing p-value with effect size: a small p-value does not necessarily mean a large effect.
Ignoring confidence intervals: CIs show the possible size and direction of the effect.
Ignoring sample size: small samples can make results unstable and assumptions more important.
Ignoring outliers: outliers can strongly affect means, standard deviations, and t-test results.
Assuming statistical significance proves causation: causation depends on study design, not just a p-value.
Reporting only “significant” or “not significant”: include the estimate, CI, p-value, and context when possible.

Important Assumptions and Limitations

This calculator performs t-test calculations, so the result depends on choosing the correct test type for your data.
One-sample and paired t-tests usually assume the sampled values or paired differences are reasonably suitable for t-based inference, especially for smaller samples.
Independent two-sample tests require independent groups, not matched observations.
Paired mode requires Group A and Group B to contain matched observations in the same order and with the same length.
Welch’s t-test is generally safer when group variances or sample sizes may differ.
The pooled two-sample t-test should only be used when the equal-variance assumption is appropriate.
A statistically significant p-value does not automatically mean the effect is large, important, unbiased, or practically meaningful.
The calculator does not replace full statistical review for complex or high-stakes analysis.

Practical Uses of a T Test Calculator

Run a one-sample t-test from sample data
Compare two independent group means
Use Welch’s t-test when variances may differ
Use pooled t-tests for equal-variance textbook or planned comparisons
Analyze matched pairs or before-and-after values
Calculate p-values and confidence intervals
Check hand calculations or spreadsheet formulas
Support classroom, homework, and early research-planning tasks

References

Related Calculators

Frequently Asked Questions

What does this T Test Calculator calculate?

It calculates the t statistic, degrees of freedom, p-value, confidence interval, mean difference, group statistics, and a test conclusion for common t-test setups.

What is a t-test?

A t-test is a statistical method used to compare a sample mean or mean difference with a null hypothesis while accounting for sample variability.

What is the one-sample t-test used for?

A one-sample t-test compares one sample mean with a hypothesized value, such as comparing an average score with a target score.

What is a two-sample t-test used for?

A two-sample t-test compares the means of two independent groups, such as two separate classes, treatments, or populations.

What is Welch’s t-test?

Welch’s t-test is a two-sample t-test that does not assume equal variances between the two groups. It is often safer when variances or sample sizes differ.

What is a pooled t-test?

A pooled t-test is a two-sample t-test that assumes equal population variances and uses a pooled variance estimate.

What is a paired t-test?

A paired t-test compares matched observations by analyzing the differences between paired values, such as before-and-after measurements from the same people.

What is a p-value?

A p-value measures how unusual the observed result would be under the null hypothesis, in the direction of the selected alternative hypothesis.

What does statistically significant mean?

A result is statistically significant when the p-value is less than or equal to the selected alpha level. This means the result is unlikely under the null hypothesis according to that test setup.

Does statistical significance mean practical importance?

No. Statistical significance does not automatically mean the effect is large, useful, causal, or important in real-world terms.

What is a confidence interval?

A confidence interval is a range of plausible values for the true mean or mean difference under the selected statistical model.

Which t-test should I choose?

Use one-sample for one mean versus a hypothesized value, Welch two-sample for independent groups when variances may differ, pooled two-sample only when equal variances are reasonable, and paired t-test for matched data.

Can I use this calculator for paired before-and-after data?

Yes, use paired mode and enter matched values in the same order for Group A and Group B.

Can I use this calculator for non-normal data?

It depends on sample size, outliers, and the type of non-normality. For small samples or severe outliers, consider diagnostics, nonparametric methods, or statistical guidance.

Can this calculator replace statistical software?

No. It is useful for education and quick checks, but complex research or high-stakes analysis should use full statistical software and proper review.

Disclaimer: This T Test Calculator provides educational statistical estimates for one-sample, two-sample, Welch, pooled, and paired t-test setups. Results depend on the data entered, sample sizes, sample standard deviations, selected test type, alternative hypothesis, confidence level, variance assumption, pairing structure, and hypothesized mean or mean difference. A t-test result should be interpreted only when the selected test matches the study design and the assumptions are reasonably appropriate. One-sample and paired t-tests rely on the sampled values or paired differences being suitable for t-based inference, especially with small samples or outliers. Independent two-sample tests require independent groups, while paired tests require matched observations in the same order. Welch’s t-test is often safer when variances or sample sizes differ, while pooled t-tests should only be used when the equal-variance assumption is reasonable. A statistically significant p-value does not prove practical importance, causation, data quality, or real-world usefulness. Use this calculator for learning, homework, and quick statistical checks, and use full statistical software or qualified statistical guidance for research, publication, medical, financial, legal, policy, or high-stakes decisions.

More of our calculators...

t-statistic Calculator

Use our t-Statistic Calculator to compute the t value, degrees of freedom, and standard error for one-sample, two-sample, Welch, pooled, or paired t-tests.

Outlier Calculator (Tukey Fences)

Use our Outlier Calculator to find low and high outliers with Tukey fences and the IQR method. Get Q1, Q3, IQR, fences, counts, and sorted data fast.

Power Analysis Calculator

Use our Power Analysis Calculator to estimate sample size per group, statistical power, and minimum detectable effect for a two-sample mean test with alpha and standard deviation.

Sample Size Calculator

Use our Sample Size Calculator to estimate sample size for a mean or proportion from confidence level, margin of error, and optional finite population correction.

Margin of Error Calculator

Use our Margin of Error Calculator to find margin of error for a mean or proportion from sample size, confidence level, and z value.

Percentile Calculator

Use our Percentile Calculator to find percentile values from a data set with nearest rank or linear interpolation. Supports raw numbers and value:count input.

Variance Calculator

Use our Variance Calculator to find sample or population variance from a data set. Get mean, variance, standard deviation, SSE, range, and sorted values fast.

IQR (Interquartile Range) Calculator

Use our IQR Calculator(Interquartile Range) to find Q1, Q3, median, and interquartile range from a data set. Supports raw numbers, value:count input, and two quartile methods.

Standard Deviation Calculator

Use our Standard Deviation Calculator to find sample or population standard deviation from a data set. Get variance, mean, range, and sorted values fast.