SciPy Statistics Module
Quick Navigation
Statistics with SciPy
The scipy.stats module provides a wide range of statistical functions including descriptive statistics, probability distributions, statistical tests, and correlation analysis.
Key Features:
- Descriptive Statistics: Mean, median, mode, variance, standard deviation
- Probability Distributions: Normal, binomial, Poisson, exponential, etc.
- Statistical Tests: T-tests, ANOVA, chi-square tests, etc.
- Correlation Analysis: Pearson, Spearman, Kendall correlations
- Regression Analysis: Linear regression, curve fitting
Descriptive Statistics
Descriptive statistics summarize and describe the main features of a dataset.
Probability Distributions
SciPy provides numerous probability distributions with methods for PDF, CDF, random variate generation, and statistical moments.
Hypothesis Testing
Statistical hypothesis testing helps make decisions about populations based on sample data.
- P-value < 0.05: Statistically significant (reject null hypothesis)
- P-value >= 0.05: Not statistically significant
- T-statistic: Measures the size of difference relative to variation
Correlation and Regression
Measure relationships between variables and build predictive models.
Practical Example: Data Analysis
Complete analysis of a sample dataset: