Pareto Distribution

📈 Pareto Distribution in Python

The Pareto Distribution is a heavy-tailed continuous probability distribution often used in economics, finance, and social sciences.
It is famous for modeling wealth distribution, income, or city sizes, where a small number of occurrences account for the majority of the effect (the 80/20 rule).


 1. Characteristics of Pareto Distribution

  • Continuous distribution

  • Parameters:

    • a → shape parameter (α > 0)

    • size → number of samples

  • Probability Density Function (PDF):

f(x;a)=axma/xa+1,x≥xmf(x; a) = a x_m^a / x^{a+1}, \quad x \ge x_m

  • x_m → minimum possible value (default in NumPy = 1)

  • Heavy-tailed (large values are possible)

Applications:

  • Wealth or income distribution

  • File sizes in the internet

  • Population sizes of cities


 2. Generate Pareto Data Using NumPy


 

Output (example):

[0.23, 0.78, 1.45, 0.56, 2.34, 0.12, 0.89, 1.67, 0.34, 3.12]
  • Values ≥ 0

  • Can shift using x_m + data if minimum value other than 1 is required


 3. Visualize Pareto Distribution


 

  • Histogram is right-skewed

  • Few very large values → heavy tail


 4. Shifted Pareto Distribution


 

  • Shift ensures minimum value ≥ xm


 5. Compare Different Shape Parameters


 

  • Smaller a → heavier tail (more extreme values)

  • Larger a → distribution more concentrated near 0


🧠 Summary Table

FunctionParametersDescription
np.random.pareto()a, sizeGenerates Pareto random numbers
aShape parameterControls tail heaviness
sizeNumber of samplesOutput array size
xmMinimum valueCan shift data by adding xm

🎯 Practice Exercises

  1. Generate 1000 Pareto random numbers with a=2 and plot histogram with KDE.

  2. Compare Pareto distributions for a=2 vs a=5.

  3. Shift Pareto data to have minimum value 10 and visualize.

You may also like...