Oracle
Search Site  
Find
 
Crystal Ball Home
Oracle Home
Information On
   Web & Live Events
   Six Sigma & DFSS
   Education Alliance
   Training Classes
   Conferences & Forums
   Available Languages
 
Quick Links
   Shop
   Download
   Newsletters
   Contact Us
 
 
Worldwide Offices
   United States
   United Kingdom
   Germany
 
 
 
 
TECHNOTE

Which distributions should I use?

Selecting a distribution for an assumption is one of the most challenging steps in creating a Crystal Ball model. Crystal Ball has 17 possible discrete and continuous distributions you can use to describe an assumption, including a custom distribution, which can be a combination of continuous and discrete ranges.

  • A continuous distribution assumes all values in the range are possible, so any range contains an infinite number of possible values. These distributions are smooth, solid curves.
  • A discrete probability distribution describes distinct, finite, commonly integer values. These distributions look like different-height columns set next to each other.

The first step in selecting a probability distribution is to use any available data. In the absence of data, use your understanding of the physics or conditions of the variable to help select a distribution. Finally, apply reasonable limits to a simple distribution.

NOTE: For additional distributions such as Erlang and Rayleigh (variations on those listed below), see our CB Tip (#16) Alternate Distributions.

 

Distribution

Conditions

Applications

Examples

Normal  

bullet The mean value is most likely
bullet It is symmetrical about the mean
bullet It is more likely to be close to the mean than far away

Natural phenomena.

People's heights, reproduction rates, inflation

Lognormal  

bullet Upper limit is unlimited but values cannot fall below zero
bullet Distribution is positively skewed, with most values near lower limit
bullet Natural logarithm of the distribution is a normal distribution

Situations where values are positively skewed, but cannot be negative.

Real estate prices, stock prices, pay scales, oil reservoir size

Triangular  

bullet The minimum is fixed
bullet The maximum is fixed
bullet It has a most likely value in this range, which forms a triangle with the minimum and maximum

When you know the minimum, maximum, and most likely values, popular for when you have limited data.

Sales estimates, number of cars sold in a week, inventory numbers, marketing costs

niform  

bullet Minimum is fixed
bullet Maximum is fixed
bullet All values in range are equally likely to occur

When you know the range and all possible values are equally likely.

A real estate appraisal, leak on a pipeline

Custom  

bullet Very flexible distribution, used to represent a situation you cannot describe with other distribution types
bullet Can be either continuous or discrete or a combination of both
bullet Used to input an entire set of data points from a range of cells

Binomial 

bullet For each trial, only 2 outcomes are possible; usually, success or failure
bullet The trials are independent
bullet The probability is the same from trial to trial

Describes the number of times an event occurs in a fixed number of trials, also used for Boolean logic (true/false or on/off).

Number of heads in 10 flips of a coin, likelihood of success or failure

Poisson  

bullet Number of possible occurrences is not limited
bullet Occurrences are independent
bullet Average number of occurrences is the same from unit to unit

Describes the number of times an event occurs in a given interval (usually time).

Number of telephone calls per minute, number of defects per 100 square yards of material

Exponential  

bullet The distribution describes the time between occurrences
bullet Distribution is not affected by previous events

Describes events that recur randomly.

Time between incoming phone calls, time between customer arrivals

Geometric  

bullet Number of trials is not fixed
bullet Trials continue until the first success
bullet Probability of success is the same from trial to trial

Describes the number of trials until the first successful occurrence.

Number of times you spin a roulette wheel before you win, how many wells to drill before you hit oil

Hypergeometric  

bullet Total number of items (population) is fixed
bullet Sample size (number of trials) is a portion of the population
bullet Probability of success changes after each trial

Describes the number of times an event occurs in a fixed number of trials, but trials are dependent on previous results.

Chance of a picked part being defective when selected from a box (without replacing picked parts to the box for the next trial)

Weibull 

This flexible distribution can assume the properties of other distributions.

When shape parameters equal 1, it is identical to Exponential. When equal to 2, it is identical to the Rayleigh.

Fatigue and failure tests or other physical quantities.

Failure time in a reliability study, breaking strength of a material in a control test

Beta  

bullet Range is between 0 and a positive value
bullet Shape can be specified with two positive values, alpha and beta

Represents variability over a fixed range, describes empirical data.

Representing the reliability of a company's devices

Gamma  

bullet The possible occurrences in any unit of measurement is not limited
bullet The occurrences are independent
bullet The average number of occurrences is constant from unit to unit

Applied for physical quantities, such as the time between events when the event process is not completely random.

Demand for expected number of units sold during lead time, meteorological processes (pollutant concentrations)

Logistic  

Conditions and parameters are complex.
See: Fishman, G. Springer Series in Operations Reaserch. NY: Springer-Verlag, 1996.

Describes growth.

Growth of a population as a function of time, some chemical reactions

Pareto 

Conditions and parameters are complex. See: Fishman, G. Springer Series in Operations Reaserch. NY: Springer-Verlag, 1996.

Analyzes other distributions associated with empirical phenomena.

Investigating distributions associated with city population sizes, size of companies, stock price fluctuations

Extreme Value  

Conditions and parameters are complex. See: Castillo, Enrique. Extreme Value Theory in Engineering. London: Academic Press, 1988.

Describes largest value of a response over time or the breaking strength of materials.

Largest flood flows, rainfall, and earthquakes, aircraft loads and tolerances

Negative Binomial  

bullet Number of trials is not fixed
bullet Trials continue to the th success (trials never less than r )
bullet Probability of success is the same from trial to trial

Models the distribution of the number of trials or failures until the th successful occurrence.

Number of sales calls before you close 10 orders

 
Home | Products | Services | Industries | Applications | Support | About Us | How to Buy
Privacy Policy | Trademarks | Copyright © 2007, Oracle and/or its affiliates. All rights reserved.