Most techniques in the Define phase are not related to substantial use of statistical methods. The objective of the DEFINE phase is to bring together all parties concerned, grasp their knowledge and insights to the process involved, set a common objective and DEFINE how each party contributes(or the role each party takes) to the solving of the problem. In order not to get lost in subsequent meetings and ongoing discussion, this common objective, the contribution of each party, milestones and responsibilities need to be written down in what is known to be a Project Charter. Of course, problems with easy-to-identify causes are not subject of these kind of projects.

However, a classical visualization technique that is used in this phase and available in the qualityTools package is the pareto chart. Pareto charts are special forms of bar charts that help to separate the vital few from the trivial many causes for a given problem (e.g.the most frequent cause for a defective product). This way pareto charts visualize how much a cause contributes to a specific failure i.~e. the non-fulfillment of a requirement (nonconformity).

Suppose a company is investigating non conforming units (products). 120 units were investigated and 6 different types of defects (qualitative data) were found. The defects are named A to F. The defects data can be found in defects.

#create artificial defect data set
defects = c(rep("E", 62), rep("B", 15), rep("F", 3), rep("A", 10), rep("C",20), rep("D", 10))

paretoChart(defects)

##                                                     
## Frequency          62    20    15    10    10      3
## Cum. Frequency     62    82    97   107   117    120
## Percentage      51.7% 16.7% 12.5%  8.3%  8.3%   2.5%
## Cum. Percentage 51.7% 68.3% 80.8% 89.2% 97.5% 100.0%
##                                                                       
## Frequency       62.00000 20.00000 15.00000  10.000000  10.000000   3.0
## Cum. Frequency  62.00000 82.00000 97.00000 107.000000 117.000000 120.0
## Percentage      51.66667 16.66667 12.50000   8.333333   8.333333   2.5
## Cum. Percentage 51.66667 68.33333 80.83333  89.166667  97.500000 100.0

This pareto chart might convey the message that in order to solve 68 percent of the problem 33 percent of the causes vital few (the vital few and the trivial many - 20 percent of the defects cause 80 percent of the problems) need to be subject of an investigation. Besides this use case, pareto charts are also used for visualizing the effect sizes of different factors for designed experiments ( see paretoPlot in [IMPROVE] (#/DEFINE) ).

The pareto chart method is available also as web application under webapps.r-qualityTools.org