Sums of Squares

1 One-Way Analysis of Variance

We reformulate the theory of elementary one-way analysis of variance in terms of factors.

We have one factor, F say, with k levels, and the corresponding linear space L_F, as we have seen. In the theory of linear normal models it is assumed that the observations y_1, …, y_n come from independent normally distributed random variables Y_1, …, Y_n which have the same variance but possibly different means: Y_i ~ N(μ_i, σ²). The model for the one-way analysis of variance is that the mean vector μ = (μ_1, …, μ_n) lies in the space L_F, or equivalently that μ = X_F α for some (unspecified) α ∈ ℝ^k.

Once we have established this model we can test hypotheses about it, for example the hypothesis of a uniform mean, i.e. that the factor has no effect on the observations. This is equivalent to saying that μ ∈ L_O, the space corresponding to the null factor.

So the design for this example is Δ = {I, F, O}. To test for a uniform mean we calculate the quantity

(ESS ⁄ (k − 1)) ⁄ (RSS ⁄ (n − k)),

where

ESS = || P_F y − P_O y ||²,    RSS = || y − P_F y ||²
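As a concrete illustration, the test quantity above can be computed directly from the two projections. The following sketch uses made-up data and a hypothetical three-level factor; the projection of y onto L_F is the vector of group means, and the projection onto L_O is the grand mean repeated:

```python
# Hypothetical one-way layout: factor with k = 3 levels, n = 9 observations.
y = [4.1, 3.9, 4.5, 5.2, 5.0, 5.6, 6.1, 6.3, 5.9]
F = [0, 0, 0, 1, 1, 1, 2, 2, 2]   # F(i): level of observation i
n, k = len(y), len(set(F))

grand = sum(y) / n                 # projection onto L_O: grand mean everywhere
means = {f: sum(yi for yi, g in zip(y, F) if g == f) / F.count(f)
         for f in set(F)}          # projection onto L_F: group means

# ESS = || proj_F(y) - proj_O(y) ||^2 ; RSS = || y - proj_F(y) ||^2
ESS = sum((means[f] - grand) ** 2 for f in F)
RSS = sum((yi - means[f]) ** 2 for yi, f in zip(y, F))

F_stat = (ESS / (k - 1)) / (RSS / (n - k))
```

By Pythagoras, ESS + RSS equals the total sum of squares about the grand mean, which gives a quick sanity check on the computation.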

2 Table of Variances

Until now we have looked at factors simply in terms of their abstract qualities, as mappings between finite sets with associated linear spaces and projections, culminating in the orthogonal partition of ℝⁿ determined by the factors of an orthogonal design. So far we have not considered the observations that are categorized by the factors. We will now do so, by considering some important statistics on a set of observations. For the moment we make no assumptions about the nature of the variables (distribution, mean, etc.) and state no hypotheses about them.

Let y = (y_1, …, y_n) be a set of observations, and let Δ be an orthogonal design on {1, …, n}. For any factor F ∈ Δ there is a linear space V_F with an orthogonal projection Q_F (as discussed in Factorial Design).

We define the following quantities:

SSD_F = || Q_F y ||²
SS_F = || P_F y ||²

SSD_F is known as the sum of squares of the deviations, and SS_F as the sum of squares of the factor F. The quantity d_F = dim V_F is the degrees of freedom corresponding to SSD_F. In summary:

d_F = dim V_F
|F| = dim L_F

If Δ is an orthogonal design (i.e. the set of all factors under consideration) on the set of observations, then we can draw up a table containing the values of the SSD and the corresponding degrees of freedom for every factor in the design. We will call this the Table of Variances.
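To make the definitions concrete, here is a sketch for the one-way design Δ = {I, F, O} of Section 1. In that design V_F is the orthogonal complement of L_O inside L_F, so Q_F = P_F − P_O; the data and factor encoding below are hypothetical:

```python
import numpy as np

# Hypothetical data: n = 6 observations, factor F with k = 3 levels.
y = np.array([1.0, 1.5, 2.0, 2.5, 3.5, 4.0])
levels = np.array([0, 0, 1, 1, 2, 2])
n, k = len(y), 3

# X: indicator (design) matrix of F; L_F is its column space.
X = np.zeros((n, k))
X[np.arange(n), levels] = 1.0

# P_F: orthogonal projection onto L_F; P_O: projection onto the constants.
P_F = X @ np.linalg.inv(X.T @ X) @ X.T
P_O = np.full((n, n), 1.0 / n)
Q_F = P_F - P_O            # projection onto V_F for this design

SS_F = float((P_F @ y) @ (P_F @ y))    # || P_F y ||^2
SSD_F = float((Q_F @ y) @ (Q_F @ y))   # || Q_F y ||^2
d_F = round(float(np.trace(Q_F)))      # dim V_F = k - 1
```

Here the trace of the projection Q_F recovers the degrees of freedom d_F = k − 1 = 2, and SSD_F = SS_F − SS_O, since P_O y and (P_F − P_O)y are orthogonal.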

3 Deriving SSD Directly

For the time being we concentrate on deriving the table, without reference to its use in estimation and hypothesis testing.

From the theory of 2-sided ANOVA we know that:

SS_F = || P_F y ||² = ∑_{f ∈ F} S_f² ⁄ n_f

where

S_f = ∑_{i : F(i) = f} y_i,    n_f = #{ i | F(i) = f }
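The group-sums formula can be checked numerically against the projection form || P_F y ||², which replaces each observation by its group mean. A minimal sketch with made-up data:

```python
# Hypothetical observations and factor levels.
y = [5.0, 6.0, 7.0, 9.0, 10.0]
F = [0, 0, 1, 1, 1]   # F(i): level of observation i

# Collect the observations of each level f.
groups = {}
for yi, f in zip(y, F):
    groups.setdefault(f, []).append(yi)

# Group-sums form: SS_F = sum over f of S_f^2 / n_f.
SS_sums = sum(sum(g) ** 2 / len(g) for g in groups.values())

# Projection form: P_F y has the group mean in each coordinate.
means = {f: sum(g) / len(g) for f, g in groups.items()}
SS_proj = sum(means[f] ** 2 for f in F)   # || P_F y ||^2
```

Both expressions give the same value, since ∑_i mean_f² over a group of size n_f equals n_f (S_f ⁄ n_f)² = S_f² ⁄ n_f.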

We know that

Q_F = ∑_{G ≤ F} α_FG P_G

so

SSD_F = || Q_F y ||² = yᵀ Q_F y
      = ∑_{G ≤ F} α_FG yᵀ P_G y = ∑_{G ≤ F} α_FG || P_G y ||²
      = ∑_{G ≤ F} α_FG SS_G

Similarly,

d_F = tr(Q_F) = tr( ∑_{G ≤ F} α_FG P_G )
    = ∑_{G ≤ F} α_FG tr(P_G) = ∑_{G ≤ F} α_FG |G|

SSD_F = ∑_{G ≤ F} α_FG SS_G

d_F = ∑_{G ≤ F} α_FG |G|

4 Deriving SSD Recursively

The above formulae are explicit but not very useful for computation, since a priori we do not know the values of the α_FG. To find a more useful algorithm we use the previously derived formula for P_F in terms of the Q_G:

SS_F = || P_F y ||² = || ∑_{G ≤ F} Q_G y ||²
     = ∑_{G ≤ F} || Q_G y ||²    (since the spaces V_G are mutually orthogonal)
     = ∑_{G ≤ F} SSD_G

SS_F = ∑_{G ≤ F} SSD_G

SSD_F = SS_F − ∑_{G < F} SSD_G

Similarly

|F| = dim L_F = ∑_{G ≤ F} dim V_G = ∑_{G ≤ F} d_G

|F| = ∑_{G ≤ F} d_G

d_F = |F| − ∑_{G < F} d_G

With the above formulae it is possible to work recursively, starting from the coarsest factor, C say (which will be the null factor in a balanced design), since SSD_C = SS_C and d_C = |C|.
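The recursion can be turned into a short program. The sketch below computes the table of variances for the balanced one-way design Δ = {I, F, O}; the data, the factor encoding as level tuples, and the helper names are all hypothetical:

```python
# Hypothetical balanced one-way layout: n = 6 units, factor F with 3 levels.
y = [2.0, 2.4, 3.1, 2.9, 4.0, 4.4]
factors = {
    "O": (0, 0, 0, 0, 0, 0),   # null factor: every unit in one level
    "F": (0, 0, 1, 1, 2, 2),   # treatment factor
    "I": (0, 1, 2, 3, 4, 5),   # identity factor: one level per unit
}

def SS(levels):
    """SS via the group-sums formula: sum over levels f of S_f^2 / n_f."""
    total = 0.0
    for f in set(levels):
        group = [y[i] for i, g in enumerate(levels) if g == f]
        total += sum(group) ** 2 / len(group)
    return total

def coarser(G, F):
    """G <= F in the factor ordering: F refines G."""
    n = len(F)
    return all(G[i] == G[j] for i in range(n) for j in range(n) if F[i] == F[j])

def strictly_below(name):
    """Names of factors G in the design with G < F (coarser, not equal)."""
    F = factors[name]
    return [g for g, G in factors.items() if coarser(G, F) and not coarser(F, G)]

def SSD(name):
    """SSD_F = SS_F - sum of SSD_G over G < F; bottoms out at the coarsest factor."""
    return SS(factors[name]) - sum(SSD(g) for g in strictly_below(name))

def dof(name):
    """d_F = |F| - sum of d_G over G < F."""
    return len(set(factors[name])) - sum(dof(g) for g in strictly_below(name))

# Table of variances: one (SSD, degrees of freedom) row per factor.
table = {name: (SSD(name), dof(name)) for name in factors}
```

For the coarsest factor O the recursion terminates immediately with SSD_O = SS_O and d_O = |O| = 1, and the SSD rows sum to SS_I = ∑ y_i², reflecting the orthogonal partition of ℝⁿ.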