3.2 A
-Optimality Introduction
The second measure of optimality we shall consider is \(A\)-optimality. This is where we seek to minimise the average variance of our estimators.
If you recall from our section on regression, we claimed that the variance of an estimator is a measure of its precision. We also established a formula for the variance of the regression coefficients, \(\boldsymbol{\hat{\beta}}\) using the design matrix, \(\mathbf{F}\):
\[\text{Var}(\boldsymbol{\hat{\beta}}) = \sigma^2 \left(\mathbf{F}^T \mathbf{F}\right)^{-1}\]where the design matrix is given by
\[\mathbf{F} = \begin{pmatrix} 1 & x_1 \\ 1 & x_2 \\ 1 & x_3 \\ \vdots & \vdots \\ 1 & x_n \end{pmatrix}\]Therefore, our Fisher information matrix becomes:
\[\begin{align} \mathbf{I} &= \mathbf{F}^T \mathbf{F} \\ \\ & = \begin{pmatrix} 1 & 1 & 1 & \ldots & 1\\ x_1 & x_2 & x_3 & \ldots & x_n \end{pmatrix} \begin{pmatrix} 1 & x_1 \\ 1 & x_2 \\ 1 & x_3 \\ \vdots & \vdots \\ 1 & x_n \end{pmatrix} \\ \\ &= \begin{pmatrix} n & \sum_{i=1}^{n} x_i \\ \sum_{i=1}^{n} x_i & \sum_{i=1}^{n} x_i^2 \end{pmatrix} \end{align}\]We should also recall that the \(j\)-th diagonal element of the inverse Fisher information matrix is proportional to the variance of the \(j\)-th estimator, \(\hat{\beta_j}\). Therefore, in order to find the average variance of our estimators we should compute the trace of the inverse information matrix.
To show this let’s compute the inverse Fisher information matrix:
\[\begin{align} \mathbf{I}^{-1} &= \frac{1}{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2} \begin{pmatrix} \sum_{i=1}^{n} x_i^2 & -\sum_{i=1}^{n} x_i \\ -\sum_{i=1}^{n} x_i & n \end{pmatrix} \end{align}\]Now, compute its trace (recall that the trace of a matrix is just the sum of its diagonal elements):
\[\text{Trace}(\mathbf{I}^{-1}) = \frac{\sum_{i=1}^{n} x_i^2 + n}{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2}\]This is the definition of \(A\)-optimality; the precision of the estimators very much depends on your choice of design points, \(\left\{x_i\right\}\), and the number of design points in your experiment, \(n\).