<?xml-stylesheet type="text/xsl" href="../styles/pmathml.xsl"?>
<!-- saved from url=(0014)about:internet -->
<html xmlns="http://www.w3.org/1999/xhtml" xmlns:dsi="http://www.w3.org/1999/xlink" xmlns:m="http://www.w3.org/1998/Math/MathML" xml:space="preserve"><head><meta http-equiv="Content-Type" content="text/html; charset=US-ASCII"/><title>G07 Chapter Introduction : NAG Library, Mark 22</title><link rel="stylesheet" href="../styles/libdoc.css" type="text/css"/><script type="text/javascript">
   function showLevel(_levelId){
    var thisLevel = document.getElementById(_levelId);
    var thisplus = document.getElementById( _levelId.concat('plus'));
    var thisminus = document.getElementById( _levelId.concat('minus'));
    if(thisLevel.style.display != "block"){
     thisLevel.style.display = "block";
     thisplus.style.display = "none";
     thisminus.style.display = "inline";
     }
    else{
     thisLevel.style.display = "none";
     thisminus.style.display = "none";
     thisplus.style.display = "inline";
     }
    }
  </script></head><body><hr/><div><a class="chap" href="g07conts.xml">G07 Chapter Contents</a></div><div><a class="chapint" href="../../pdf/G07/g07intro.pdf">G07 Chapter Introduction (PDF version)</a></div>
<div><a class="htmltoc" href="../FRONTMATTER/manconts.xml">NAG Library Manual</a></div><hr/><h1 class="libdoc">NAG Library Chapter Introduction<br/><br/>G07 &#8211; Univariate Estimation</h1><div class="htmltoc">
<h2 class="htmltoc"><span class="htmltochead" onclick="showLevel('htmltoc');"><span class="htmltocplus" id="htmltocplus">+</span><span class="htmltocminus" id="htmltocminus">&#8722;</span></span>&#160;Contents</h2>
<div class="htmltocitem" id="htmltoc">
<div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#scope">1&#160;&#160;<b>Scope of the Chapter</b></a>
</div><div class="htmltoc">
<span class="htmltoc" onclick="showLevel('tocbackground');"><span class="htmltocplus" id="tocbackgroundplus">+</span><span class="htmltocminus" id="tocbackgroundminus">&#8722;</span></span>
<a class="htmltoc" href="#background">2&#160;&#160;<b>Background to the Problems</b></a>
<div class="htmltocitem" id="tocbackground">
<div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#intbackground1">2.1&#160;&#160;<b>Maximum Likelihood Estimation</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#intbackground2">2.2&#160;&#160;<b>Confidence Intervals</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#intbackground3">2.3&#160;&#160;<b>Robust Estimation</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#intbackground4">2.4&#160;&#160;<b>Robust Confidence Intervals</b></a>
</div>
</div>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#available">3&#160;&#160;<b>Recommendations on Choice and Use of Available Routines</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#index">4&#160;&#160;<b>Index</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#withdrawn">5&#160;&#160;<b>Routines Withdrawn or Scheduled for Withdrawal</b></a>
</div><div class="htmltoc">
<span class="htmltocplus">&#160;&#160;&#160;</span>
<a class="htmltoc" href="#references">6&#160;&#160;<b>References</b></a>
</div>
</div>
</div><h2 class="standard"><a class="sec" name="scope" id="scope"/>1&#160;&#160;Scope of the Chapter</h2>
<div class="paramtext">This chapter deals with the estimation of unknown parameters of a univariate distribution.  It includes both point and interval estimation using maximum likelihood and robust methods.</div><h2 class="standard"><a class="sec" name="background" id="background"/>2&#160;&#160;Background to the Problems</h2>
<div class="paramtext">Statistical inference is concerned with the making of inferences about a <b>population</b> using the observed part of the population called a <b>sample</b>.  The population can usually be described using a probability model which will be written in terms of some unknown <b>parameters</b>.  For example, the hours of relief given by a drug may be assumed to follow a Normal distribution with mean <m:math><m:mi>&#956;</m:mi></m:math>&#160;and variance <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>; it is then required to make inferences about the parameters, <m:math><m:mi>&#956;</m:mi></m:math>&#160;and <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>, on the basis of an observed sample of relief times.</div><div class="paramtext">There are two main aspects of statistical inference: the  <b>estimation</b> of the parameters and the <b>testing of hypotheses</b> about the parameters.  In the example above, the values of the parameter <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>&#160;may be estimated and the hypothesis that <m:math><m:mi>&#956;</m:mi><m:mo>&#8805;</m:mo><m:mn>3</m:mn></m:math>&#160;tested.  This chapter is mainly concerned with estimation but the test of a hypothesis about a parameter is often closely linked to its estimation.  Tests of hypotheses which are not linked closely to estimation are given in the chapter on nonparametric statistics (<a class="chap" href="../G08/g08conts.xml">Chapter G08</a>).</div><div class="paramtext">There are two types of estimation to be considered in this chapter:  <b>point estimation</b> and <b>interval estimation</b>.  Point estimation is when a single value is obtained as the best estimate of the parameter.  However, as this estimate will be based on only one of a large number of possible samples, it can be seen that if a different sample were taken, a different estimate would be obtained.  The distribution of the estimate across all the possible samples is known as the <b>sampling distribution</b>.  The sampling distribution contains information on the performance of the estimator, and enables estimators to be compared.  For example, a good estimator would have a sampling distribution with mean equal to the true value of the parameter; that is, it should be an <b>unbiased</b> estimator; also the variance of the sampling distribution should be as small as possible.  When considering a parameter estimate it is important to consider its variability as measured by its variance, or more often the square root of the variance, the  <b>standard error</b>.</div><div class="paramtext">The sampling distribution can be used to find interval estimates or confidence intervals for the parameter.  A <b>confidence interval</b>  is an interval calculated from the sample so that its distribution, as given by the sampling distribution, is such that it contains the true value of the parameter with a certain probability.</div><div class="paramtext">Estimates will be functions of the observed sample and these functions are known as <b>estimators</b>.  It is usually more convenient for the estimator to be based on statistics from the sample rather than all the individuals observations.  If these statistics contain all the relevant information then they are known as <b>sufficient statistics</b>.  There are several ways of obtaining the estimators; these include least-squares, the method of moments, and <b>maximum likelihood</b>.  Least-squares estimation requires no knowledge of the distributional form of the error apart from its mean and variance matrix, whereas the method of maximum likelihood is mainly applicable to situations in which the true distribution is known apart from the values of a finite number of unknown parameters.  Note that under the assumption of Normality, the least-squares estimation is equivalent to the maximum likelihood estimation.  Least squares is often used in regression analysis as described in  <a class="chap" href="../G02/g02conts.xml">Chapter G02</a>, and maximum likelihood is described below.</div><div class="paramtext">Estimators derived from least-squares or maximum likelihood will often be greatly affected by the presence of extreme or unusual observations.  Estimators that are designed to be less affected are known as <b>robust estimators</b>.</div><h3 class="standard"><a class="sec" name="intbackground1" id="intbackground1"/>2.1&#160;&#160;Maximum Likelihood Estimation</h3>
<div class="paramtext">Let <m:math><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:math>&#160;be a univariate random variable with probability density function

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:msub><m:mi>f</m:mi><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:msub><m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mi>&#952;</m:mi></m:mfenced><m:mtext>,</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>

where <m:math><m:mi>&#952;</m:mi></m:math>&#160;is a vector of length <m:math><m:mi>p</m:mi></m:math>&#160;consisting of the unknown parameters.  For example, a Normal distribution with mean <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;and standard deviation  <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:math>&#160;has probability density function

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mfrac><m:mn>1</m:mn><m:mrow><m:msqrt><m:mn>2</m:mn><m:mi>&#960;</m:mi></m:msqrt><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:mrow>
 </m:mfrac><m:mrow><m:mi>exp</m:mi><m:mfenced separators=""><m:mo>-</m:mo><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac>
<m:msup>
<m:mfenced separators=""><m:mfrac><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:mrow><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:mfrac></m:mfenced>
<m:mn>2</m:mn></m:msup></m:mfenced></m:mrow>
<m:mtext>.</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>


The likelihood for a sample of <m:math><m:mi>n</m:mi></m:math>&#160;independent observations is

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mi mathvariant="normal">Like</m:mi><m:mo>=</m:mo><m:munderover><m:mo>&#8719;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:msub><m:mi>f</m:mi><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:msub>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mi>&#952;</m:mi></m:mfenced>
<m:mtext>,</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>

where <m:math><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub></m:math>&#160;is the observed value of <m:math><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:math>.  If each <m:math><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:math>&#160;has an identical distribution, this reduces to

<div class="formula-eqn"><a name="eqn1" id="eqn1"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:mi mathvariant="normal">Like</m:mi><m:mo>=</m:mo><m:munderover><m:mo>&#8719;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:msub><m:mi>f</m:mi><m:mi>X</m:mi></m:msub>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mi>&#952;</m:mi></m:mfenced>
<m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (1)
     </td></tr></table></div>

and the log-likelihood is

<div class="formula-eqn"><a name="eqn2" id="eqn2"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:mi mathvariant="normal">Like</m:mi></m:mfenced></m:mrow><m:mo>=</m:mo><m:mi>L</m:mi><m:mo>=</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:msub><m:mi>f</m:mi><m:mi>X</m:mi></m:msub><m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mi>&#952;</m:mi></m:mfenced></m:mfenced></m:mrow><m:mtext>.</m:mtext>
</m:math></td><td class="formula-eqn2">
      (2)
     </td></tr></table></div>

The maximum likelihood estimates (<m:math><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover></m:math>) of  <m:math><m:mi>&#952;</m:mi></m:math>&#160;are the values of <m:math><m:mi>&#952;</m:mi></m:math>&#160;that maximize <a class="eqn" href="#eqn1">(1)</a> and <a class="eqn" href="#eqn2">(2)</a>.  If the range of <m:math><m:mi>X</m:mi></m:math>&#160;is independent of the parameters, then  <m:math><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover></m:math>&#160;can usually be found as the solution to

<div class="formula-eqn"><a name="eqn3" id="eqn3"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mfrac other="display">
 <m:mo>&#8706;</m:mo><m:mrow><m:mo>&#8706;</m:mo><m:msub><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:mi>j</m:mi></m:msub></m:mrow>
 </m:mfrac><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:msub><m:mi>f</m:mi><m:mi>X</m:mi></m:msub><m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover></m:mfenced></m:mfenced></m:mrow><m:mo>=</m:mo><m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msub><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:mi>j</m:mi></m:msub></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mn>0</m:mn><m:mtext>, &#8195;</m:mtext><m:mi>j</m:mi><m:mo>=</m:mo><m:mn>1</m:mn><m:mo>,</m:mo><m:mn>2</m:mn><m:mo>,</m:mo><m:mo>&#8230;</m:mo><m:mo>,</m:mo><m:mi>p</m:mi><m:mtext>.</m:mtext>
</m:math></td><td class="formula-eqn2">
      (3)
     </td></tr></table></div>

Note that <m:math>
 <m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msub><m:mi>&#952;</m:mi><m:mi>j</m:mi></m:msub></m:mrow>
 </m:mfrac>
</m:math>&#160;is known as the efficient score.</div><div class="paramtext">Maximum likelihood estimators possess several important properties.
<table class="standard-100"><tr>
<td style="width:2.1em;" valign="baseline">(a)</td>
<td valign="top">Maximum likelihood estimators are functions of the sufficient statistics.</td>
</tr><tr>
<td style="width:2.1em;" valign="baseline">(b)</td>
<td valign="top">Maximum likelihood estimators are (under certain conditions) <b>consistent</b>. That is, the estimator converges in probability to the true value as the sample size increases. Note that for small samples the maximum likelihood estimator may be biased.</td>
</tr><tr>
<td style="width:2.1em;" valign="baseline">(c)</td>
<td valign="top">For maximum likelihood estimators found as a solution to  <a class="eqn" href="#eqn3">(3)</a>, subject to certain conditions, it follows that

<div class="formula-eqn"><a name="eqn4" id="eqn4"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:mi>E</m:mi>
<m:mfenced separators=""><m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>&#952;</m:mi></m:mrow>
 </m:mfrac></m:mfenced><m:mo>=</m:mo><m:mn>0</m:mn><m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (4)
     </td></tr></table></div>

and

<div class="formula-eqn"><a name="eqn5" id="eqn5"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:mi>I</m:mi><m:mfenced separators=""><m:mi>&#952;</m:mi></m:mfenced><m:mo>=</m:mo><m:mo>-</m:mo><m:mi>E</m:mi>
<m:mfenced separators="">
 <m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac></m:mfenced><m:mo>=</m:mo><m:mi>E</m:mi>
<m:mfenced separators=""><m:msup>
<m:mfenced separators=""><m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>&#952;</m:mi></m:mrow>
 </m:mfrac></m:mfenced>
<m:mn>2</m:mn></m:msup></m:mfenced>
<m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (5)
     </td></tr></table></div>

and then that <m:math><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover></m:math>&#160;is asymptotically Normal with mean vector <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub></m:math>&#160;and variance-covariance matrix  <m:math><m:msubsup><m:mi>I</m:mi><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub><m:mrow><m:mo>-</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup></m:math>&#160;where <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub></m:math>&#160;denotes the true value of <m:math><m:mi>&#952;</m:mi></m:math>.  The matrix <m:math><m:msub><m:mi>I</m:mi><m:mi>&#952;</m:mi></m:msub></m:math>&#160;is known as the information matrix and <m:math><m:msubsup><m:mi>I</m:mi><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub><m:mrow><m:mo>-</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup></m:math>&#160;is known as the Cramer&#8211;Rao lower bound for the variance of an estimator of <m:math><m:mi>&#952;</m:mi></m:math>.</td>
</tr></table>
</div><div class="paramtext">For example, if we consider a sample, <m:math><m:msub><m:mi>x</m:mi><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:msub><m:mi>x</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mo>&#8230;</m:mo><m:mo>,</m:mo><m:msub><m:mi>x</m:mi><m:mi>n</m:mi></m:msub></m:math>,  of size <m:math><m:mi>n</m:mi></m:math>&#160;drawn from a Normal distribution with unknown mean  <m:math><m:mi>&#956;</m:mi></m:math>&#160;and unknown variance <m:math><m:msub><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msub></m:math>&#160;then we have

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mi>L</m:mi><m:mo>=</m:mo><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:mi mathvariant="normal">Like</m:mi><m:mfenced separators=""><m:mi>&#956;</m:mi><m:mo>,</m:mo><m:mrow><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup><m:mo>;</m:mo><m:mi>x</m:mi></m:mrow></m:mfenced></m:mfenced></m:mrow><m:mo>=</m:mo><m:mo>-</m:mo><m:mfrac><m:mi>n</m:mi><m:mn>2</m:mn></m:mfrac><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:mn>2</m:mn><m:mi>&#960;</m:mi></m:mfenced></m:mrow><m:mo>-</m:mo><m:mfrac><m:mi>n</m:mi><m:mn>2</m:mn></m:mfrac><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mfenced></m:mrow><m:mo>-</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:msup>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:mi>&#956;</m:mi></m:mfenced>
<m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup>
</m:math></td><td class="formula2"/></tr></table></div>

and thus

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>&#956;</m:mi></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo> <m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover> <m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:mi>&#956;</m:mi></m:mfenced><m:mo>/</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup>
</m:math></td><td class="formula2"/></tr></table></div>

and

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mfrac other="display">
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mo>-</m:mo><m:mfrac><m:mi>n</m:mi><m:mrow><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>+</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:msup>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:mi>&#956;</m:mi></m:mfenced>
<m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>4</m:mn></m:msup><m:mtext>.</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>


Then equating these two equations to zero and solving gives the maximum likelihood estimates

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mover><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mo>=</m:mo><m:mover><m:mi>x</m:mi><m:mo>-</m:mo></m:mover>
</m:math></td><td class="formula2"/></tr></table></div>

and

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:msup><m:mover><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mn>2</m:mn></m:msup><m:mo>=</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:msup>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:mover><m:mi>x</m:mi><m:mo>-</m:mo></m:mover></m:mfenced>
<m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mi>n</m:mi><m:mtext>.</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>

These maximum likelihood estimates are asymptotically Normal with mean vector <m:math><m:mi>a</m:mi></m:math>, where

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:msup><m:mi>a</m:mi><m:mi mathvariant="normal">T</m:mi></m:msup><m:mo>=</m:mo><m:mfenced separators=""><m:mi>&#956;</m:mi><m:mo>,</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mfenced><m:mtext>,</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>

and covariance matrix <m:math><m:mi>C</m:mi></m:math>.  To obtain <m:math><m:mi>C</m:mi></m:math>&#160;we find the second derivatives of <m:math><m:mi>L</m:mi></m:math>&#160;with respect to <m:math><m:mi>&#956;</m:mi></m:math>&#160;and <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>&#160;as follows:

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mtable>
 <m:mtr>
  <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#956;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mo>-</m:mo><m:mfrac other="display">
  <m:mi>n</m:mi><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mfrac></m:mtd>
 </m:mtr><m:mtr>
  <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mfenced separators=""><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mfenced><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mfrac other="display">
  <m:mi>n</m:mi><m:mrow><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>4</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>-</m:mo><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover>
<m:msup>
<m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:mi>&#956;</m:mi></m:mfenced>
<m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>6</m:mn></m:msup></m:mtd>
 </m:mtr><m:mtr>
  <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>&#956;</m:mi><m:mo>&#8706;</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup><m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup><m:mo>&#8706;</m:mo><m:mi>&#956;</m:mi></m:mrow>
 </m:mfrac><m:mo>=</m:mo><m:mo>-</m:mo><m:mfrac other="display">
  <m:mrow><m:mi>n</m:mi><m:mfenced separators=""><m:mover><m:mi>x</m:mi><m:mo>-</m:mo></m:mover><m:mo>-</m:mo><m:mi>&#956;</m:mi></m:mfenced></m:mrow><m:msup><m:mi>&#963;</m:mi><m:mn>4</m:mn></m:msup></m:mfrac><m:mtext>.</m:mtext></m:mtd>
 </m:mtr>
</m:mtable>
</m:math></td><td class="formula2"/></tr></table></div>

Then

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:msup><m:mi>C</m:mi><m:mrow><m:mo>-</m:mo><m:mn>1</m:mn></m:mrow></m:msup><m:mo>=</m:mo><m:mo>-</m:mo><m:mi>E</m:mi>
<m:mfenced><m:mtable>
  <m:mtr>
   <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup> <m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#956;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac></m:mtd>
   <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup> <m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup><m:mo>&#8706;</m:mo><m:mi>&#956;</m:mi></m:mrow>
 </m:mfrac></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup> <m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:mi>&#956;</m:mi><m:mo>&#8706;</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac></m:mtd>
   <m:mtd><m:mfrac other="display">
  <m:mrow><m:msup><m:mo>&#8706;</m:mo><m:mn>2</m:mn></m:msup> <m:mi>L</m:mi></m:mrow>
  <m:mrow><m:mo>&#8706;</m:mo><m:msup><m:mfenced separators=""><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mfenced><m:mn>2</m:mn></m:msup></m:mrow>
 </m:mfrac></m:mtd>
  </m:mtr>
 </m:mtable></m:mfenced><m:mo>=</m:mo><m:mfenced><m:mtable>
  <m:mtr>
   <m:mtd><m:mi>n</m:mi><m:mo>/</m:mo><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:mtd>
   <m:mtd><m:mn>0</m:mn></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mn>0</m:mn></m:mtd>
   <m:mtd><m:mi>n</m:mi><m:mo>/</m:mo><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>4</m:mn></m:msup></m:mtd>
  </m:mtr>
 </m:mtable></m:mfenced>
</m:math></td><td class="formula2"/></tr></table></div>

so that

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mi>C</m:mi><m:mo>=</m:mo><m:mfenced><m:mtable>
  <m:mtr>
   <m:mtd><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mi>n</m:mi></m:mtd>
   <m:mtd><m:mn>0</m:mn></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mn>0</m:mn></m:mtd>
   <m:mtd><m:mn>2</m:mn><m:msup><m:mi>&#963;</m:mi><m:mn>4</m:mn></m:msup><m:mo>/</m:mo><m:mi>n</m:mi></m:mtd>
  </m:mtr>
 </m:mtable></m:mfenced>
<m:mtext>.</m:mtext>
</m:math></td><td class="formula2"/></tr></table></div>

To obtain an estimate of <m:math><m:mi>C</m:mi></m:math>&#160;the matrix may be evaluated at the maximum likelihood estimates.</div><div class="paramtext">It may not always be possible to find maximum likelihood estimates in a convenient closed form, and in these cases iterative numerical methods, such as the Newton&#8211;Raphson procedure or the EM algorithm (expectation maximization), will be necessary to compute the maximum likelihood estimates.  Their asymptotic variances and covariances may then be found by substituting the estimates into the second derivatives.  Note that it may be difficult to find the expected value of the second derivatives required for the variance-covariance matrix and in these cases the observed value of the second derivatives is often used.</div><div class="paramtext">The use of maximum likelihood estimation allows the construction of generalized likelihood ratio tests.  If <m:math><m:mi>&#955;</m:mi><m:mo>=</m:mo><m:mn>2</m:mn><m:mfenced separators=""><m:msub><m:mi>l</m:mi><m:mn>1</m:mn></m:msub><m:mo>-</m:mo><m:msub><m:mi>l</m:mi><m:mn>2</m:mn></m:msub></m:mfenced></m:math>, where <m:math><m:msub><m:mi>l</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;is the maximized log-likelihood function for a model <m:math><m:mn>1</m:mn></m:math>&#160;and <m:math><m:msub><m:mi>l</m:mi><m:mn>2</m:mn></m:msub></m:math>&#160;is the maximized log-likelihood function for a model <m:math><m:mn>2</m:mn></m:math>, then under the hypothesis that model <m:math><m:mn>2</m:mn></m:math>&#160;is correct,   <m:math><m:mn>2</m:mn><m:mi>&#955;</m:mi></m:math>&#160;is asymptotically distributed as a  <m:math><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:math>&#160;variable with <m:math><m:mi>p</m:mi><m:mo>-</m:mo><m:mi>q</m:mi></m:math>&#160;degrees of freedom.  Consider two models in which model <m:math><m:mn>1</m:mn></m:math>&#160;has <m:math><m:mi>p</m:mi></m:math>&#160;parameters and model <m:math><m:mn>2</m:mn></m:math>&#160;is a sub-model (nested model) of model <m:math><m:mn>1</m:mn></m:math>&#160;with <m:math><m:mi>q</m:mi><m:mo>&lt;</m:mo><m:mi>p</m:mi></m:math>&#160;parameters, that is model <m:math><m:mn>1</m:mn></m:math>&#160;has an extra  <m:math><m:mi>p</m:mi><m:mo>-</m:mo><m:mi>q</m:mi></m:math>&#160;parameters.  This result provides a useful method for performing hypothesis tests on the parameters.  Alternatively, tests exist based on the asymptotic  Normality of the estimator and the efficient score; see page 315 of <a class="ref" href="#ref531">Cox and Hinkley (1974)</a>.</div><h3 class="standard"><a class="sec" name="intbackground2" id="intbackground2"/>2.2&#160;&#160;Confidence Intervals</h3>
<div class="paramtext">Suppose we can find a function, <m:math><m:mi>t</m:mi><m:mfenced separators=""><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>&#952;</m:mi></m:mfenced></m:math>, whose distribution depends upon the sample <m:math><m:mi>x</m:mi></m:math>&#160;but not on the unknown parameter <m:math><m:mi>&#952;</m:mi></m:math>, and which is a monotonic (say decreasing) function in <m:math><m:mi>&#952;</m:mi></m:math>&#160;for each <m:math><m:mi>x</m:mi></m:math>, then we can find <m:math><m:msub><m:mi>t</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;such that <m:math>
 <m:mi>P</m:mi>
 <m:mfenced separators="">
  <m:msub>
   <m:mi>t</m:mi>
   <m:mn>1</m:mn>
  </m:msub>
  <m:mo>&#8804;</m:mo>
  <m:mi>t</m:mi>
  <m:mfenced separators=""><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>&#952;</m:mi></m:mfenced>
 </m:mfenced>
 <m:mo>=</m:mo>
 <m:mn>1</m:mn>
 <m:mo>-</m:mo>
 <m:mi>&#945;</m:mi>
</m:math>&#160;no matter what <m:math><m:mi>&#952;</m:mi></m:math>&#160;happens to be.  The function <m:math><m:mi>t</m:mi><m:mfenced separators=""><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>&#952;</m:mi></m:mfenced></m:math>&#160;is known as a pivotal quantity.  Since the function is monotonic the statement that <m:math><m:msub><m:mi>t</m:mi><m:mn>1</m:mn></m:msub><m:mo>&#8804;</m:mo><m:mi>t</m:mi><m:mfenced separators=""><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>&#952;</m:mi></m:mfenced></m:math>&#160;may be rewritten as <m:math><m:mi>&#952;</m:mi><m:mo>&#8805;</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>&#160;see <a class="fig" href="#G07INT1">Figure 1</a>.  The statistic <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>&#160;will vary from sample to sample and if we assert that <m:math><m:mi>&#952;</m:mi><m:mo>&#8805;</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>&#160;for any sample values which arise, we will be right in a proportion <m:math><m:mn>1</m:mn><m:mo>-</m:mo><m:mi>&#945;</m:mi></m:math>&#160;of the cases, in the long run or on average.  We call <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>&#160;a  <m:math><m:mn>1</m:mn><m:mo>-</m:mo><m:mi>&#945;</m:mi></m:math>&#160;upper confidence limit for <m:math><m:mi>&#952;</m:mi></m:math>.</div><div class="paramtext">
<div class="figure"><a name="G07INT1" id="G07INT1"/><img src="../figures/G07INT1fl17.png" style="height: 20em" alt="Figure 1"/></div><div class="figure"><b>Figure 1</b></div>
We have considered only an upper confidence limit.  The above idea may be generalized to a two-sided confidence interval where two quantities,  <m:math><m:msub><m:mi>t</m:mi><m:mn>0</m:mn></m:msub></m:math>&#160;and <m:math><m:msub><m:mi>t</m:mi><m:mn>1</m:mn></m:msub></m:math>, are found such that for all <m:math><m:mi>&#952;</m:mi></m:math>,  <m:math>
 <m:mi mathvariant="normal">P</m:mi>
 <m:mfenced separators="">
  <m:msub>
   <m:mi>t</m:mi>
   <m:mn>1</m:mn>
  </m:msub>
  <m:mo>&#8804;</m:mo>
  <m:mi>t</m:mi>
  <m:mfenced separators=""><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>&#952;</m:mi></m:mfenced>
  <m:mo>&#8804;</m:mo>
  <m:msub>
   <m:mi>t</m:mi>
   <m:mn>0</m:mn>
  </m:msub>
 </m:mfenced>
 <m:mo>=</m:mo>
 <m:mn>1</m:mn>
 <m:mo>-</m:mo>
 <m:mi>&#945;</m:mi>
</m:math>.  This interval may be rewritten as  <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced><m:mo>&#8804;</m:mo><m:mi>&#952;</m:mi><m:mo>&#8804;</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>.  Thus if we assert that <m:math><m:mi>&#952;</m:mi></m:math>&#160;lies in the interval  [<m:math><m:msub><m:mi>&#952;</m:mi><m:mn>0</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced><m:mo>,</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:mi>x</m:mi></m:mfenced></m:math>] we will be right on average in <m:math><m:mn>1</m:mn><m:mo>-</m:mo><m:mi>&#945;</m:mi></m:math>&#160;proportion of the times under repeated sampling.</div><div class="paramtext">Hypothesis (significance) tests on the parameters may be used to find these confidence limits.  For example, if we observe a value, <m:math><m:mi>k</m:mi></m:math>, from a binomial distribution, with known parameter <m:math><m:mi>n</m:mi></m:math>&#160;and unknown parameter <m:math><m:mi>p</m:mi></m:math>, then to find the lower confidence limit we find <m:math><m:msub><m:mi>p</m:mi><m:mi>l</m:mi></m:msub></m:math>&#160;such that the probability that the null hypothesis <m:math><m:msub><m:mi>H</m:mi><m:mn>0</m:mn></m:msub></m:math>: <m:math><m:mi>p</m:mi><m:mo>=</m:mo><m:msub><m:mi>p</m:mi><m:mi>l</m:mi></m:msub></m:math>&#160;(against the one sided alternative that  <m:math><m:mi>p</m:mi><m:mo>&gt;</m:mo><m:msub><m:mi>p</m:mi><m:mi>l</m:mi></m:msub></m:math>) will be rejected, is less than or equal to <m:math><m:mi>&#945;</m:mi><m:mo>/</m:mo><m:mn>2</m:mn></m:math>.  Thus for a binomial random variable, <m:math><m:mi>B</m:mi></m:math>, with parameters <m:math><m:mi>n</m:mi></m:math>&#160;and <m:math><m:msub><m:mi>p</m:mi><m:mi>l</m:mi></m:msub></m:math>&#160;we require that <m:math><m:mi>P</m:mi><m:mfenced separators=""><m:mi>B</m:mi><m:mo>&#8805;</m:mo><m:mi>k</m:mi></m:mfenced><m:mo>&#8804;</m:mo><m:mi>&#945;</m:mi><m:mo>/</m:mo><m:mn>2</m:mn></m:math>.  The upper confidence limit, <m:math><m:msub><m:mi>p</m:mi><m:mi>u</m:mi></m:msub></m:math>, can be constructed in a similar way.</div><div class="paramtext">For large samples the asymptotic Normality of the maximum likelihood estimates discussed above is used to construct confidence intervals for the unknown parameters.</div><h3 class="standard"><a class="sec" name="intbackground3" id="intbackground3"/>2.3&#160;&#160;Robust Estimation</h3>
<div class="paramtext">For particular cases the probability density function can be written as

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:msub><m:mi>f</m:mi><m:msub><m:mi>X</m:mi><m:mi>i</m:mi></m:msub></m:msub><m:mfenced separators=""><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>;</m:mo><m:mi>&#952;</m:mi></m:mfenced><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:mfrac><m:mi>g</m:mi>
<m:mfenced separators=""><m:mfrac><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:mrow><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:mfrac></m:mfenced>
</m:math></td><td class="formula2"/></tr></table></div>

for a suitable function g; then <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;is known as a location parameter and <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>2</m:mn></m:msub></m:math>, usually written as <m:math><m:mi>&#963;</m:mi></m:math>, is known as a scale parameter.  This is true of the Normal distribution.</div><div class="paramtext">If <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;is a location parameter, as described above,  then equation <a class="eqn" href="#eqn3">(3)</a> becomes

<div class="formula-eqn"><a name="eqn6" id="eqn6"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mi>&#968;</m:mi>
<m:mfenced separators=""><m:mfrac><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:msub><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:mn>1</m:mn></m:msub></m:mrow><m:mover><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover></m:mfrac></m:mfenced><m:mo>=</m:mo><m:mn>0</m:mn><m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (6)
     </td></tr></table></div>

where <m:math><m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:mo>-</m:mo><m:mfrac other="display">
  <m:mi>d</m:mi><m:mrow><m:mi>d</m:mi><m:mi>z</m:mi></m:mrow>
 </m:mfrac><m:mrow><m:mi>log</m:mi><m:mfenced separators=""><m:mi>g</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced></m:mfenced></m:mrow></m:math>.</div><div class="paramtext">For the scale parameter <m:math><m:mi>&#963;</m:mi></m:math>&#160;(or <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>)  the equation is

<div class="formula-eqn"><a name="eqn7" id="eqn7"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mi>&#967;</m:mi>
<m:mfenced separators=""><m:mfrac><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:msub><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:mn>1</m:mn></m:msub></m:mrow><m:mover><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover></m:mfrac></m:mfenced><m:mo>=</m:mo><m:mi>n</m:mi><m:mo>/</m:mo><m:mn>2</m:mn><m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (7)
     </td></tr></table></div>

where <m:math><m:mi>&#967;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:mi>z</m:mi><m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>/</m:mo><m:mn>2</m:mn></m:math>.</div><div class="paramtext">For the Normal distribution <m:math><m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:mi>z</m:mi></m:math>&#160;and  <m:math><m:mi>&#967;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:msup><m:mi>z</m:mi><m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mn>2</m:mn></m:math>.  Thus, the maximum likelihood estimates for <m:math><m:msub><m:mi>&#952;</m:mi><m:mn>1</m:mn></m:msub></m:math>&#160;and <m:math><m:msup><m:mi>&#963;</m:mi><m:mn>2</m:mn></m:msup></m:math>&#160;are the sample mean and variance with the <m:math><m:mi>n</m:mi></m:math>&#160;divisor respectively.  As the latter is biased, <a class="eqn" href="#eqn7">(7)</a> can be replaced by

<div class="formula-eqn"><a name="eqn8" id="eqn8"/><table class="formula-eqn"><tr><td class="formula-eqn"><m:math display="block">
<m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mi>&#967;</m:mi>
<m:mfenced separators=""><m:mfrac><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>-</m:mo><m:msub><m:mover><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:mn>1</m:mn></m:msub></m:mrow><m:mover><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover></m:mfrac></m:mfenced><m:mo>=</m:mo><m:mfenced separators=""><m:mi>n</m:mi><m:mo>-</m:mo><m:mn>1</m:mn></m:mfenced><m:mi>&#946;</m:mi><m:mtext>,</m:mtext>
</m:math></td><td class="formula-eqn2">
      (8)
     </td></tr></table></div>

where <m:math><m:mi>&#946;</m:mi></m:math>&#160;is a suitable constant, which for the Normal  <m:math><m:mi>&#967;</m:mi></m:math>&#160;function is <m:math><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac>
</m:math>.</div><div class="paramtext">The influence of an observation on the estimates depends on the form of the <m:math><m:mi>&#968;</m:mi></m:math>&#160;and <m:math><m:mi>&#967;</m:mi></m:math>&#160;functions.  For a discussion of influence, see <a class="ref" href="#ref397">Hampel <span class="italic">et al.</span> (1986)</a> and <a class="ref" href="#ref398">Huber (1981)</a>.  The influence of extreme values can be reduced by bounding the values of the <m:math><m:mi>&#968;</m:mi></m:math>- and  <m:math><m:mi>&#967;</m:mi></m:math>-functions.  One suggestion due to <a class="ref" href="#ref398">Huber (1981)</a> is

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:mfenced open="{" close="" separators="">
 <m:mtable columnalign="right left">
  <m:mtr>
   <m:mtd><m:mo>-</m:mo><m:mi>C</m:mi><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:mphantom><m:mo>|</m:mo><m:mo>|</m:mo></m:mphantom><m:mi>z</m:mi><m:mo>&lt;</m:mo><m:mo>-</m:mo><m:mi>C</m:mi></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mi>z</m:mi><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:mfenced open="|" close="|" separators=""><m:mi>z</m:mi></m:mfenced><m:mo>&#8804;</m:mo><m:mi>C</m:mi></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mi>C</m:mi><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:mphantom><m:mo>|</m:mo><m:mo>|</m:mo></m:mphantom><m:mi>z</m:mi><m:mo>&gt;</m:mo><m:mi>C</m:mi><m:mtext>.</m:mtext></m:mtd>
  </m:mtr>
 </m:mtable>
</m:mfenced>
</m:math></td><td class="formula2"/></tr></table></div><div class="figure"><a name="G07INT2" id="G07INT2"/><img src="../figures/G07INT2fl17.png" style="height: 16em" alt="Figure 2"/></div><div class="figure"><b>Figure 2</b></div>
Redescending <m:math><m:mi>&#968;</m:mi></m:math>-functions are often considered; these give zero values to <m:math><m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced></m:math>&#160;for large positive or negative values of <m:math><m:mi>z</m:mi></m:math>.  <a class="ref" href="#ref397">Hampel <span class="italic">et al.</span> (1986)</a> suggested

<div class="formula"><table class="formula"><tr><td class="formula"><m:math display="block">
<m:mi>&#968;</m:mi><m:mfenced separators=""><m:mi>z</m:mi></m:mfenced><m:mo>=</m:mo><m:mfenced open="{" close="" separators="">
 <m:mtable columnalign="center right">
  <m:mtr>
   <m:mtd><m:mo>-</m:mo><m:mi>&#968;</m:mi><m:mfenced separators=""><m:mo>-</m:mo><m:mi>z</m:mi></m:mfenced></m:mtd>
   <m:mtd/>
  </m:mtr><m:mtr>
   <m:mtd><m:mi>z</m:mi><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:mn>0</m:mn><m:mo>&#8804;</m:mo><m:mi>z</m:mi><m:mo>&#8804;</m:mo><m:msub><m:mi>h</m:mi><m:mn>1</m:mn></m:msub><m:mphantom><m:mtext>.</m:mtext></m:mphantom></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:msub><m:mi>h</m:mi><m:mn>1</m:mn></m:msub><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:msub><m:mi>h</m:mi><m:mn>1</m:mn></m:msub><m:mo>&#8804;</m:mo><m:mi>z</m:mi><m:mo>&#8804;</m:mo><m:msub><m:mi>h</m:mi><m:mn>2</m:mn></m:msub><m:mphantom><m:mtext>.</m:mtext></m:mphantom></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:msub><m:mi>h</m:mi><m:mn>1</m:mn></m:msub><m:mfenced separators=""><m:msub><m:mi>h</m:mi><m:mn>3</m:mn></m:msub><m:mo>-</m:mo><m:mi>z</m:mi></m:mfenced><m:mo>/</m:mo><m:mfenced separators=""><m:msub><m:mi>h</m:mi><m:mn>3</m:mn></m:msub><m:mo>-</m:mo><m:msub><m:mi>h</m:mi><m:mn>2</m:mn></m:msub></m:mfenced><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:msub><m:mi>h</m:mi><m:mn>2</m:mn></m:msub><m:mo>&#8804;</m:mo><m:mi>z</m:mi><m:mo>&#8804;</m:mo><m:msub><m:mi>h</m:mi><m:mn>3</m:mn></m:msub><m:mphantom><m:mtext>.</m:mtext></m:mphantom></m:mtd>
  </m:mtr><m:mtr>
   <m:mtd><m:mn>0</m:mn><m:mtext>,</m:mtext></m:mtd>
   <m:mtd><m:mi>z</m:mi><m:mo>&gt;</m:mo><m:msub><m:mi>h</m:mi><m:mn>3</m:mn></m:msub><m:mtext>.</m:mtext></m:mtd>
  </m:mtr>
 </m:mtable>
</m:mfenced>
</m:math></td><td class="formula2"/></tr></table></div><div class="figure"><a name="G07INT3" id="G07INT3"/><img src="../figures/G07INT3fl17.png" style="height: 16em" alt="Figure 3"/></div><div class="figure"><b>Figure 3</b></div>
Usually a <m:math><m:mi>&#967;</m:mi></m:math>-function based on Huber's  <m:math><m:mi>&#968;</m:mi></m:math>-function is used: <m:math><m:mi>&#967;</m:mi><m:mo>=</m:mo><m:msup><m:mi>&#968;</m:mi><m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mn>2</m:mn></m:math>.  Estimators based on such bounded <m:math><m:mi>&#968;</m:mi></m:math>-functions are known as  <m:math><m:mi>M</m:mi></m:math>-estimators, and provide one type of robust estimator.</div><div class="paramtext">Other robust estimators for the location parameter are
<table class="standard-100"><tr>
<td style="width:2.1em;" valign="baseline">(i)</td>
<td valign="top">the sample median,</td>
</tr><tr>
<td style="width:2.1em;" valign="baseline">(ii)</td>
<td valign="top">the trimmed mean, i.e., the mean calculated after the extreme values have been removed from the sample,</td>
</tr><tr>
<td style="width:2.1em;" valign="baseline">(iii)</td>
<td valign="top">the winsorized mean, i.e., the mean calculated after the extreme values of the sample have been replaced by other more moderate values from the sample.</td>
</tr></table>
</div><div class="paramtext">For the scale parameter, alternative estimators are
<table class="standard-100"><tr>
<td style="width:2.1em;" valign="baseline">(i)</td>
<td valign="top">the median absolute deviation scaled to produce an estimator which is unbiased in the case of data coming from a Normal distribution,</td>
</tr><tr>
<td style="width:2.1em;" valign="baseline">(ii)</td>
<td valign="top">the winsorized variance, i.e., the variance calculated after the extreme values of the sample have been replaced by other more moderate values from the sample.</td>
</tr></table>
</div><div class="paramtext">For a general discussion of robust estimation, see <a class="ref" href="#ref397">Hampel <span class="italic">et al.</span> (1986)</a> and <a class="ref" href="#ref398">Huber (1981)</a>.</div><h3 class="standard"><a class="sec" name="intbackground4" id="intbackground4"/>2.4&#160;&#160;Robust Confidence Intervals</h3>
<div class="paramtext">In <a class="sec" href="#intbackground2">Section 2.2</a> it was shown how tests of hypotheses can be used to find confidence intervals.  That approach uses a parametric test that requires the assumption that the data used in the computation of the confidence has a known distribution.  As an alternative, a more robust confidence interval can be found by replacing the parametric test by a nonparametric test.  In the case of the confidence interval for the location parameter, a Wilcoxon test statistic can be used,  and for the difference in location, computed from two samples, a Mann&#8211;Whitney test statistic can be used.</div><h2 class="standard"><a class="sec" name="available" id="available"/>3&#160;&#160;Recommendations on Choice and Use of Available Routines</h2>
<div class="paramtext"><b>Maximum Likelihood Estimation and Confidence Intervals</b>
<table class="standard-100"><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07aaf.xml">G07AAF</a></td><td valign="top">provides a confidence interval for the parameter <m:math><m:mi>p</m:mi></m:math>&#160;of the binomial distribution.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07abf.xml">G07ABF</a></td><td valign="top">provides a confidence interval for the mean parameter of the Poisson distribution.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07bbf.xml">G07BBF</a></td><td valign="top">provides maximum likelihood estimates and their standard errors for the parameters of the Normal distribution from grouped and/or censored data.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07bef.xml">G07BEF</a></td><td valign="top">provides maximum likelihood estimates and their standard errors for the parameters of the Weibull distribution from data which may be right-censored.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07caf.xml">G07CAF</a></td><td valign="top">provides a <m:math><m:mi>t</m:mi></m:math>-test statistic to test for a difference in means between two Normal populations, together with a confidence interval for the difference between the means.</td></tr></table>
</div><div class="paramtext"><b>Robust Estimation</b>
<table class="standard-100"><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07dbf.xml">G07DBF</a></td><td valign="top">provides <m:math><m:mi>M</m:mi></m:math>-estimates for location and, optionally, scale using four common forms of the <m:math><m:mi>&#968;</m:mi></m:math>-function.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07dcf.xml">G07DCF</a></td><td valign="top">produces the <m:math><m:mi>M</m:mi></m:math>-estimates for location and, optionally, scale but for user-supplied <m:math><m:mi>&#968;</m:mi></m:math>- and <m:math><m:mi>&#967;</m:mi></m:math>-functions.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07daf.xml">G07DAF</a></td><td valign="top">provides the sample median, median absolute deviation, and the scaled value of the median absolute deviation.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07ddf.xml">G07DDF</a></td><td valign="top">provides the trimmed mean and winsorized mean together with estimates of their variance based on a winsorized variance.</td></tr></table>
</div><div class="paramtext"><b>Robust Internal Estimation</b>
<table class="standard-100"><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07eaf.xml">G07EAF</a></td><td valign="top">produces a rank based confidence interval for locations.</td></tr><tr><td style="width:15%" class="libdoc" valign="top"><a class="rout" href="../G07/g07ebf.xml">G07EBF</a></td><td valign="top">produces a rank based confidence interval for the difference in location between two populations.</td></tr></table>
</div><h2 class="standard"><a class="sec" name="index" id="index"/>4&#160;&#160;Index</h2>
<div>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>2&#160;sample&#160;<span><i>t</i></span>-test</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07caf.xml">G07CAF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>Confidence&#160;intervals&#160;for&#160;parameters:</nobr></td><td>&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr/></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;binomial&#160;distribution</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07aaf.xml">G07AAF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;Poisson&#160;distribution</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07abf.xml">G07ABF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>Maximum&#160;likelihood&#160;estimation&#160;of&#160;parameters:</nobr></td><td>&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr/></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;Normal&#160;distribution,&#160;grouped&#160;and/or&#160;censored&#160;data</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07bbf.xml">G07BBF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;Weibull&#160;distribution</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07bef.xml">G07BEF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>Robust&#160;estimation:</nobr></td><td>&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr/></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;confidence&#160;intervals:</nobr></td><td>&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr/></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;one&#160;sample</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07eaf.xml">G07EAF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;two&#160;samples</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07ebf.xml">G07EBF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;median,&#160;median&#160;absolute&#160;deviation&#160;and&#160;robust&#160;standard&#160;deviation</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07daf.xml">G07DAF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;<span><i>M</i></span>-estimates for location and scale parameters:</nobr></td><td>&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr/></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;standard&#160;weight&#160;functions</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07dbf.xml">G07DBF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;trimmed&#160;and&#160;winsorized&#160;means&#160;and&#160;estimates&#160;of&#160;their&#160;variance</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07ddf.xml">G07DDF</a></nobr></td></tr></table>
<table style="width:95%"><tr><td style="width:1pt; white-space: no-wrap;"><nobr>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;user-defined&#160;weight&#160;functions</nobr></td><td style="border-bottom: 0.1em dotted black;">&#160;</td><td style="width: 1pt; white-space: no-wrap;"><nobr><a class="rout" href="../G07/g07dcf.xml">G07DCF</a></nobr></td></tr></table></div><h2 class="standard"><a class="sec" name="withdrawn" id="withdrawn"/>5&#160;&#160;Routines Withdrawn or Scheduled for Withdrawal</h2>
<div class="paramtext">None.</div><h2 class="standard"><a class="sec" name="references" id="references"/>6&#160;&#160;References</h2><div class="paramtext"><a name="ref531" id="ref531"/>Cox D R and Hinkley D V (1974)  <i>Theoretical Statistics</i> Chapman and Hall </div>
<div class="paramtext"><a name="ref397" id="ref397"/>Hampel F R, Ronchetti E M, Rousseeuw P J and Stahel W A (1986)  <i>Robust Statistics. The Approach Based on Influence Functions</i> Wiley </div>
<div class="paramtext"><a name="ref398" id="ref398"/>Huber P J (1981)  <i>Robust Statistics</i> Wiley </div>
<div class="paramtext"><a name="ref587" id="ref587"/>Kendall M G and Stuart A (1973)  <i>The Advanced Theory of Statistics (Volume 2)</i> (3rd Edition) Griffin </div>
<div class="paramtext"><a name="ref532" id="ref532"/>Silvey S D (1975)  <i>Statistical Inference</i> Chapman and Hall </div><hr/><div><a class="chap" href="g07conts.xml">G07 Chapter Contents</a></div><div><a class="chapint" href="../../pdf/G07/g07intro.pdf">G07 Chapter Introduction (PDF version)</a></div>
<div><a class="htmltoc" href="../FRONTMATTER/manconts.xml">NAG Library Manual</a></div>
<div><hr/><a class="genint" href="../FRONTMATTER/copyright.xml">&#169; The Numerical Algorithms Group Ltd, Oxford, UK. 2009</a></div></body></html>
