Loading [MathJax]/jax/output/HTML-CSS/fonts/TeX/fontdata.js
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Statistics LibreTexts

Regression through the origin

( \newcommand{\kernel}{\mathrm{null}\,}\)

Regression through the origin

Sometimes due to the nature of the problem (e.g. (i) physical law where one variable is proportional to another variable, and the goal is to determine the constant of proportionality; (ii) X = sales, Y = profit from sales), or, due to empirical considerations ( in the full regression model the intercept β0 turns out to be insignificant), one may fit the model Yi = β1Xi + εi, where εi are assumed to be uncorrelated, and have mean 0 and variance σ2. Then estimates are:

ˆβ1=˜b1=ni=1XiYini=1X2i,~SSE=ni=1(Yi˜b1Xi)2=iY2i˜b21iX2i.

Also, E(˜b1)=β1, E(~SSE)=(n1)σ2, so that ~MSE=1n1~SSE is an unbiased estimator of σ2 and d.f.(~MSE)=n1. Var(˜b1)=σ2iX2i, and is estimated by s2(˜b1)=~MSEiX2i.

  • 100(1 - α)% confidence interval for β1 : ˜b1±t(1α/2;n1)s(˜b1).
  • Estimate of mean response for X=Xh : ˜Yh=˜b1Xh with estimated standard error s(˜Yh)=~MSEX2hiX2i.
  • 100(1 - α)% confidence interval for mean response : ˜Yh±t(1α/2;n1)s(˜Yh).
  • ANOVA decomposition : ~SSTO=~SSR+~SSE, where ~SSTO=iY2i, with d.f. (~SSTO)=n, ~SSR=˜b21iX2i with d.f.(~SSR)=1. Reject H0:β1=0 if F-ratio F=~MSR~MSE>F(1α;1,n1).

Inverse prediction, or calibration problem

In some experimental studies it is important to know the value of X in order to obtain ( on an average ) a pre-specified value of Y. The following example illustrates such a situtation.

X 10 15 15 20 20 20 25 25 28 30
Y 160 171 175 182 184 181 188 193 195 200

Here Y = tensile strength of paper, X = amount (percentage) of hardwood in the pulp.

Want to find Xh(new) for given value of Yh(new).

Estimate ˆXh(new)=Yh(new)b0b1. Estimated standard error of prediction is s(ˆXh(new)) where
s2(ˆXh(new))=MSEb21[1+1n+(ˆXh(new)¯X)2i(Xi¯X)2].

Then 100(1 - α)% prediction interval for Xh(new) is given by ˆXh(new)±t(1α/2;n2)s(ˆXh(new)).

Fitted model : ˆY = 143.8244 + 1.8786X . SSE = 38.8328, SSTO = 1300.9, SSR = 1262.1, R2 = 0.9701, i(Xi¯X)2 = 357.6, MSE = 4.8541, ¯X = 20.8, ¯Y = 182.9.

Contributors

  • Yingwen Li
  • Debashis Paul


This page titled Regression through the origin is shared under a not declared license and was authored, remixed, and/or curated by Debashis Paul.

Support Center

How can we help?