Need help to locate my mistake

Rolf Turner · 2008-03-02T20:55:56Z

On 3/03/2008, at 9:18 AM, Louise Hoffman wrote: > Dear readers > > I would like to make General Linear Model (GLM) for the following > data set > http://louise.hoffman.googlepages.com/fuel.csv > > The code I have written is > > fuelData n xOnes x y theta > which gives > >> theta > [,1] > [1,] 215.8374077 > [2,]

Rolf Turner

Sun, Mar 2, 2008 12:55 PM

On 3/03/2008, at 9:18 AM, Louise Hoffman wrote:

This is certainly ***NOT*** correct. (If you really got those numbers
from Matlab, then Matlab is up to Puttee.)

Have you plotted your data?

	(1) Fitting a straight line is ridiculous.

	(2) If you are so foolish as to fit a straight line, you get
	theta to have entries -4197.96 (intercept) and 2.16 (slope).
	The line y = 79.69 + 0.18*x is off the edge of the graph and
	does not even appear.

Yes.  The expression (t(x)%*%x)^(-1) is the matrix of entry
	by entry reciprocals of the entries of t(x)%*%x.

	You want:

		theta <- solve(t(x)%*%x))%*%t(x)%*%y


	Anyhow, if you're going to use R, why not ***use R***?

	fit <- lm(fpi ~ rtime,data=fuelData)
	theta <- coef(fit)

	This gives an answer identical to that from the corrected version of
	your ``from scratch'' expression.  (That expression, while  
theoretically
	correct, is numerically ill-advised.  The cognoscenti use either the
	Choleski or the ``qr'' decomposition of t(x)%*%x to effect the  
calculations.
	One of these is what is going on in the bowels of lm().  But here I  
speak
	of that of which I know little.)

		cheers,

			Rolf Turner



######################################################################
Attention:\ This e-mail message is privileged and confid...{{dropped:9}}

Need help to locate my mistake

Thread (7 messages)