r - Used Predict function on New Dataset with different Columns -
using "stackloss" data in r, created regression model seen below:
stackloss.lm = lm(stack.loss ~ air.flow + water.temp + acid.conc.,data=stackloss) stackloss.lm newdata = data.frame(air.flow=stackloss$air.flow, water.temp= stackloss$water.temp, acid.conc.=stackloss$acid.conc.)
suppose new data set , need predict "stack.loss" based on previous model seen below:
#suppose need used model on new set of data stackloss$predict1[-1] <- predict(stackloss.lm, newdata)
i error:
error in `$<-.data.frame`(`*tmp*`, "predict1", value = numeric(0)) : replacement has 0 rows, data has 21
is way used predict function on different data set same columns different rows?
thanks in advance.
you can predict new data set of whatever length want, need make sure assign results existing vector of appropriate size.
this line causes problem because
stackloss$predict1[-1] <- predict(stackloss.lm, newdata)
because can't assign , subset non-existing vector @ same time. doesn't work
dd <- data.frame(a=1:3) dd$b[-1]<-1:2
the length of stackloss
used fit model same length re-assigning new values data.frame doesn't make sense. if want use smaller dataset predict on, that's fine
stackloss.lm = lm(stack.loss ~ air.flow + water.temp + acid.conc.,data=stackloss) newdata = head(data.frame(air.flow=stackloss$air.flow, water.temp= stackloss$water.temp, acid.conc.=stackloss$acid.conc.),5) predict(stackloss.lm, newdata) 1 2 3 4 5 38.76536 38.91749 32.44447 22.30223 19.71165
since result has same number of values newdata
has rows (n=5), makes sense attach these newdata
. not make sense attach stackloss
because has different number of rows (n=21)
newdata$predcit1 <- predict(stackloss.lm, newdata)
Comments
Post a Comment