MLE where loglikelihood function is a function of numerical solutions

An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20110410/934a9b4e/attachment.pl>
Hi Kristian

The obvious approach is to treat it like any other MLE problem:  evaluation
of the log-likelihood is done as often as necessary for the optimizer  
you are using: eg a call to optim(psi,LL,...)  where LL(psi) evaluates  
the log likelihood at psi.  There may be computational shortcuts that  
would work if you knew that LL(psi+eps) were well approximated by  
LL(psi), for the values of eps used to evaluate numerical derivatives  
of LL.  Of course, then you might need to write your own custom  
optimizer.

albyn

Quoting Kristian Lind <kristian.langgaard.lind at gmail.com>:
Hi there,

I'm trying to solve a ML problem where the likelihood function is a function
of two numerical procedures and I'm having some problems figuring out how to
do this.

The log-likelihood function is of the form L(c,psi) = 1/T sum [log (f(c,
psi)) - log(g(c,psi))], where c is a 2xT matrix of data and psi is the
parameter vector. f(c, psi) is the transition density which can be
approximated. The problem is that in order to approximate this we need to
first numerically solve 3 ODEs. Second, numerically solve 2 non-linear
equations in two unknowns wrt the data. The g(c,psi) function is known, but
dependent on the numerical solutions.
I have solved the ODEs using the deSolve package and the 2 non-linear
equations using the BB package, but the results are dependent on the
parameters.

How can I write a program that will maximise this log-likelihood function,
taking into account that the numerical procedures needs to be updated for
each iteration in the maximization procedure?

Any help will be much appreciated.

Kristian

	[[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

to clarify: by  "if you knew that LL(psi+eps) were well approximated  
by LL(psi), for the values of eps used to evaluate numerical  
derivatives of LL. "
I mean the derivatives of LL(psi+eps) are close to the derivatives of LL(psi),
and perhaps you would want the hessian to be close as well.

albyn

Quoting Albyn Jones <jones at reed.edu>:
Hi Kristian

The obvious approach is to treat it like any other MLE problem:  evaluation
of the log-likelihood is done as often as necessary for the  
optimizer you are using: eg a call to optim(psi,LL,...)  where  
LL(psi) evaluates the log likelihood at psi.  There may be  
computational shortcuts that would work if you knew that LL(psi+eps)  
were well approximated by LL(psi), for the values of eps used to  
evaluate numerical derivatives of LL.  Of course, then you might  
need to write your own custom optimizer.

albyn

Quoting Kristian Lind <kristian.langgaard.lind at gmail.com>:

Hi there,

I'm trying to solve a ML problem where the likelihood function is a function
of two numerical procedures and I'm having some problems figuring out how to
do this.

The log-likelihood function is of the form L(c,psi) = 1/T sum [log (f(c,
psi)) - log(g(c,psi))], where c is a 2xT matrix of data and psi is the
parameter vector. f(c, psi) is the transition density which can be
approximated. The problem is that in order to approximate this we need to
first numerically solve 3 ODEs. Second, numerically solve 2 non-linear
equations in two unknowns wrt the data. The g(c,psi) function is known, but
dependent on the numerical solutions.
I have solved the ODEs using the deSolve package and the 2 non-linear
equations using the BB package, but the results are dependent on the
parameters.

How can I write a program that will maximise this log-likelihood function,
taking into account that the numerical procedures needs to be updated for
each iteration in the maximization procedure?

Any help will be much appreciated.

Kristian

	[[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20110413/8c650e35/attachment.pl>
Questions:

1. why are you defining Bo within a loop? 
2. Why are you doing library(nleqslv) within the loop?

Doing both those statements outside the loop once is more efficient.

In your transdens function you are not using the function argument
parameters, why?
Shouldn't there be a with(parameters) since otherwise where is for example
K_vv supposed to come from?

I can't believe that the code "worked": in the call of nleqslv you have ...
control=list(matxit=100000) ...
It should be maxit and nleqslv will issue an error message and stop (at
least in the latest versions).
And why 100000? If that is required, something is not ok with starting
values and/or functions.

Finally the likelihood function at the end of your code

#Maximum likelihood estimation using mle package 
library(stats4) 
#defining loglikelighood function 
#T <- length(v) 
#minuslogLik <- function(x,x2) 
#{    f <- rep(NA, length(x)) 
#        for(i in 1:T) 
#        { 
#            f[1] <- -1/T*sum(log(transdens(parameters = parameters, x = 
c(v[i],v[i+1])))-log(Jac(outmat=outmat, x2=c(v[i],r[i]))) 
#        } 
#    f 
# } 

How do the arguments of your function x and x2 influence the calculations in
the likelihood function?
As written now with argument x and x2 not being used in the body of the
function, there is nothing to optimize.
Shouldn't f[1] be f[i] because otherwise the question is why are looping
for( i in 1:T)?
But then returning f as a vector seems wrong here. Shouldn't a likelihood
function return a scalar?

Berend

--
View this message in context: http://r.789695.n4.nabble.com/MLE-where-loglikelihood-function-is-a-function-of-numerical-solutions-tp3439436p3447224.html
Sent from the R help mailing list archive at Nabble.com.
An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20110414/45428768/attachment.pl>

HI Berend, 

Thank you for your reply. 

......
Finally the likelihood function at the end of your code

#Maximum likelihood estimation using mle package
library(stats4)
#defining loglikelighood function
#T <- length(v)
#minuslogLik <- function(x,x2)
#{    f <- rep(NA, length(x))
#        for(i in 1:T)
#        {
#            f[1] <- -1/T*sum(log(transdens(parameters = parameters, x =
c(v[i],v[i+1])))-log(Jac(outmat=outmat, x2=c(v[i],r[i])))
#        }
#    f
# }

How do the arguments of your function x and x2 influence the calculations in
the likelihood function?

What I was thinking was that the x and x2 would be the input for the transdens() and Jac() functions, but I guess that is already taken care of when defining them.  

Huh?
If you define zz <- function(x) {...} x is an argument which must be specified when using the function.
As written now with argument x and x2 not being used in the body of the
function, there is nothing to optimize.

That is correct and that is the problem. The likelihood need to be stated as a function of the parameters, here the vector "parameters". Because in the maximum likelihood estimation we want to maximize the value of the likelihood function by changing the parameter values. The likelihood function is a function of the parameters but only through the functions, for example Kristian() is a function of "parameters" which feeds into Bo(), transdens() and Jac(). Do you have any suggestions how to get around this issue?  

What kind of problem?. Why don't you then do (parameters and outmat already known globally)

          f[i] <- -1/T*sum(log(transdens(parameters = parameters, x = x))-log(Jac(outmat=outmat, x2=x2)))

You should pass the arguments used by the optimizer in calling your likelihood function to the functions you defined. That way f[] will depend on x and x2 and so will the likelihood.
Shouldn't f[1] be f[i] because otherwise the question is why are looping
for( i in 1:T)?
But then returning f as a vector seems wrong here. Shouldn't a likelihood
function return a scalar?

The likelihood function should return a scalar. I think the fix could be to make the function calculate the function value at each "i" and then make it return the mean of all the f[i]s. 

Is the mean  a likelihood?

Berend