Since this semi-Markov chain is not directly observable, the state
sequence
and the model parameters (i.e.,
the transition probability matrix P for the control states,
the mean interarrival time
,
the vector of mean sojourn times
,
the sojourn time probability vector
for each control state
,
and the control state sequence
of the data set)
are estimated from the observed sequence
.
The main steps of the standard procedure for HMM with explicit state duration are summarized as follows:
Following the above procedure, we can obtain the maximum likelihood
model parameters
for the given observation sequence
and the state
space
.
Let
denote the estimated non-parametric probability
mass function for the sojourn time
of state
, and let
denote the estimated
non-parametric probability mass function for the observation
of state
.
The total number of model parameters can be further reduced if the
observation distribution or the state sojourn time distribution is
approximated by some parametric distributions such as Gaussian, Poisson
or gamma distributions [91,62].
In this case, one only needs to estimate a few parameters that specify
the selected distribution functions.
Ferguson [31] has shown that the parameters for
the parametric sojourn time distribution
and the parametric observation distribution
for state
can be found by maximizing
and
subject to the
stochastic constraints
and
.
If the arrival process for each control state is Poisson and the
control state sojourn times follow a hyperexponential distribution, i.e.,
Finally, as part of the initialization step, we set the total number of
control states to a pre-specified input parameter (which is analyzed in
our experiments).
If this parameter is a sufficiently large integer,
in the re-estimation procedure,
we delete the states that are never visited. As such the value of
is reduced to a smaller number of control states.
This led to a maximum of
control states for the data sets used in our
study.
We also need to initialize the elements of the transition probability matrix,
the control state sojourn time distributions, and the initial control state
probability vector.
An often used choice is to assume that these initial values
for the model parameters are uniformly distributed.
In addition, we assume the initial values of the control-state arrival rates
to be proportional to the state index, i.e.,
where
is the index
of the control state,
is the maximum value of
, and
is the total number of control states.