Page 382 - 360.revista de Alta Velocidad - Nº 5
P. 382

Pagliara, Francesca. Menicocci, Fabrizio.Vassallo, José Manuel. Gómez Sánchez, Juan.




                   where:

                           β   is the intercept, i.e. the expected value of p when all the predictors are 0;
                            0
                           β  i  are the regression coefficients, estimated through the calibration process.



                   As  a  consequence  of  this  linearization  process,  the  interpretation  of  the  β   coefficients  is
                                                                                                 k
                   different compared to standard linear regression models. The slope coefficient suggests that
                   for a unit increase in a certain explanatory variable X , the weighted log of the odds in favor of
                                                                        k
                                                            βk
                   a certain alternative (Y=1) increases of e . Moreover, for a unit increase of a given explanatory
                                                                                        βk
                   variable X , the odds ratio in favor of happening Y=1 increases of e . Furthermore, unlike a
                             k
                   simple linear regression, logistic regression parameters are usually estimated with the method
                   of maximum likelihood, an iterative process calculating small corrections until the convergence
                   is reached.
                   Regarding the goodness of fit of the model, R  statistics refer to the entire model and indicates
                                                                2
                   how useful the explanatory variables are in predicting the response variable. The Cox & Snell
                   and the Nagelkerke R  are two of the most used statistics. The maximum value for Cox & Snell
                                         2
                     2
                                                           2
                   R  is less than 1 while the Nagelkerke R  provides a correction of this one and covers the full
                   range from 0 to 1 and therefore is often preferred.
                   A preliminary  analysis  has been  carried out to check  potential  collinearity  between  the
                   explanatory variables ECO-EXC and INCOME < 2000. Specifically a correlation analysis based on
                   the Chi2 test allows determining the level of relationship between both variables. With a level
                   of significance equal to 0.05 and 1 degree of freedom, according to the Chi  test the alternative
                                                                                             2
                   hypothesis is accepted and the variables are independent.  Estimation results for the binary
                   choice model specified are reported in Table 7. According to the estimates it can be observed
                   that:
                   •       Those users who have travelled  at  least  once by Spanish  HSR (AVE) have  a higher
                       probability of choosing it again (β          positive and highly significant).
                                                        ALREADY-TRAV-HSR
                   •       Those users who feel to be economically excluded (i.e. for whom the cost of the HSR
                       ticket is perceived high) have a lower probability of choosing HSR (β  ECO-EXC  negative and
                       significant).
                   •       Those users who feel to be time-based excluded (i.e. who feel constrained due to the
                       impossibility of reconciling their commitments with train frequency and timetable) have a
                       low probability of choosing HSR (β TIME-EXC  negative and highly significant), keeping the rest of
                       variables constant.
                   •       Those users who feel to be geographically excluded (i.e. who have a lower accessibility
                       to AVE stations) have a low probability of choosing HSR (β     negative and significant).
                                                                                GEO- EXC
                   •       Those users who have a monthly income under 2,000 Euro have a lower probability of
                       choosing   AVE  (β INCOME<2000   negative).  Moreover  this  variable  is  not  statistically  significant
                       (t–ratio = 1.684 < 1.960).

                   Table 7 - Estimation results (not all the variables significant)















            380                                                                             360.revista de alta velocidad
   377   378   379   380   381   382   383   384   385   386   387