Nested Logit regression model Harvard Case Solution & Analysis

Nested Log it regression model  Case Study Solution 


This assignment introduces the census for Motorized vehicle crashes,which represents a significant cause of death annually. However, it was estimated by the world health organization that 20 to 50 million road accidents result in non-fatal injuries which could range from debilitating outcomes to minor injuries. Furthermore, it was also estimated that about 1.24 million people die from fatal accidents on the road. Additionally, it was determined that the increasing road accidents had adverse effects on the economy, which, in turn, raised the medical costs of the victims. Moreover, it decreased the productivity of the organization and also was a major cause of traffic congestion on the road. In addition to this, it could be determined that in the U.S fatalities relating to highways occurred more frequently on rural highways, attributed to the lack of safety measure implement in that area. However, crash severity would remain the primary focus of the analysis as severity levels could have a tremendous impact on the economic condition and emotional state of the people. Therefore, the main object of the study was to estimate crash injury severity outcomes by implementing nested logit and probit Regression models.

Furthermore, KABCO scale directs development of the model towards a discrete choice model. However, this study focuses on binary nested logit Regression models ofseverity outcomes of road accidents.

Data Description

The Data had been collected from four datasets, consisting of details regarding vehicle, crash, roadway and occupant data in dta format, merged together to define specific crash events in the rural areas regarding the highways in Illinois. Furthermore, the dataset defines the level of crash severity for each incident, ranging from the worst injury to any driver or occupant involved in the accident, which could be determined using nested logit and probit regression model.

Furthermore, it can be determined that the studies suggest unreliability of data recorded in the dataset, as it does not account for subjective information of each incident. Moreover, the subjective information, including road defects,weather or driver conditions could also be a relevant cause of crash injuries. Hence, the absence of the subjective data from analysis could cause the model to omit variable bias, which would result in the parameters overestimation or underestimation.

Variable description

Using nested logit and progit regression model to estimate crash severity, where “severity” was taken as the dependent variable. Whereas, the independent variables were distributed into four different groups as per their features namely, Roadway Attributes,Traffic, Driver & Passenger, and Crash Type. However, the variable description is given in Exhibit-1.



Exihibit-1 (Variable Description)

Class Variable Description
Traffic aadt1 Avg annual traffic per day (veh/day)
comm_vol volume of heavy commercial vehicles per day (trucks/day)
large Indicator variable: 1=large vehicle (i.e., bus or

truck) involved in crash; 0=no large vehicle involved

Roadway Attributes lane width Lane width (ft.)
shoulder type Type of shoulder: 1 = earth/sod; 2 = aggregate; 3 = paved; 4 = composite

shoulder type of aggregate and sod; 5 = composite shoulder type of paved with

either aggregate or sod

shoulder width Shoulder width (ft.)
Crash Type other Indicator variable: 1 = collision type other than specific ones below;

0 = otherwise

fixed_object Indicator variable: 1 = fixed-object collision;

0 = otherwise

animal Indicator variable: 1 = animal collision; 0 = other
overturn Indicator variable: 1 = overturn collision; 0=other
turning_rear end Indicator variable: 1 = turning or rear-end collision; 0=other
side_same Indicator variable: 1 = sideswipe same direction; 0=other
side opp_head_angle Indicator variable: 1 = sideswipe opposite direction, head-on, or

angle collision; 0=other

Driver & Passenger drv_max_age Max age of drivers involved in the crash (years)
back_max_age Max age of the back seat occupants involved in the crash
front_max_age Maxage of front seat occupants involved in the crash
num_male Number of male occupants involved in the crash (other than the drivers)
num_female Number of female occupants involved in the crash (other than the drivers)
backind Back seat occupant indicator: 1=there was at least one back seat occupant involved in the crash; 0=no back seat occupants involved
front ind Indicator variable: 1=there was at least one front seat occupant involved in the crash; 0= no front seat occupant involved
drvsex_male Indicator variable: 1 = driver is male; 0 = driver is female
drv_no_rest Indicator variable: 1 = there was at least one driver involved in the crash that did not use restrain or used restraint improperly; 0 =all drivers used restraint properly
front_no_rest Indicator variable: 1 = there was at least one front seat occupant involved in the crash that did not use restraint or used restraint improperly; 0 =all front seat occupants used restraint properly
back_no_rest Indicator variable: 1 = there was at least one back seat occupant involved in the crash that did not use restraint or used restraint improperly; 0 =all back seat occupants used restraint properly


The descriptive statistics of the dependent as well as independent variable is given below in exhibit-2.

Nested Logit regression model Harvard Case Solution & Analysis



Exhibit-2 (Descriptive Statistics)


Descriptive Statistics
Variable                   Observations  Mean   Std. Dev.   Min  Max
severity                    24,622            0.38            0.90                -              4.00
aadt1                    24,622    3,609.63    2,189.94       400.00    14,500.00
comm_vol                    24,622       422.24       301.12           1.00            14.00
shoulderwi~h                    24,622            6.26            2.57           1.00            14.00
drv_max_age                    24,536          41.63          16.73         13.00            98.00
back_max_age                    23,843            1.91            8.45                -            92.00
front_max_~e                    24,467            8.03          17.93                -            97.00
num_male                    24,622            0.17            0.52                -            19.00
num_female                    24,622            0.24            0.58                -            17.00
backind                    24,622            0.09            0.28                -              1.00
drvsex_male                    24,548            0.59            0.49                -              1.00
drv_no_rest                    21,841            0.03            0.76                -              1.00
front_no_r~t                    24,135            0.01            0.08                -              1.00
back_no_rest                    23,797            0.01            0.08                -              1.00
other                    24,622            0.04            0.20                -              1.00
animal                    24,622            0.58            0.49                -              1.00
overturn                    24,622            0.06            0.23                -              1.00
turning_re~d                    24,622            0.78            0.27                -              1.00
side_same                    24,622            0.11            0.11                -              1.00
side opp_he~e                    24,622            0.04            0.20                -              1.00

Exihibit-3 (Regression Analysis)

In Exhibit-3, the p-values of the independent variables are reasonable. However, most of the values are less than 5%. Moreover, under the regression model, the coefficient of AADT and animal are negative, which means that either AADT or the probability of the crash being caused by animals was increasing. However, the accident tends to be in the minor severity category............

This is just a sample partial case solution. Please place the order on the website to order your own originally done case solution








Share This


Save Up To




Register now and save up to 30%.