Machine Learning-Based Cost Predictive Model for Better Operating Expenditure Estimations of U.S. Light Rail Transit Projects

Inaccurate forecasts of operating expenditures during the planning phase for new Light Rail Transit (LRT) projects in the United States underestimated future costs by up to 45% (Pickrell, 1989). When operating expenditures exceeded projected levels, local transit agencies often reduced public transi...

Full description

Bibliographic Details
Main Author: Zhou, Gordon
Language:ENG
Published: The George Washington University 2021
Subjects:
Online Access:http://pqdtopen.proquest.com/#viewpdf?dispub=28157527
Description
Summary:Inaccurate forecasts of operating expenditures during the planning phase for new Light Rail Transit (LRT) projects in the United States underestimated future costs by up to 45% (Pickrell, 1989). When operating expenditures exceeded projected levels, local transit agencies often reduced public transit services to operate within their respective annual budgets. Therefore, it is imperative for transit agencies to produce reasonably accurate planning estimates to secure sufficient funding to support future operations, maintenance, and service delivery associated with LRT systems. The research aimed to develop a more accurate LRT operating expenditure predictive model to be used during the planning stage. Traditional statistical analysis and various machine learning-based algorithms were utilized with input from 22 LRT systems in the United States spanning between 2008 to 2018 from various U.S. governmental public databases. This praxis extended the current state of practice that relied primarily on sum of unit-cost estimates (also known as the unit-cost method) which generally failed to produce accurate forecasts due to lack of engineering details at the planning stage. Existing research attempted to develop regression-based methodologies using system-based attributes but did not substantially increase prediction accuracy from using the unit-cost method. The research improved current practices and research by having developed a more accurate and replicable machine learning-based predictive model using available geographic, socio-economic and LRT system-related variables.