Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/7570
Full metadata record
DC FieldValueLanguage
dc.contributor.authorUthayasanker, T.-
dc.date.accessioned2025-06-01T08:03:11Z-
dc.date.available2025-06-01T08:03:11Z-
dc.date.issued2024-11-06-
dc.identifier.citationConference Proceedings of 13th Annual Science Research Session – 2024 on “"Empowering Innovations for Sustainable Development Through Scientific Research" on November 6th 2024. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.. pp. 27.en_US
dc.identifier.isbn978-955-627-029-7-
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/7570-
dc.description.abstractRide-hailing applications predict the fare for each trip at the beginning of the ride. Accurate prediction fare prediction helps improve the user experience and trip conversion. We aim to build a machine learning model that classify the ride fare estimation. This model helps improve the fare estimate model and feature analysis suggest the key impact factors in ride fare. A selected sample of trips, the fare predictions are labeled as either correct or incorrect. This project aims to develop an accurate and explainable machine learning model to classify these fare predictions. Key features were manually engineered from the provided dataset, including calculated distances, driving times, and various fare-related ratios, to enhance model performance. The dataset was carefully preprocessed to handle missing values and imbalances in the label distribution. Various machine learning models, including Random Forest, XGBoost, and LightGBM, were employed, with XGBoost proving to be the most effective individual model. To further enhance accuracy, a filtering ensemble technique was used to combine these models, achieving an impressive 98.32% accuracy. The explainability of the model was analyzed using SHAP values, which revealed that the most significant factors influencing predictions were ride rate, ride distance, and waiting time. Additionally, other factors such as ride hour, ride day, and ride month also played a role in the model’s performance. This approach provides a robust and interpretable solution to the problem of ride fare classification. The study highlights that fare misclassification is analogous to an outlier detection problem, where incorrect fares often exhibit extreme rates in terms of fare per distance or driving time. The findings suggest that further improvements could be made by incorporating additional features related to ride penalties.en_US
dc.language.isoen_USen_US
dc.publisherFaculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.en_US
dc.subjectEngineeringen_US
dc.subjectEnsemble methodsen_US
dc.subjectFeatureen_US
dc.subjectRide fare Classificationen_US
dc.subjectMachine learningen_US
dc.subjectGBoost.en_US
dc.titleClassifying ride fare predictions using filtered ensamples random forest and gradient boosting algorithmsen_US
dc.typeArticleen_US
Appears in Collections:13th Annual Science Research Session

Files in This Item:
File Description SizeFormat 
CLASSIFYING RIDE FARE PREDICTIONS USING FILTERED.pdf127.03 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.