Highest Demanded Location Prediction System for Taxi Drivers

Highest Demanded Location Prediction System for Taxi Drivers πŸš– | Optimized Fleet Management πŸ“ˆ | Enhanced Customer Experience 🌟

<a href="https://github.com/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis/issues">
    <img src="https://img.shields.io/github/issues/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis" alt="GitHub issues">
</a>
<a href="https://github.com/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis/pulls">
    <img src="https://img.shields.io/github/issues-pr/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis" alt="GitHub pull requests">
</a>
<a href="https://github.com/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis/releases">
    <img src="https://img.shields.io/github/downloads/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis/total" alt="GitHub downloads">
</a>
<a href="https://github.com/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis/releases">
    <img src="https://img.shields.io/github/v/release/cepdnaclk/e19-co544-Demand-Location-Prediction-For-Taxis" alt="GitHub release">
</a>


</br>

Table of Content

  1. Introduction
  2. Problem
  3. Opportunity in Domain
  4. Detailed Solution
  5. Data Collection
  6. Exploratory Data Analysis

  7. Getting Started
  8. Contributors
  9. Links

  10. Suggested Approaches
  11. Getting Started
  12. Contributors
  13. Links

πŸ“š Introduction

This project leverages machine learning to predict taxi ride demand across different regions. By forecasting ride numbers for specific locations and time, we aim to create a more efficient and profitable ecosystem for taxi services.

🚧 Problem

For Taxi Drivers:

For Customers:

πŸ› οΈ Detailed Solution

πŸ“Š Data Collection

Our primary data source is the New York City Taxi and Limousine Commission website. From this resource, we can obtain approximately 20 features categorized by monthly data spanning over 20 years.

πŸ” Exploratory Data Analysis

Steps Involved

  1. Remove colums with higher percentage with Null values.
  2. Removes any rows from the dataFrame that contain missing values (NaN)
  3. Calculate trip times and speed. Then we can remove more data with unusual values. Using this we could remove the rows with unusual speeds, and trip times.
  4. Visualize interested parameters in the box plots. And checked for outliers.
  5. Convert pickup times raw into date time object. Since we are not intersted on the time we could remove that.
  6. Fianally, we could filer only date of ride (based on the pickup time) and PUlocation.
  7. Collect .csv files based on month in format. Finally we have 12 csv files for a year with naming format YYYY-MM.

πŸš€ Getting Started

Anybody can explore this project and gain insights. It’s easy.

  1. Clone the repository to the htdocs folder inside the XAMPP installation location.

Contributors


First Image Second Image

  • Convert pickup times raw into date time object. Since we are not intersted on the time we could remove that.</li>

  • Fianally, we could filer only date of ride (based on the pickup time) and PUlocation.</li>

  • Collect .csv files based on month in format. Finally we have 12 csv files for a year with naming format YYYY-MM.</li>

  • Now we can consider one pu_location at a time with respective dates and train the machine learning model.</li>

    </ol>

    πŸ’‘ Suggested Approaches

    • Auto Regression model
    • ARIMA model
    • Nural network
    • LSTM model

    πŸ‘₯ contributors

    • E/19/034, H.M.K.D. Bambaragama, email
    • E/19/226, K.G.M. Madushanka, email
    • E/19/278, A.P.T.T. Perera, email
    • E/19/409, D.P. Udugamasooriya, email
    • E/19/432, U.I. Wickramaarachchi, email