Data analysis of a transportation company
Pavone, Davide (2016)
Pavone, Davide
Metropolia Ammattikorkeakoulu
2016
All rights reserved
The permanent address of the publication is
https://urn.fi/URN:NBN:fi:amk-201605066431
Abstract
The purpose of this thesis project was to investigate the database of a Finnish taxi company in order to provide the company with a summary of its performance over a specific three-month period. The scope of the research was to understand the data structure, reorganize the database, analyse sets of data, and present the results, limitations and any deficiencies, in order to give useful information about the performance of the taxis.
The study started with getting familiar with the information the database contains. The information concerns rides for customers with special needs. The trips are organised by a company working for the city of Helsinki, and the information is collected by an application. A model was needed to structure the whole project, and the CRISP-DM model was chosen. The open-source RStudio environment, with various packages, was chosen as the workspace. After introducing different concepts regarding data and illustrating models for data mining, the research focuses on analysing a segment of the database.
The results, supported by various interactive graphs that show the data from different perspectives, can help the company to assess the profitability of working with these special rides provided by the city.
