This repository contains a project developed as part of Data Science Lab course of the Master's degree in Data Science, University of Milano Bicocca.
This project aims at analyzing and forecasting daily revenue and the daily number of receipts across six distinct restaurants. By employing a statistical approach and utilizing predictive models, particularly the SARIMA and TBATS models, the focuse is on analyzing historical data related to daily revenue and the number of receipts in each restaurant and formulating future forecasts. Finally, the results of the two models are compared to potentially identify the better-performing one.
This repository is structured as follows:
.
βββ DATA_PREPROCESSING_EXPLORATORY_ANALYSIS.ipynb # preprocessing operations and exploratory data analysis
βββ FORECAST MODELS.R # forecast models script in R language
βββ ORGANIZED DATASETS # data used (daily revenue and recepits)
βΒ Β βββ fat_r000.csv # revenue data related to the first restaurant
βΒ Β βββ fat_r001.csv # revenue data related to the second restaurant
βΒ Β βββ fat_r002.csv # revenue data related to the third restaurant
βΒ Β βββ fat_r003.csv # revenue data related to the fourth restaurant
βΒ Β βββ fat_r004.csv # revenue data related to the fifth restaurant
βΒ Β βββ fat_r005.csv # revenue data related to the sixth restaurant
βΒ Β βββ fatturato.csv # revenue data related to all restaurants
βΒ Β βββ scon_r000.csv # recepit data related to the first restaurant
βΒ Β βββ scon_r001.csv # recepit data related to the second restaurant
βΒ Β βββ scon_r002.csv # recepit data related to the third restaurant
βΒ Β βββ scon_r003.csv # recepit data related to the fourth restaurant
βΒ Β βββ scon_r004.csv # recepit data related to the fifth restaurant
βΒ Β βββ scon_r005.csv # recepit data related to the sixth restaurant
βΒ Β βββ scontrini.csv # recepit data related to all restaurants
βββ README.md
For the forecast models, the R code is provided as an attachment. This code is specific to a single restaurant. You can change the restaurant by modifying the .csv file to be loaded and then replacing "R00_" with the corresponding number in the code lines marked with "CHANGE RESTAURANT NUMBER HERE".
The code is divided into two parts: one for the analysis with pre and post COVID-19 data and one for the analysis with only post COVID-19 data. Each part is further divided into REVENUE and RECEIPTS, and within those, SARIMA and TBATS models are used separately.