This project sets up and analyzes a NoSQL database using MongoDB. The data is sourced from a JSON
file containing establishments' food hygiene ratings. The project involves:
- NoSQL Database Setup - Configuring and populating a MongoDB database.
- Data Analysis - Querying and analyzing the dataset using Python and MongoDB tools.
The analysis is implemented across two Jupyter Notebooks:
NoSQL_setup.ipynb
- For database creation and population.NoSQL_analysis.ipynb
- For querying and exploring the data.
File Name | Description |
---|---|
establishments.json |
JSON file containing food hygiene rating data. |
NoSQL_setup.ipynb |
Jupyter Notebook for MongoDB database setup and data loading. |
NoSQL_analysis.ipynb |
Jupyter Notebook for data queries and analysis. |
-
MongoDB Database Setup:
- Sets up a MongoDB database named
establishments_db
. - Imports the
establishments.json
file into a collection.
- Sets up a MongoDB database named
-
Data Analysis:
- Queries the MongoDB database to analyze food hygiene ratings.
- Performs queries such as filtering establishments by ratings, location, or business type.
-
Visualizations:
- Generates insightful visualizations (optional) to explore the data.
-
Prerequisites:
- Python 3.x
- MongoDB installed locally or cloud-hosted via MongoDB Atlas
- Jupyter Notebook
-
Setup:
-
Clone this repository or download the project files.
-
Install dependencies (if needed):
-
pip install -r requirements.txt
- Establishments with the highest food hygiene ratings (5) are distributed across various locations.
- Business types such as restaurants and cafes dominate the dataset.
- Analysis reveals insights into the cleanliness and management confidence of establishments.