
ID2221 Project - Group 333

Ibrahim Abdelkareem - Daniel Bruke - Erik Vindblad


Flight Route Aggregator

This project simulates live flight traffic data and uses Spark to aggregate it, showing how many flights flew through each country's airspace.

The project applies several of the topics covered in this course:

  • NoSQL Database: MongoDB
  • Message Broker: Kafka
  • Data Processing: Spark Structured Streaming
  • Containers: Docker
  • Container Orchestration: Docker Compose

Architecture

Flight traffic data is stored in a hosted MongoDB instance. It is collected and sent over Kafka by flight-route-publisher, a .NET application that can be found in /apps/flight-route-publisher.
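
For illustration, here is a rough sketch of what the publisher does: read stored position documents from the source MongoDB and produce them to a Kafka topic. The real app is written in .NET; this sketch uses Scala with the official Java clients, and the broker address, database, collection, topic, and field names are all assumptions, not values taken from the repository:

import java.util.Properties
import com.mongodb.client.MongoClients
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

object FlightRoutePublisherSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "kafka:9092") // assumed broker address
    props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    val producer = new KafkaProducer[String, String](props)

    val mongo = MongoClients.create("mongodb://read-db:27017")                 // assumed URI
    val positions = mongo.getDatabase("flights").getCollection("positions")    // assumed names

    // Forward each stored position document to Kafka as JSON,
    // keyed by an assumed aircraft identifier field.
    val cursor = positions.find().iterator()
    while (cursor.hasNext) {
      val doc = cursor.next()
      producer.send(new ProducerRecord("flight-routes", doc.getString("icao24"), doc.toJson))
    }

    producer.flush()
    producer.close()
    mongo.close()
  }
}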

Our Scala app flight-route-aggregator-v2, which can be found in /apps/flight-route-aggregator-v2, reads the Kafka topic as a structured stream via Spark, aggregates the data, and uses a third-party library to determine which country's airspace a flight is in, given its latitude and longitude. It then writes the aggregated data to MongoDB using the MongoDB connector.
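
A minimal sketch of what such a pipeline can look like with Spark Structured Streaming and the MongoDB Spark connector. The topic name, message schema, connection URIs, and the countryOf lookup are illustrative assumptions, not the exact code in /apps/flight-route-aggregator-v2:

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._
import org.apache.spark.sql.types._

object FlightRouteAggregatorSketch {
  // Hypothetical offline lookup mapping a position to a country code;
  // the real app delegates this to a third-party library.
  def countryOf(lat: Double, lon: Double): String = "??"

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("flight-route-aggregator-sketch")
      .getOrCreate()
    import spark.implicits._

    // Assumed JSON payload on the topic: {"icao24": "...", "lat": 59.3, "lon": 18.1}
    val schema = new StructType()
      .add("icao24", StringType)
      .add("lat", DoubleType)
      .add("lon", DoubleType)

    val countryUdf = udf((lat: Double, lon: Double) => countryOf(lat, lon))

    val flights = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "kafka:9092") // assumed broker
      .option("subscribe", "flight-routes")            // assumed topic
      .load()
      .select(from_json($"value".cast("string"), schema).as("f"))
      .select($"f.*")

    // Count flights per country airspace.
    val perCountry = flights
      .withColumn("country", countryUdf($"lat", $"lon"))
      .groupBy($"country")
      .count()

    // Complete mode re-emits the full per-country table on every trigger.
    perCountry.writeStream
      .format("mongodb") // MongoDB Spark connector sink
      .option("spark.mongodb.connection.uri", "mongodb://write-db:27017") // assumed URI
      .option("spark.mongodb.database", "flights")                        // assumed names
      .option("spark.mongodb.collection", "per_country")
      .option("checkpointLocation", "/tmp/checkpoints/per-country")
      .outputMode("complete")
      .start()
      .awaitTermination()
  }
}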

The Python GUI connects to the MongoDB collection of per-country data produced by the incoming stream. It refreshes every 10 seconds to account for planes entering new airspaces, displaying the data as proportional circles that show both the country and the number of planes currently in its airspace.

All the apps are Dockerized except the front end, since its GUI might not work well in a containerized environment. The backend is set up via Docker Compose, a lightweight orchestration framework: not as comprehensive as Kubernetes, but it handles spawning multiple containers, restarting them on failure, and scaling them if needed. Our docker-compose setup is deliberately simple and brings up all the containers, so users of the project don't have to install and configure dependencies such as Spark or Kafka themselves.

The PlantUML diagram below illustrates the architecture.

NOTES:

  • We switched from the Python implementation, which can be found in apps/flight-route-aggregator, to a Scala implementation (hence v2) due to the lack of Dataset support in PySpark.
  • The third-party library we use to determine the country for a given latitude and longitude works offline and has a simple implementation; a sketch of the idea follows this list. We avoided hosted APIs so that we would not hit usage rate limits.
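
To give an idea of how such an offline lookup can work: country borders ship with the library as polygons, and a point-in-polygon test (ray casting) decides which polygon contains a given coordinate. A minimal sketch, with a hypothetical polygon standing in for real border data:

object PointInCountry {
  // Polygon vertices as (lon, lat) pairs; the polygon is implicitly closed.
  def contains(polygon: Seq[(Double, Double)], lon: Double, lat: Double): Boolean = {
    var inside = false
    var j = polygon.length - 1
    for (i <- polygon.indices) {
      val (xi, yi) = polygon(i)
      val (xj, yj) = polygon(j)
      // Toggle on every polygon edge crossed by a horizontal ray
      // extending to the right from the query point.
      if ((yi > lat) != (yj > lat) &&
          lon < (xj - xi) * (lat - yi) / (yj - yi) + xi)
        inside = !inside
      j = i
    }
    inside
  }
  // e.g. contains(bordersOf("SE"), lon = 18.1, lat = 59.3),
  // where bordersOf is a hypothetical source of border polygons.
}
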
@startuml
!include https://raw.githubusercontent.com/plantuml-stdlib/C4-PlantUML/master/C4_Container.puml

!define SPRITESURL https://raw.githubusercontent.com/plantuml-stdlib/gilbarbara-plantuml-sprites/v1.0/sprites

!define DEVICONS https://raw.githubusercontent.com/tupadr3/plantuml-icon-font-sprites/master/devicons
!define DEVICONS2 https://raw.githubusercontent.com/tupadr3/plantuml-icon-font-sprites/master/devicons2
!define FONTAWESOME https://raw.githubusercontent.com/tupadr3/plantuml-icon-font-sprites/master/font-awesome-5
!include DEVICONS2/python.puml
!include DEVICONS2/mongodb.puml
!include DEVICONS2/dotnetcore.puml
!include DEVICONS2/scala.puml
!include FONTAWESOME/users.puml
!include SPRITESURL/kafka.puml
!include SPRITESURL/spark.puml

LAYOUT_WITH_LEGEND()

Person(user, "Customer", "", $sprite="users")
Container(gui, "GUI", "python", "The GUI Interface", $sprite="python")
package "docker compose" {
  ContainerDb(writeDb, "Aggregated Data DB", "mongodb", "", $sprite="mongodb")
  ContainerDb(readDb, "Read DB", "mongodb", "", $sprite="mongodb")
  Container(producer, "Producer", "dotnet", "", $sprite="dotnetcore")
  ContainerQueue(kafka, "Kafka", "kafka", "", $sprite="kafka")
  Container(consumer, "Consumer", "scala", "", $sprite="scala")
  Container(sparkMaster, "Spark Master", "spark", "", $sprite="spark")
  Container(sparkWorker, "Spark Worker", "spark", "", $sprite="spark")
}

Rel_D(user, gui, "Uses", "")
Rel_D(gui, writeDb, "Uses", "")
Rel_R(producer, readDb, "Read")
Rel_L(producer, kafka, "Produce")
Rel_D(consumer, kafka, "Consume")
Rel_U(consumer, writeDb, "")
Rel(consumer, sparkMaster, "")
Rel(sparkMaster, sparkWorker, "")
@enduml

Prerequisite

PyMongo, used to connect to the MongoDB database, is not compatible with Python 3.10 and newer. Please use an earlier version of Python, such as 3.8, which has been tested successfully. For macOS users running Python 3.6 or later, run the following command in the terminal:

open "/Applications/Python <YOUR PYTHON VERSION>/Install Certificates.command"

This ensures that OpenSSL can access the latest root certificates on the system.

Use

Back-End

  • Install Docker on your local machine.
  • Run docker-compose up -d
  • To run the front-end application:

cd ./frontend_gui
pip install -r requirements.txt
python3 plotting.py

About

FlightRouteAggregator: A copy of the original repository for the course ID2221.
