- Data distribution across the nodes.
- Resource allocation, database connection.
- Execution life-cycle on submitting a Job.
- Storage of data
- Details related to the Metastore
** Note: Refer the links metioned below under each ecosystem for detailed explanation **
-
HDFS π
-
SQOOP
- Sqoop Incremental Load:
-
HIVE π
-
SPARK π₯
-
HBASE π