Description
Is there an existing issue for this?
- I have searched the existing issues
Category of feature request
Profiler
Problem statement
Most of the commands shown in this Medium article for converting Talend jobs are not present in the cli.py shipped with the current version of LakeBridge. The article uses the following:
lakebridge discover --source talend --input talend_exports/ --output talend_inventory.json
lakebridge assess --input talend_inventory.json --output assessment_report.json
lakebridge analyze talend --input talend_exports/ --output talend_profile.json
lakebridge translate talend --input talend_exports/ --output dlt_jobs/
lakebridge generate workflows --input dlt_jobs/ --output databricks_jobs/
lakebridge deploy --workspace-url https://<workspace> --token <token> --project databricks_jobs/
lakebridge validate --source-output talend_outputs/ --target-output databricks_outputs/ --output validation_report.json
BladeBridge supported this conversion before Databricks acquired it, and I was hoping to see that support return:
https://www.youtube.com/watch?v=Rwx4XrybuIs
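For anyone who wants to reproduce the comparison, here is a rough sketch of how the article's command names could be checked against a CLI module. It assumes commands are registered with a decorator whose attribute name is `command` (for example `@app.command`); I have not confirmed that this is exactly how the real cli.py does it. `SAMPLE_CLI`, `registered_commands`, and `missing_commands` are hypothetical names for illustration only, and the sample source is a stand-in, not the actual LakeBridge code.

```python
import ast

# Top-level command names taken from the article's examples above.
ARTICLE_COMMANDS = ["discover", "assess", "analyze", "translate",
                    "generate", "deploy", "validate"]


def registered_commands(cli_source: str) -> set[str]:
    """Return names of functions decorated with `@<something>.command`.

    Assumption: the CLI module registers commands through a decorator whose
    attribute name is `command`. Adjust the check if the real module uses a
    different mechanism.
    """
    found = set()
    for node in ast.walk(ast.parse(cli_source)):
        if not isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            continue
        for dec in node.decorator_list:
            # Handle both `@app.command` and `@app.command(...)`.
            target = dec.func if isinstance(dec, ast.Call) else dec
            if isinstance(target, ast.Attribute) and target.attr == "command":
                # Assumption: underscores in function names map to hyphens
                # in the command name exposed on the command line.
                found.add(node.name.replace("_", "-"))
    return found


def missing_commands(cli_source: str, wanted=ARTICLE_COMMANDS) -> list[str]:
    """List the wanted commands that the CLI source does not register."""
    available = registered_commands(cli_source)
    return [name for name in wanted if name not in available]


# Hypothetical stand-in for a cli.py; NOT the real LakeBridge source.
SAMPLE_CLI = '''
@app.command
def analyze(): ...

@app.command(is_unauthenticated=True)
def transpile(): ...

def helper(): ...
'''

if __name__ == "__main__":
    print(missing_commands(SAMPLE_CLI))
```

To run it against a local checkout, read the module's text first (for example with `pathlib.Path(...).read_text()`) and pass it to `missing_commands`. The source is only parsed, never executed, so nothing needs to be installed.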
Proposed Solution
Enable the missing commands, especially the translate command, for converting Talend ETL jobs to PySpark, as BladeBridge did before.
Additional Context
In the event that I am mistaken, please let me know the proper way of converting Talend ETL jobs.