Redmodel

TFX implementation of text sentiment classification using Airflow.

/dags

Files containing the definition of DAGs that are used to implement the pipeline

/utils

Contains module files that are used in the ML pipeline for tasks such as transformation and training.

analysis notebook is used to preprocess data before saving it in required directory for ML pipelines

remodel_nsl notebook implements the semi supervised workflow as performed in Airflow.

setup_tfx_airflow shell script install the required libraries for reproducing the task.

Pipeline components are implemented as DAGs using Airflow

ImportExampleGen

This component ingests CSV data into the machine learning pipeline by converting the data types into compatible datatypes.

IdentifyExample

Marking each example in the utilized data occurs at this DAG component by assigning each instance with a unique identifier

StatisticsGen

This component computes statistical information concerning the ingested dataset and the information is used in following stages to evaluate the data for anomalies and also analyze model performance.

SchemaGen

This component generates the schema from the provided data and the data is stored in Metadata store for future analysis and evaluation of model performance.

ExampleValidator

Anomalies are detected at this stage to determine any inconsistencies in the data that might undermine the performance of the generated machine learning model.

SynthesizeGraph

Neural Structured Learning is utilized in this component to generate a graph using similarity measurement between labelled and unlabelled data. Embedding for the text is also generated from transfer learning libraries using pretrained layers.

Transform

This component performs the general preprocessing tasks that enhance performance and structure the data better.

GraphAugmentation

This component retrieves the probable label assignment from the synthesized graph to create a matching example entity that contains both labels and features that will be used during training.

Trainer

This method leverages tensorflow to perform training task that completes the model development phase.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
dags		dags
inference		inference
script		script
simple		simple
utils		utils
.gitignore		.gitignore
README.md		README.md
airflow_config.sh		airflow_config.sh
analysis.ipynb		analysis.ipynb
inf2.png		inf2.png
remodel_nsl.ipynb		remodel_nsl.ipynb
setup_tfx_airflow.sh		setup_tfx_airflow.sh
test1.png		test1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Redmodel

/dags

/utils

ImportExampleGen

IdentifyExample

StatisticsGen

SchemaGen

ExampleValidator

SynthesizeGraph

Transform

GraphAugmentation

Trainer

About

Releases

Packages

Languages

okirialbert/redmodel-pipeline

Folders and files

Latest commit

History

Repository files navigation

Redmodel

/dags

/utils

ImportExampleGen

IdentifyExample

StatisticsGen

SchemaGen

ExampleValidator

SynthesizeGraph

Transform

GraphAugmentation

Trainer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages