Skip to content

components validation_trigger_import

github-actions[bot] edited this page Dec 6, 2024 · 17 revisions

Validation Trigger Import model

validation_trigger_import

Overview

Component for enabling validation of import pipeline.

Version: 0.0.14

Tags

Preview Internal

View in Studio: https://ml.azure.com/registries/azureml/components/validation_trigger_import/version/0.0.14

Inputs

pipeline specific compute

Name Description Type Default Optional Enum
compute Common compute for model download, MLflow conversion and registration. eg. provide 'FT-Cluster' if your compute is named 'FT-Cluster'. Special characters like \ and ' are invalid in the parameter value. If compute name is provided, instance_type field will be ignored and the respective cluster will be used string serverless True
instance_type Instance type to be used for the component in case of serverless compute, eg. STANDARD_NC6s_v3. The parameter compute must be set to 'serverless' for instance_type to be used string Standard_DS12_v2 True

Inputs for download model

Name Description Type Default Optional Enum
model_source Storage containers from where model will be sourced from string Huggingface ['AzureBlob', 'GIT', 'Huggingface']
model_id A valid model id for the model source selected. For example you can specify bert-base-uncased for importing HuggingFace bert base uncased model. Please specify the complete URL if GIT or AzureBlob is selected in model_source string
model_flavor Flavor of MLFlow to which model the model is converted to. string HFTransformersV2 False ['HFTransformersV2', 'OSS']

Inputs for the MlFLow conversion

Name Description Type Default Optional Enum
license_file_path Path to the license file uri_file True
task_name A Hugging face task on which model was trained on string True ['text-classification', 'fill-mask', 'token-classification', 'question-answering', 'summarization', 'text-generation', 'text-classification', 'translation', 'image-classification', 'text-to-image', 'chat-completion', 'image-feature-extraction']
hf_config_args Provide args that should be used to load Huggingface model config. eg: trust_remote_code=True; string True
hf_tokenizer_args Provide args that should be used to load Huggingface model tokenizer. eg: trust_remote_code=True, device_map=auto, string True
hf_model_args Provide args that should be used to load Huggingface model. eg: trust_remote_code=True, device_map=auto, low_cpu_mem_usage=True string True
hf_pipeline_args Provide pipeline args that should be used while loading the hugging face model. Dont use quotes. If value cannot be eval'ed it will be taken as as string. eg: trust_remote_code=True, device_map=auto string True
hf_config_class AutoConfig class may not be sufficient to load config for some of the models. You can use this parameter to send Config class name as it is string True
hf_model_class AutoModel classes may not be sufficient to load some of the models. You can use this parameter to send Model class name as it is string True
hf_tokenizer_class AutoTokenizer class may not be sufficient to load tokenizer for some of the models. You can use this parameter to send Config class name as it is string True
hf_use_experimental_features Enable experimental features for hugging face MLflow model conversion boolean False True
extra_pip_requirements Extra pip dependencies that MLflow model should capture as part of conversion. This would be used to create environment while loading the model for inference. Pip dependencies expressed as below. Do not use quotes for passing. eg: pkg1==1.0, pkg2, pkg3==1.0 string True

Inputs for MLflow local validation

Name Description Type Default Optional Enum
local_validation_test_data Test data for MLflow local validation. Validation will be skipped if test data is not provided uri_file True
local_validation_column_rename_map Provide mapping for local validation test data column names, that should be renamed before inferencing eg: col1:ren1; col2:ren2; col3:ren3 string True

Inputs for Model registration

Name Description Type Default Optional Enum
custom_model_name Model name to use in the registration. If name already exists, the version will be auto incremented string True
model_version Model version in workspace/registry. If the same model name and version exists, the version will be auto incremented string True
model_description Description of the model that will be shown in AzureML registry or workspace string True
registry_name Name of the AzureML asset registry where the model will be registered. Model will be registered in a workspace if this is unspecified string True
model_metadata A JSON or a YAML file that contains model metadata confirming to Model V2 contract uri_file True
update_existing_model If set to true, will update the already existing model in the registry. If set to false, will create a new model in the provided registry. boolean False True

Outputs

Name Description Type
validation_info Validation status of the import pipeline inputs. uri_file

Environment

azureml://registries/azureml/environments/python-sdk-v2/versions/23

Clone this wiki locally