Pipeline comparison – Pipelines using Kubeflow for Custom Models
Users may run multiple pipelines with different models or with a different sample of data. GCP provides users an option to compare the performance of different pipelines.
In this exercise, we had built a pipeline to train random forest classifier model. Change random forest to any other model of your choice and run the pipeline (decision tree classifier can be used in this scenario with minimal changes in the code). Import the required package and make necessary changes in the code (at third and fourth component of the pipeline). Change the name of the JSON file, and the pipeline name. Compile and submit the pipeline job to execute.
Once the pipeline is executed successfully follow these steps to compare the pipelines.
Step 1: Pipelines module of Vertex AI
All the pipelines executed will be listed under pipeline section as shown in Figure 7.17:
Figure 7.17: Pipeline module of Vertex AI
- Go to the Pipelines module of Vertex AI.
- Select the pipeline to be compared.
- Click COMPARE.
Step 2: Pipeline comparison
Using the pipeline comparison, users can compare the parameters of the pipeline used, the output metrics and KPIs of the pipelines as shown in Figure 7.18:
Figure 7.18: Pipeline comparison
We have utilized workbench, and BigQuery to store the data and cloud storage to store the artifacts of the pipeline. Ensure to delete the workbench, clear the data stored in the cloud storage and BigQuery.
Differences between Vertex AI and Kubeflow pipelines
Few important differences between Vertex AI pipelines and Kubeflow pipelines are listed below:
- Domain-Specific Language (DSL) versions: Pipelines created using TFX v0.30.0 or later and Kubeflow Pipelines SDK v2 domain-specific language (DSL) can be executed on the Vertex AI pipelines. Only the Kubeflow Pipelines SDK are executed using Kubeflow pipelines.
- Storage: Kubernetes resources like persistent volume claims can be used in Kubeflow Pipelines, whereas in Vertex AI cloud storage is used for data storage (Cloud Storage FUSE is used for mounting to the components)
- Recursion: Pipeline components that are called recursively are not supported by Vertex AI whereas Kubeflow supports it.
We learnt how to construct pipeline using custom model using Kubeflow SDK, compile and submit the pipeline job. The use of lineage in the pipeline, comparing different pipelines and its uses were also covered in this chapter.
In the next chapter, we will construct the pipeline using TensorFlow Extended (TFX) for custom model training.
- What changes are needed if we have to execute the pipeline using xgboost/pytorch /TensorFlow?
- Why is Pipeline comparison helpful?
- How are the artifacts and the parameters of the pipeline tracked?
You may also like
Archives
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- September 2023
- August 2023
- June 2023
- May 2023
- April 2023
- February 2023
- January 2023
- November 2022
- October 2022
- September 2022
- August 2022
- June 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
Calendar
M | T | W | T | F | S | S |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||
6 | 7 | 8 | 9 | 10 | 11 | 12 |
13 | 14 | 15 | 16 | 17 | 18 | 19 |
20 | 21 | 22 | 23 | 24 | 25 | 26 |
27 | 28 | 29 | 30 | 31 |
Leave a Reply