Create your first Data Pipeline

Let’s look at how to create a data pipeline using the QuickML platform.

Resources: Click here to access some sample datasets to get started with.

Creating a data pipeline using the Pipeline Builder interface

  1. Go to the Datasets section.

  2. Click Import Dataset to upload a dataset to the QuickML platform using one of the available data connector options.

  3. Select the preferred data source to perform the import operation.
    After you upload the file using one of the options above, the data quality score is displayed.

  4. To create a data pipeline, click Go to Data Cleaning. In the Create Pipeline pop-up that appears, enter a pipeline name and click the Create Pipeline button.

  5. You’ll then be taken to the Pipeline Builder page, where you can perform data preprocessing operations by dragging components from the left pane, dropping them into the builder, and connecting each one to the preceding and following components to create a pipeline.

To view the components under each data preprocessing step, expand that step in the left pane.

  6. The result of every operation performed inside the builder is shown on the sample data in the preview section. Example: Drag and drop the Select/Drop component from the Data Cleaning section, select the required columns, and choose the Drop or Select operation. If you choose Drop, the preview section confirms that the columns have been dropped.

  7. In a similar manner, drag and drop all required components into the builder and connect the last component to the destination to finish building the pipeline.

  8. To run the pipeline, click the Execute button and wait for the execution to complete.

  9. Clicking the car_price dataset takes you to its dataset details page, where you can verify new versions and other relevant dataset profile information.
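
For readers who prefer to think in code, the idea behind the builder can be sketched in a few lines of Python. This is only an illustrative analogy and not QuickML's API: a pipeline is treated as an ordered list of steps, each step receives the output of the previous one, and the final result plays the role of the preview. The function names, the second cleaning step, and the sample column names are all hypothetical.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Step = Callable[[List[Row]], List[Row]]


def drop_columns(columns: List[str]) -> Step:
    """Return a step that removes the named columns from every row."""
    def step(rows: List[Row]) -> List[Row]:
        return [{k: v for k, v in row.items() if k not in columns} for row in rows]
    return step


def drop_rows_with_missing(rows: List[Row]) -> List[Row]:
    """Remove rows in which any value is None (hypothetical second step)."""
    return [row for row in rows if all(v is not None for v in row.values())]


def run_pipeline(rows: List[Row], steps: List[Step]) -> List[Row]:
    """Apply each step in order, passing its output to the next step."""
    for step in steps:
        rows = step(rows)
    return rows


# Hypothetical sample rows; the column names are invented for illustration.
sample = [
    {"make": "A", "year": 2015, "color": "red", "price": 9000},
    {"make": "B", "year": 2018, "color": "blue", "price": None},
]

# Connecting components in the builder corresponds to ordering steps here.
pipeline = [drop_columns(["color"]), drop_rows_with_missing]
preview = run_pipeline(sample, pipeline)
print(preview)
```

Each step returns new rows rather than modifying its input, so the original sample stays unchanged, much as the source dataset is kept while the pipeline produces a new version.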

Note: QuickML uses Zoho IAM as its authentication manager. To use the service, you must be signed in with a Zoho account.

Last Updated 2023-10-08 10:48:45 +0530
