# Churn For Bank Customers


--------------------------------------------------------------------------------
title: "Introduction"
description: "Churn for bank customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.686Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/introduction/"
service: "All Services"
--------------------------------------------------------------------------------


# Churn for Bank Customers

# Introduction

In this tutorial, we will guide you through the process of building a powerful machine learning model using {{%link href=&#34;/en/quickml/getting-started/introduction/&#34; %}}Catalyst QuickML{{%/link%}} to predict whether or not a client would leave. 

In this tutorial, we&#39;ll first do {{%link href=&#34;/en/quickml/help/data-preprocessing/data-cleaning/&#34; %}}preprocess the datasets{{%/link%}} to make sure they&#39;re tidy and prepared for training. A {{%link href=&#34;/en/quickml/help/create-data-pipeline/&#34; %}}data pipeline{{%/link%}} will be built next to handle data transformation, and an {{%link href=&#34;/en/quickml/help/create-ml-pipeline/&#34; %}}ML pipeline{{%/link%}} will be built to train and test the model. Finally, we&#39;ll provide an {{%link href=&#34;/en/quickml/help/pipeline-endpoints/&#34; %}}endpoint{{%/link%}} for the trained model that enables interaction with external apps and provides churn for bank customers.

The churn for bank customers ML model is built using the following Catalyst service:

**{{%link href=&#34;/en/quickml/getting-started/introduction/&#34; %}}Catalyst QuickML{{%/link%}}** : Using this service, we will first pre-process the sample dataset by implementing {{%link href=&#34;/en/quickml/help/data-preprocessing/data-cleaning/&#34; %}}node operations{{%/link%}} on them and constructing the {{%link href=&#34;/en/quickml/help/create-data-pipeline/&#34; %}}data pipeline{{%/link%}}. This pre-processed data will be used to create an ML model by executing {{%link href=&#34;/en/quickml/help/ml-algorithms/classification-algorithms/&#34; %}}ML algorithms{{%/link%}}. Finally, this churn for bank customers ML model can be accessed by external applications using the {{%link href=&#34;/en/quickml/help/pipeline-endpoints/&#34; %}}endpoint URL{{%/link%}} generated in QuickML.

The final output, after creating all the required data and ML pipelines in the {{%link href=&#34;https://console.catalyst.zoho.com/baas/index&#34; %}}Catalyst console{{%/link%}}, will look like this:


--------------------------------------------------------------------------------
title: "Prerequisites"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.686Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/prerequisites/"
service: "All Services"
related:
- Machine Learning Algorithms (/en/quickml/help/ml-algorithms/classification-algorithms/)

--------------------------------------------------------------------------------


# Prerequisites

Since this tutorial involves only {{%link href=&#34;/en/quickml/getting-started/introduction/&#34; %}}Catalyst QuickML{{%/link%}}, we will be working entirely in the {{%link href=&#34;https://console.catalyst.zoho.com/baas/index&#34; %}}Catalyst console{{%/link%}} to build data and {{%link href=&#34;/en/quickml/help/create-ml-pipeline/&#34; %}}ML pipelines{{%/link%}}, create ML models, and train the models to predict outcomes. Before you begin working on this tutorial, please download the below dataset:

- {{%link href=&#34;https://workdrive.zohoexternal.com/external/324bcf5f911d2d056c4b75d4c6a534ac2b139fb390d0812df7165c1965725083&#34; %}}Bank_Customers_Sample_Data{{%/link%}}

This tutorial aims to implement cleaning, refining and pre-processing operations on the datasets, and then use them to train ML models. We will be uploading the dataset to Catalyst QuickML in the later sections of this tutorial.

--------------------------------------------------------------------------------
title: "Create a Project"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn for bank customers using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.686Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/create-a-project/"
service: "All Services"
related:
- Catalyst Projects (/en/getting-started/catalyst-projects)

--------------------------------------------------------------------------------


# Create a Project

Let&#39;s {{%link href=&#34;/en/getting-started/catalyst-projects&#34; %}}create a Catalyst project{{%/link%}} from the Catalyst console.

1. Log in to the {{%link href=&#34;https://console.catalyst.zoho.com/baas/index&#34; %}}Catalyst console{{%/link%}}, then click {{%badge%}}Create a new Project{{%/badge%}}
 &lt;br /&gt;

2. Enter the project’s name as &#34;**ChurnForBankCustomers**&#34; (or a name you wish to give for the project) in the pop-up window that appears.
 &lt;br /&gt;

3. Click on {{%badge%}}Create{{%/badge%}} button. Your project will be created and automatically opened. To access your project later, simply click on the {{%badge%}}Access Project{{%/badge%}} button.
&lt;br /&gt;


--------------------------------------------------------------------------------
title: "Upload Dataset"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.688Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/upload-dataset/"
service: "All Services"
related:
- Create Your First pipeline (/en/quickml/help/create-ml-pipeline)

--------------------------------------------------------------------------------


# Upload the Dataset

Let&#39;s begin by uploading the dataset in Catalyst QuickML using the available dataset {{%link href=&#34;/en/quickml/help/data-connectors/zoho-apps/&#34; %}}dataset connectors{{%/link%}}.

1. Navigate to the QuickML service in the Catalyst console and click {{%badge%}}Start Exploring{{%/badge%}}.
&lt;br /&gt;

2. Navigate to the {{%badge%}}Datasets{{%/badge%}} component and click {{%badge%}}Import Dataset{{%/badge%}}.
&lt;br /&gt;

3. An Import Dataset pop-up will be displayed. In the **Data Sources** step, navigate to File Upload and click {{%badge%}}Upload File{{%/badge%}}.
&lt;br /&gt;

Upload the **Bank_Customers_Sample_Data** dataset that you downloaded earlier. We can have the Quotes Type as &#34;**Double Quotes(&#34;)**&#34; and Escape Character as &#34;**Backslash(\)**&#34; and click {{%badge%}}Next{{%/badge%}}.


&lt;br /&gt;
The name of the dataset will be auto-populated based on the uploaded file. You can edit it, if required, and click {{%badge%}}Upload{{%/badge%}}.

&lt;br /&gt;
The dataset is now uploaded successfully.

&lt;br /&gt;
The dataset will be displayed in the **All Datasets** section. You can click on the dataset name to view the dataset&#39;s details.
&lt;br /&gt;

Once if you click on the **Bank_Customers_Sample_Data** dataset in the list, you&#39;ll be redirected to the **Dataset Details** page where you can view the {{%link href=&#34;/en/quickml/help/data-profiler-and-viewer/#what-is-data-profiling&#34; %}}profiling, data preview{{%/link%}} and {{%link href=&#34;/en/quickml/help/data-visualization/overview/&#34; %}}visualization chart{{%/link%}} of the dataset.
&lt;br /&gt;


--------------------------------------------------------------------------------
title: "Create Data Pipeline"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.688Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/create-data-pipeline/"
service: "All Services"
related:
- Data Cleaning (/en/quickml/help/data-preprocessing/data-cleaning)
- Data Transformation (/en/quickml/help/data-preprocessing/data-transformation)
- Data Profiler and Viewer (/en/quickml/help/data-profiler-and-viewer/)

--------------------------------------------------------------------------------


# Create a data pipeline

Now that we have uploaded the dataset, we will proceed with creating a {{%link href=&#34;/en/quickml/help/create-data-pipeline/&#34;%}}data pipeline{{%/link%}} with the dataset.

1. Navigate to the **Datasets** component in the left menu. There are two ways to create a data pipeline:
    - You can click on the dataset and then click {{%badge%}}Create Pipeline{{%/badge%}} in the top-right corner of the page.
    &lt;br /&gt;
    - You can click on the pen icon located to the left of the dataset name, as shown in the image below.
    &lt;br /&gt;
    Here, we are uploading the **Bank_Customers_Sample_Data** dataset for preprocessing.

2. Name the pipeline &#34;**Churn_Prediction_Data_Pipeline**&#34; and click {{%badge%}}Create Pipeline{{%/badge%}}.
&lt;br /&gt;

The {{%link href=&#34;/en/quickml/help/pipeline-builder-interface/walkthrough/#pipeline-builder-interface-1&#34; %}}pipeline builder interface{{%/link%}} will open as shown in the screenshot below.
&lt;br /&gt;

We will be performing the following set of data preprocessing operations in order to clean, refine, and transform the datasets, and then execute the data pipeline. Each of these operations involve individual {{%link href=&#34;/en/quickml/help/data-preprocessing/data-cleaning/&#34; %}}data nodes{{%/link%}}  that are used to construct the pipeline.

### Data preprocessing with QuickML
1. #### Select/drop columns
    Selecting or dropping columns from a dataset is a common data preprocessing step in data analysis and machine learning. The choice to select or drop columns depends on the specific objectives and requirements of your analysis or modelling task.
    The columns we don&#39;t need for our model training from this dataset are &#34;**RowNumber**&#34;, &#34;**CustomerId**&#34; and &#34;**Surname**&#34; in the provided datasets. Using QuickML, you may quickly choose the necessary fields from the dataset for model training using the **Select/Drop** [node](/en/quickml/help/data-preprocessing/data-cleaning/#select-or-drop) from the **Data Cleaning** component.
    &lt;br /&gt;

2. #### Filling columns in dataset with values
   Using the {{%badge%}}Fill Columns{{%/badge%}} [node](/en/quickml/help/data-preprocessing/data-cleaning/#fill-columns) in **Data Cleaning**, we can easily fill the column values based on any certain condition. We can fill the null values or non-null values based on our requirements. For the columns &#34;**EstimatedSalary,**&#34; and &#34;**Balance**&#34; we are replacing the empty values with a custom value of &#34;**0**&#34;. 
   &lt;br /&gt;

3. #### Filter Data
    Filtering a dataset typically involves selecting a subset of rows from a DataFrame that meet certain criteria or conditions. Here we are using the Filter node from the **Data Cleaning** session to filter all the columns &#34;**CreditScore**&#34;, &#34;**Geography**&#34;, &#34;**Gender**&#34;, &#34;**Age**&#34;, &#34;**Tenure**&#34; and &#34;**Exited**&#34; that have non-empty values using the {{%badge%}}Filter{{%/badge%}} [node](/en/quickml/help/data-preprocessing/data-cleaning/#filter) from the Data Cleaning session.
    &lt;br /&gt;

4. #### Save and Execute
    Once all the nodes are connected, click the {{%badge%}}Save{{%/badge%}} button to save the pipeline. Then click on {{%badge%}}Execute{{%/badge%}} button to execute the pipeline.
    &lt;br /&gt;

You&#39;ll be redirected to the page below, which shows the executed pipeline with the execution status. We can see here that the pipeline execution was successful.

&lt;br /&gt;

Click on {{%badge%}}Execution Stats{{%/badge%}} to access more details regarding the compute usage, as shown below.

&lt;br /&gt;

In this part, we&#39;ve looked at how to process data using QuickML, giving you a variety of effective ways to get your data ready for the creation of machine learning models. This data pipeline can be reused to create multiple ML experiments for varied use cases within your Catalyst project.


--------------------------------------------------------------------------------
title: "Create ML Pipeline"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.689Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/create-ml-pipeline/"
service: "All Services"
related:
- ML Algorithms in QuickML (/en/quickml/help/ml-algorithms/classification-algorithms)
- Operations in QuickML (/en/quickml/help/operations-in-quickml/encoding)

--------------------------------------------------------------------------------


# Create an ML pipeline

To build the prediction model, we will use the preprocessed dataset in the {{%link href=&#34;/en/quickml/help/create-ml-pipeline/&#34;%}}ML Pipeline Builder{{%/link%}}. The initial step in building the ML Pipeline involves selecting the **target column**, which is the column that we are trying to predict.

To create an ML pipeline, first Navigate to the **Pipelines** component and click on the {{%badge%}}Create Pipeline{{%/badge%}} option.
&lt;br /&gt;

In the pop-up that appears, select **Prediction** as pipeline type and provide the pipeline name, we&#39;ll name the pipeline as **Churn_Prediction_ML_Pipeline** and the model **Churn_Prediction_ML_Pipeline Model** in the Create Pipeline pop-up. Then, select the appropriate dataset and the column name of the target.
&lt;br /&gt;

We need to select the source dataset that is chosen for building the data pipeline, as the preprocessed data is reflected in the source dataset. In our case, we will be importing the **Bank_Customers_Sample_Data** dataset, as we have selected it for preprocessing and cleaning, and our target is the column named **Exited**.

1. ### Imputers
      Imputers are used in various fields, such as data analysis, statistics, and machine learning to handle missing or incomplete data. Here, we are using mean imputer by importing it from **ML operations &gt; Imputers &gt; Mean Imputer** for imputing the missing values in the dataset.
      Mean Imputing &amp; Mode Imputing refers to a data imputation technique where missing values are filled based on some mean or mode of selected columns. 
      &lt;br /&gt;
      
      Here, the columns should not contain empty values for best model predictions are &#34;**CreditScore**&#34;,&#34;**Age**&#34;,&#34;**Tenure**&#34;,&#34;**Balance**&#34;,&#34;**NumOfProducts**&#34;,&#34;**HasCrCard**&#34;,&#34;**IsActiveMember**&#34;,&#34;**EstimatedSalary**&#34; imputed by its mean values and the few columns that are imputed by their mode are &#34;**Gender**&#34;,&#34;**Geography**&#34;.
      &lt;br /&gt;


2. ### Encoding
      Encoders are used in various data preprocessing and machine learning tasks to convert categorical or non-numeric data into a numerical format that machine learning algorithms can work with effectively.

      #### Ordinal encoding
      Here, we are using ordinal encoding to encode the following categorical features: &#34;gender&#34;. It assigns integers to the categories based on their order, making it possible for machine learning algorithms to capture the ordinal nature of the data. We&#39;ll use the [Ordinal Encoder](/en/quickml/help/operations-in-quickml/encoding/#ordinal-encoder) node by navigating to ML operations, clicking the -&gt;**Encoding component**, and choosing -&gt; **Ordinal Encoder** in QuickML to turn the selected category columns into numerical columns. 
      &lt;br /&gt;

      #### Ordinal Encoder
      {{%link href=&#34;/en/quickml/help/operations-in-quickml/encoding/&#34; %}}Ordinal Encoding{{%/link%}} involves mapping each unique label to an integer value. This type of encoding is really only appropriate if there is a known relationship between the categories. If the data is ordered, we can use ordinal encoding.

      Here we aare using {{%badge%}}Ordinal Encoder{{%/badge%}} node to encode the **Gender** column. We can use the {{%badge%}}Ordinal Encoder{{%/badge%}} node from **ML Operations** &gt; **Encoding** &gt; **Ordinal Encoder** in QuickML to turn the category columns into numerical columns. Here, we are converting all categorical columns to numerical format while retaining the columns’ original order and data for model training.


3. ### One-hot encoding
      {{%link href=&#34;/en/quickml/help/operations-in-quickml/encoding/#one-hot-encoding&#34;%}}One-hot encoding{{%/link%}} is typically applied to categorical columns in a dataset, where each category represents a distinct class or group. This method typically increases the dimensionality of the dataset because it creates a new binary column for each unique category. The number of binary columns is equal to the number of unique categories minus one, as you can infer the presence of the last category from the absence of all others.

      Here, we are using {{%badge%}}One-Hot Encoder{{%/badge%}} node to encode the following column: &#34;**Geography**&#34;. We&#39;ll use the One-Hot Encoder node by navigating to  **ML operations**, selecting the -&gt; **Encoding** component and choosing -&gt; **One-Hot Encoder** in QuickML to turn the selected category columns into numerical columns.
    &lt;br /&gt;

4. ### Normalize the columns
      Navigate to **ML operations-&gt; Normalization**. Drag and drop the **Min-Max Normalization** [node](/en/quickml/help/operations-in-quickml/normalization/#min-max-normalization) to the ML pipeline builder interface. In the configuration box on the right panel, choose all the columns except **Exited** which is the target and click Save.
      &lt;br /&gt;
      
5. ### Feature Engineering:
    {{%link href=&#34;/en/quickml/help/operations-in-quickml/feature-engineering/#feature-selection&#34;%}}Feature selection{{%/link%}} is the process of choosing a subset of the most relevant and important features (variables or columns) from the dataset to use in model training and analysis. The goal of feature selection is to improve the performance, efficiency, and interpretability of machine learning models. Feature selection is particularly crucial when dealing with high-dimensional datasets, as it can help reduce overfitting, reduce computation time, and enhance model interpretability.

    Here we are using the **PCA** [feature](http://localhost:1313/en/quickml/help/operations-in-quickml/feature-engineering/#feature-reduction) selection technique to generate the features. Select **PCA** node by navigating to **ML operations**, clicking -&gt;**Feature Engineering**, and choosing -**&gt;Feature Reduction**.
    &lt;br /&gt;

6. ### ML Algorithm:
	The next step in ML pipeline building is selecting the appropriate algorithm for training the preprocessed data. Here we&#39;ll use the {{%link href=&#34;/en/quickml/help/ml-algorithms/classification-algorithms/#random-forest-classification&#34;%}}Random-Forest Classification{{%/link%}} to train the data.

  In order to make sure the model is optimized for our particular dataset, we may also adjust the tuning parameters; in our instance, we can just stick with the default settings. Select **Random-Forest Classification** node by navigating to **ML operations**, clicking -&gt;**Algorithms**, and choosing -&gt;**Classification**. When everything is configured, we may save the pipeline for further testing and deployment.
  &lt;br /&gt;
Once we drag-and-drop the algorithm node, its end node will be automatically connected to the destination node. Click {{%badge%}}Save{{%/badge%}} to save the pipeline and execute the pipeline by clicking the {{%badge%}}Execute{{%/badge%}} button at the top-right corner of the pipeline builder page.
This will redirect you to the page below which shows the executed pipeline with execution status. We can clearly see here that the pipeline execution is successful.
&lt;br /&gt;
Click {{%badge%}}Execution Stats{{%/badge%}} to view more compute details about each stage of the model execution in detail.
&lt;br /&gt;
The prediction model is created and can be examined under the Model section(click on **Churn_Prediction_ML_Pipeline Model**) following the successful completion of the ML workflow. 
&lt;br /&gt;
This offers useful perceptions into the efficiency and performance of the model while making predictions based on the data.
&lt;br /&gt;


--------------------------------------------------------------------------------
title: "Create Endpoint"
description: "Churn For Bank Customers, Create a powerful ML pipeline that can be used to predict the churn using the Catalyst QuickML service."
last_updated: "2026-03-18T07:41:08.689Z"
source: "https://docs.catalyst.zoho.com/en/tutorials/churn-for-bank-customers/create-endpoint/"
service: "All Services"
related:
- Pipeline Endpoints (/en/quickml/help/pipeline-endpoints)

--------------------------------------------------------------------------------


# Create an endpoint

We will now create an endpoint for the above Deal Prediction model to allow external applications to interact with the model seamlessly and get predictions.

1. Navigate to the **Endpoints** component in the left menu and click {{%badge%}}Create Endpoint{{%/badge%}}.
&lt;br /&gt;

2. Provide a name for the endpoint in **Endpoint Name** field; (we&#39;ll name it **Churn Prediction For Bank Customers**), and select the model pipeline name from the dropdown values of the Choose Model field. Then click {{%badge%}}Create Endpoint{{%/badge%}}.
&lt;br /&gt;

3. Once the endpoint is created, you can view the endpoint&#39;s details page, as shown below. You can test the model by providing a sample request in the Request column and click on the  {{%badge%}}Get Result{{%/badge%}} button. This will generate the predicted value in the Response column.
&lt;br /&gt;

4. Click {{%badge%}}Publish{{%/badge%}} and use the endpoint URL to integrate the ML model with any other applications.
&lt;br /&gt;

{{%note%}}{{%bold%}}Note :{{%/bold%}} You can also check out {{%link href=&#34;/en/quickml/help/pipeline-endpoints/#external-oauth2-authentication&#34; %}}this document{{%/link%}} to implement pipeline authentication. This ensures secured access to endpoints, the ML models, and datasets.{{%/note%}}