MITB Banner

Deployment of Predictive Models

Share

In business and industry managers often need to predict the behavior of an important parameter that plays a vital role on the health of the business.  These could include, among others,  concerns such as,

  • Identifying whether a particular loan applicant is likely to default
  • Understanding the product purchase preference of individual customers
  • Identifying the customers who are likely to leave the organisation’s services
  • Predicting the likely purchase spend for customers with given demographic profiles

Predictive models are built by observing the past behavior of the parameter of interest as well as the effect of those factors that affect its behavior, and then building a mathematical relationship between them.  For example, in order to predict loan default, relevant historical on all the past defaulters would be studied in order to establish a mathematical relationship between defaulting behavior and other characteristics of the defaulters.  This relationship is termed a Predictive Model.

 Deployment of Predictive Models

The name Predictive Model derives from the fact that given a mathematical relationship, and the values of the affecting parameters – often termed the predictors – it is possible to predict the value of the parameter of interest, which is itself called the dependent variable.

In simpler scenarios, a Predictive Model may be represented generically as,

y = f(x1, x2, x3, x4…. xn)

 

Where  y  is the dependent variable, i.e. the parameter whose outcome  we are interested in predicting, and  x1, x2, x3, x4…. xn are the independent predictors, which affect the behavior of y.

Note that, in more complex cases the model may become more challenging, with the outcome of y becoming dependent not one, but on a system of equations.

Building Predictive Models

Predictive Models can be built using a variety of techniques,  including

  • Multiple Regression
  • Logistic Regression
  • Neural Networks
  • Decision Trees
  • Cluster Analysis

The choice of technique generally depends on the nature of the problem as well as the characteristics of the variable to be used in the analysis.

These techniques are available in Predictive Analytics software such as SPSS and R.

Scoring

Scoring is the method of substituting the value of the independent predictors in the mathematical relationship obtained by the modeling process, in order to obtain the ‘prediction’ for the parameter of interest. In the above example, newly observed values of the predictor variables Age, Gender, Income, Marital Status and Profession may be applied to the mathematical relationship, in order to obtain a predicted value for the parameter ‘Defaulter’ .

Currently scoring may be done in about three ways

  1. Manually input the values and calculate on pen and paper
  2. Build the relationship as a spreadsheet formula, and the input the values
  3. Run scoring in a batch mode in the software where the model was built  in the first place, if this feature is supported

In the first two cases where there are significant challenges of computation and understanding  of the theory behind the model,  in the last case the scoring gets restricted to only those who have access to modeling software.

The Challenge of Usability

The model  y = f(x1, x2, x3, x4…. xn) is very powerful.  For by predicting likely outcomes,  it allows decision makers at all levels  to decide their future  course of action more objectively. However, given the skill levels required to actually use it, its use is not in the easy reach of those who can benefit the most, but yet do not have the necessary skills to understand complex mathematical models.  This is depicted in figure 1, which also shows that in comparison, the humble calculator, though not a very powerful tool,  is nonetheless very easy to use.

 Deployment of Predictive Models2

Deployment

In figure 1 above, can we shift the model to the right , so that while retaining its power , it becomes as easy to use as the calculator ? This is the challenge that deployment addresses.

In other words, deployment is the process of presenting a Predictive Model to the untrained end-user, so that while he or she can take full advantage of its power to predict outcomes,  without  an associated learning curve.

These are  end users who need to predict, but

  • Do not have any background or understanding of the mathematical principles of Predictive Models
  • Do not have access to expensive analytical software
  • Maybe geographically distributed
  • May need frequent changes or updates to models as business parameters and priorities change

This can be achieved by translating the model via an application into an online form, somewhat like in figure 2, where the user inputs the values of the independent variables and the application returns the predicted outcome at the click of a button.

Deployment of Predictive Models3

Figure 2. Online Prediction Form

Once we are able to derive this form from the Predictive Model,  it in turn could be delivered to geographically dispersed users either via browsers (figure 3) or Smartphones (figure 4)

Deployment of Predictive Models4

Figure 3. Online Prediction on a Browser

 Deployment of Predictive Models5

Figure 4. Prediction on Smartphone

Once this is achieved we have an effective system for deployment that makes it possible for the untrained end-user to harness the power of Predictive Models as easily as a calculator, as shown in figure 5.

Deployment of Predictive Models6

Deployment represents the last stage in the analytics process. With this in place, and the power of Predictive Modeling available to all users in the organization, Predictive Modeling can now be operationalised. It can be used in day to day operational decision making rather than being restricted in its use in an organizational niche for strategic purposes only,

Deployment with Analuo

Deployment of Predictive Models7

Figure 6 : Deployment with Analuo

Analuo is a deployment portal for analytics, intended to enable organizations in operationalising their Predictive Analytics initiative. With its patent pending technology for online prediction, addresses this challenge of deployment.  As depicted in figure 6, an analyst uses data in SPSS or R to build a Predictive Model.  This Model is now exported out as a PMML file. PMML, standing for Predictive Model Markup Language is an open standard defined by the Data Mining Group (www.dmg.org)   for defining and sharing Predictive Models.

The Predictive Model, in PMML format, is uploaded to Analuo, which dynamically creates a interface, in the shape of a Prediction Form based on the model, which can is delivered either via a browser or on a Smartphone using Analuo’s Android app.  An end-user, even though not skilled in the techniques of Predictive Modeling, now has to simply enter the values of the independent variables or predictors in the form to obtain a prediction.

PS: The story was written using a keyboard.
Share
Picture of Sumit

Sumit

Sumit has over twenty years of experience in Predictive Analytics, the last fourteen of which were with the SARC and India operations of SPSS, which he helped start. During this period he was instrumental in raising SPSS from obscurity and making it one of the most widely used and accepted statistical software in India. Having seen from close quarters the use of analytics in research, academics, industry and government, he was inspired to create Analuo, something that he feels will help bring the benefits of Predictive Modelling to grass-root levels.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India