Logistic Regression Practical part- 4

SAS Analytics Logistic Regression- Case Study & Practical
10 minutes
Share the link to this page
Copied
  Completed
You need to have access to the item to view this lesson.
One-time Fee
$69.99
List Price:  $99.99
You save:  $30
€65.51
List Price:  €93.59
You save:  €28.07
£56.31
List Price:  £80.45
You save:  £24.13
CA$95.76
List Price:  CA$136.81
You save:  CA$41.04
A$107.69
List Price:  A$153.85
You save:  A$46.15
S$95.27
List Price:  S$136.10
You save:  S$40.83
HK$548.21
List Price:  HK$783.19
You save:  HK$234.98
CHF 63.99
List Price:  CHF 91.41
You save:  CHF 27.42
NOK kr766.70
List Price:  NOK kr1,095.33
You save:  NOK kr328.63
DKK kr488.53
List Price:  DKK kr697.93
You save:  DKK kr209.40
NZ$117.93
List Price:  NZ$168.49
You save:  NZ$50.55
د.إ257.06
List Price:  د.إ367.25
You save:  د.إ110.18
৳7,672.34
List Price:  ৳10,960.96
You save:  ৳3,288.61
₹5,831.54
List Price:  ₹8,331.14
You save:  ₹2,499.59
RM334.37
List Price:  RM477.70
You save:  RM143.32
₦88,222.39
List Price:  ₦126,037.39
You save:  ₦37,815
₨19,408.82
List Price:  ₨27,728.08
You save:  ₨8,319.25
฿2,591.14
List Price:  ฿3,701.79
You save:  ฿1,110.65
₺2,276.07
List Price:  ₺3,251.67
You save:  ₺975.60
B$359.20
List Price:  B$513.17
You save:  B$153.96
R1,339.40
List Price:  R1,913.51
You save:  R574.11
Лв128.12
List Price:  Лв183.04
You save:  Лв54.91
₩96,245.98
List Price:  ₩137,500.16
You save:  ₩41,254.17
₪263.24
List Price:  ₪376.08
You save:  ₪112.83
₱4,030.89
List Price:  ₱5,758.67
You save:  ₱1,727.77
¥10,844
List Price:  ¥15,492.10
You save:  ¥4,648.09
MX$1,188.01
List Price:  MX$1,697.24
You save:  MX$509.22
QR253.94
List Price:  QR362.79
You save:  QR108.84
P970.94
List Price:  P1,387.12
You save:  P416.17
KSh9,413.65
List Price:  KSh13,448.65
You save:  KSh4,035
E£3,350.77
List Price:  E£4,787.02
You save:  E£1,436.25
ብር3,964.81
List Price:  ብር5,664.26
You save:  ብር1,699.44
Kz58,437.56
List Price:  Kz83,485.81
You save:  Kz25,048.25
CLP$66,749.46
List Price:  CLP$95,360.46
You save:  CLP$28,611
CN¥507.15
List Price:  CN¥724.53
You save:  CN¥217.38
RD$4,131.55
List Price:  RD$5,902.47
You save:  RD$1,770.91
DA9,407.33
List Price:  DA13,439.62
You save:  DA4,032.28
FJ$160.49
List Price:  FJ$229.29
You save:  FJ$68.79
Q543.50
List Price:  Q776.47
You save:  Q232.96
GY$14,625.47
List Price:  GY$20,894.43
You save:  GY$6,268.95
ISK kr9,845.49
List Price:  ISK kr14,065.59
You save:  ISK kr4,220.10
DH706.45
List Price:  DH1,009.26
You save:  DH302.80
L1,245.82
List Price:  L1,779.82
You save:  L534
ден4,030.78
List Price:  ден5,758.50
You save:  ден1,727.72
MOP$564.26
List Price:  MOP$806.12
You save:  MOP$241.86
N$1,339.20
List Price:  N$1,913.23
You save:  N$574.02
C$2,563.14
List Price:  C$3,661.78
You save:  C$1,098.64
रु9,324.38
List Price:  रु13,321.12
You save:  रु3,996.73
S/257.63
List Price:  S/368.06
You save:  S/110.42
K264.67
List Price:  K378.12
You save:  K113.45
SAR262.51
List Price:  SAR375.03
You save:  SAR112.52
ZK1,801.79
List Price:  ZK2,574.10
You save:  ZK772.30
L325.94
List Price:  L465.65
You save:  L139.71
Kč1,653.47
List Price:  Kč2,362.20
You save:  Kč708.73
Ft25,766.18
List Price:  Ft36,810.41
You save:  Ft11,044.22
SEK kr760.51
List Price:  SEK kr1,086.49
You save:  SEK kr325.98
ARS$61,084.05
List Price:  ARS$87,266.68
You save:  ARS$26,182.62
Bs481.20
List Price:  Bs687.45
You save:  Bs206.25
COP$273,176.37
List Price:  COP$390,268.69
You save:  COP$117,092.31
₡35,080.64
List Price:  ₡50,117.34
You save:  ₡15,036.70
L1,718.72
List Price:  L2,455.43
You save:  L736.70
₲516,687.24
List Price:  ₲738,156.27
You save:  ₲221,469.03
$U2,685.15
List Price:  $U3,836.09
You save:  $U1,150.94
zł283.11
List Price:  zł404.46
You save:  zł121.35
Already have an account? Log In

Transcript

In this video we will be setting up a cutoff probability level based on which we will be converting this estimated probability into a binary variable taking value zero or one and the variable name will be status. So let's do that. First, let's set up the cutoff gravity level. So we are using the data step we are doing DATA step two create the binary variable status. So data we are creating a separate data set called Data Fred underscore result. This is a duplicate data set that is meant to be created inside work.

Just because we did not specify any library name so the pred underscore result data set will be created inside work, set result. The result is the already created data set which is there inside work. So the result is the original data set, which is getting copied in credit this concept. So the data set which I'm specifying data statement that is a duplicate data set at the state data set, which specify in the set statement that is original data set. So this original data set is getting copied in the duplicate data set that is created dischord result now I'm seeing in order to set up the V set of the cutoff priority level in such a ways that if our estimated probability is greater than the cutoff probability level, we say it's an event otherwise, it's an non event that is here we are building a model illustration the dependent variable is calculating the probability for Y equals to one so if our estimated travel there are the probability that we have estimated that is also estimated for Y equals to one that is probability of an event is estimated that is a probability.

So if our predicted probability is greater than the setup, the cutoff probability level that we have set, then we say that a particular event will occur. So then the value of status will be one, because one means event, otherwise it won't occur then the value of status will be zero that is it will be a non event. So if predicted is a variable name is greater than point five. That is I have set the cutoff level as point five. We can change according to our choice, but generally we keep it as points at five. So, if our estimated probability for Y equals to one is greater than point five, then I'm creating a status variable which will be my daughter binary variable, then status will be equal to one.

One means it's an event. Otherwise, the status vd will will take over zero then, let's run the code. So before we run the code, let me explain all the code. So here we are doing a data step, we are creating a data set called pred underscore result which is going to be created inside work as we did not specify any library named Farah underscore result data set will be created inside work set result, this result is my original data set which is already there in the work so result is getting copied in pred underscore result now I have set the cutoff probability level as point five saying that is the predicted probability that is the estimated gravity of for vehicles to work that is for bikers to event in our model, which is showing that the customer will not be alone defaulted is greater than point five then status equals to one that means there we are predicting the status value was one.

That is if we are saying that the the greatest probability is greater than point five, we're saying that status data will be one that is the event will occur, one means event. And if the predicted probability is less than point five, that is else status will be zero. So, in this way, we are converting the stated probability variable into a binary variable, you know that we have to actually calculate the accuracy of the model. So, in order to calculate the accuracy of the model, our response variable, we need one observed response variable and predicted response variable. So, both has to be binary in nature. So, in this way we are converting this predicted probability into a binary variable called status, the name of the variable can be anything.

So if the greatest probability is greater than greater probability for Y equals to one is greater than the cutoff gravity level, which is point five x taken over here, then my filtered response variable whose name is status will be one. Otherwise status with Visio. So, let's run this code So let's open the data set credit disposals see the exact exactly whatever was given in the result data set that got copied over here, that is whatever we have in our original data set that is there, this is our response value, this is my observed response variable, which is already binary nature, which is given as binary nature. And this is this is an observed response variable, and this is my estimated probability that is a greater probability that we had predicted and this status is my predicted response variable, I have named it a status I've said that if the estimated profit is greater than point 8.5 is my cutter priority level, then the status value will be one otherwise, you see, here there's two main priorities point nine two they're probably got the status value as well again here the student priorities point three seven, then the we have the status value is zero.

So that means that whichever according to our cutter priority level if the estimator probabilities good Then point five, then we are then that to that individual, there is greater probability that that individual will not be a known defaulter. So the loan officer is going to give them the loan give them is more likely to give the loan to that individual. So whoever has an estimated probability greater than point five means they're more likely to get the loan because their probability y equals to one is going to be more, it is greater than point five means greater than 50% we have set the cutoff gravity level is 50%. point five we can set the cutoff priority level or do a choice but generally we keep the cutoff values 50%. So in this way, status we will create it and if the submitted priority is less than point five that means, that particular customer is less likely to get the loan.

So see over here this one priority is point three which is greater than which is less than point five. Therefore, the Status zero that is this customer the second customer is less likely to get a loan. So, this customer is more likely to be a loan default. Because probability by equals to one is the customer will not be alone default and probability by equals to zero is the customer will be a no loan default. So does I have an observe setter response variable in the bracket setters complete one is response and other is status. So in this way, I got this middle priority with status.

I've converted estimator priority to a status variable which is binary nature that is zero or one. And here my response value is also displayed, which are all observations. My response bad results district which for all the observations are one, because we are building the model for Y equals to one and the probability that is estimated that is also estimated probability for one close to one. So, the parameter that we have created that is estimated probability for Y equals to event Okay. Now, this month we are going to learn in this video that is we have got the status variable we have converted the estimate parameter to a status variable which is taking binary whether 01 by setting up a cutoff probability level which is called 50%. Before I move to the next video, let me recap the concepts here that we have done in these videos.

We have been working with the greatest analytics data set we have done step by selection, we have selected the significant variables by residual chi square test we have done Hushmail ensure goodness of fit is where we concluded the model is a good fit, we have generated classification table and we got the different measures of classification table at every level of probability from zero to one With a gap of 0.01 that is we got sensitivity specificity false positive false negative, correct incorrectly classified events correctly classified non events incorrectly classified events incorrectly classified non events, then percentage classified percentage correctly classified for every level of priority from zero to one. We got the analysis of maximum likelihood estimates table we got the odds ratio estimates table we got the table for percentage of concordance discordance and Taipei's were higher is the percentage of concordance better is our model, because less will be the miniscule Miss classification.

For our model, we have predicted the probability for Y equals to one then we have set up one cutoff gravity level is point five saying that if the freighter probability is greater than point five That is, if the estimator probability is greater than point five, then the status will be one, otherwise status will be zero. That means that the customers for whom the predicted probability are greater than point five, those customers are more likely to get the loan because they have maximum liability that they will not be alone default. There are maximum transfer those costs to customers that they will not be alone before death and those who have less than perfect their status is equal to zero because they are less likely to get the loan because they have maximum liability for vehicles to non event then y equals to event that is they are less likely to get the look as the record probability is less than point five.

So accordingly we have converted this estimated probability into a status variable which is binary nature taking value 01 saying that it's a greater priority is greater than point five when it is one otherwise it is zero. Thank you. Goodbye. see you for the next video.

Sign Up

Share

Share with friends, get 20% off
Invite your friends to LearnDesk learning marketplace. For each purchase they make, you get 20% off (upto $10) on your next purchase.