Logistic Regression Practical part- 3

SAS Analytics Logistic Regression- Case Study & Practical
9 minutes
Share the link to this page
Copied
  Completed
You need to have access to the item to view this lesson.
One-time Fee
$69.99
List Price:  $99.99
You save:  $30
€65.51
List Price:  €93.59
You save:  €28.07
£56.31
List Price:  £80.45
You save:  £24.13
CA$95.76
List Price:  CA$136.81
You save:  CA$41.04
A$107.69
List Price:  A$153.85
You save:  A$46.15
S$95.27
List Price:  S$136.10
You save:  S$40.83
HK$548.21
List Price:  HK$783.19
You save:  HK$234.98
CHF 63.99
List Price:  CHF 91.41
You save:  CHF 27.42
NOK kr766.70
List Price:  NOK kr1,095.33
You save:  NOK kr328.63
DKK kr488.53
List Price:  DKK kr697.93
You save:  DKK kr209.40
NZ$117.93
List Price:  NZ$168.49
You save:  NZ$50.55
د.إ257.06
List Price:  د.إ367.25
You save:  د.إ110.18
৳7,672.34
List Price:  ৳10,960.96
You save:  ৳3,288.61
₹5,831.54
List Price:  ₹8,331.14
You save:  ₹2,499.59
RM334.37
List Price:  RM477.70
You save:  RM143.32
₦88,222.39
List Price:  ₦126,037.39
You save:  ₦37,815
₨19,408.82
List Price:  ₨27,728.08
You save:  ₨8,319.25
฿2,591.14
List Price:  ฿3,701.79
You save:  ฿1,110.65
₺2,276.07
List Price:  ₺3,251.67
You save:  ₺975.60
B$359.20
List Price:  B$513.17
You save:  B$153.96
R1,339.40
List Price:  R1,913.51
You save:  R574.11
Лв128.12
List Price:  Лв183.04
You save:  Лв54.91
₩96,245.98
List Price:  ₩137,500.16
You save:  ₩41,254.17
₪263.24
List Price:  ₪376.08
You save:  ₪112.83
₱4,030.89
List Price:  ₱5,758.67
You save:  ₱1,727.77
¥10,844
List Price:  ¥15,492.10
You save:  ¥4,648.09
MX$1,188.01
List Price:  MX$1,697.24
You save:  MX$509.22
QR253.94
List Price:  QR362.79
You save:  QR108.84
P970.94
List Price:  P1,387.12
You save:  P416.17
KSh9,413.65
List Price:  KSh13,448.65
You save:  KSh4,035
E£3,350.77
List Price:  E£4,787.02
You save:  E£1,436.25
ብር3,964.81
List Price:  ብር5,664.26
You save:  ብር1,699.44
Kz58,437.56
List Price:  Kz83,485.81
You save:  Kz25,048.25
CLP$66,749.46
List Price:  CLP$95,360.46
You save:  CLP$28,611
CN¥507.15
List Price:  CN¥724.53
You save:  CN¥217.38
RD$4,131.55
List Price:  RD$5,902.47
You save:  RD$1,770.91
DA9,407.33
List Price:  DA13,439.62
You save:  DA4,032.28
FJ$160.49
List Price:  FJ$229.29
You save:  FJ$68.79
Q543.50
List Price:  Q776.47
You save:  Q232.96
GY$14,625.47
List Price:  GY$20,894.43
You save:  GY$6,268.95
ISK kr9,845.49
List Price:  ISK kr14,065.59
You save:  ISK kr4,220.10
DH706.45
List Price:  DH1,009.26
You save:  DH302.80
L1,245.82
List Price:  L1,779.82
You save:  L534
ден4,030.78
List Price:  ден5,758.50
You save:  ден1,727.72
MOP$564.26
List Price:  MOP$806.12
You save:  MOP$241.86
N$1,339.20
List Price:  N$1,913.23
You save:  N$574.02
C$2,563.14
List Price:  C$3,661.78
You save:  C$1,098.64
रु9,324.38
List Price:  रु13,321.12
You save:  रु3,996.73
S/257.63
List Price:  S/368.06
You save:  S/110.42
K264.67
List Price:  K378.12
You save:  K113.45
SAR262.51
List Price:  SAR375.03
You save:  SAR112.52
ZK1,801.79
List Price:  ZK2,574.10
You save:  ZK772.30
L325.94
List Price:  L465.65
You save:  L139.71
Kč1,653.47
List Price:  Kč2,362.20
You save:  Kč708.73
Ft25,766.18
List Price:  Ft36,810.41
You save:  Ft11,044.22
SEK kr760.51
List Price:  SEK kr1,086.49
You save:  SEK kr325.98
ARS$61,084.05
List Price:  ARS$87,266.68
You save:  ARS$26,182.62
Bs481.20
List Price:  Bs687.45
You save:  Bs206.25
COP$273,176.37
List Price:  COP$390,268.69
You save:  COP$117,092.31
₡35,080.64
List Price:  ₡50,117.34
You save:  ₡15,036.70
L1,718.72
List Price:  L2,455.43
You save:  L736.70
₲516,687.24
List Price:  ₲738,156.27
You save:  ₲221,469.03
$U2,685.15
List Price:  $U3,836.09
You save:  $U1,150.94
zł283.11
List Price:  zł404.46
You save:  zł121.35
Already have an account? Log In

Transcript

Now in this video we will be predicting the probability for y equal to even for every customer or for every individual where each observation denotes one individual who has applied for loan and this criteria probability will be created in a separate data set. So let's do that. Let's predict the probability for equal treatment. So we will be using the procedure proc logistic data. I hope you'll remember that before we run this procedure proc registry data we have to execute our live name statement to bring our data set in SAS environment. So I have shown you on how to execute a lipstick in our last video.

So the live name statement is already executed over here to bring the data set in the SAS environment. So my library name is minus one doc logistic underscore rake underscore German underscore back the S is a keyword to create the model for y equal to one that is equal to even now model responses may dependent variable i will mentioned the significant independent variables or I will specify the significant independent variables that are focus on significant significant independent variables. So let's specify them we have to specify the weight is given in the data set So these are the significant independent variables that is check account, duration, history save account, new car education guaranteed Neil single other installed, installed rate, amount used car, foreign and rent. These are the significant independent variables Now I'm using the statement output Output out out is the key word that he will be using to create the duplicate data set.

My duplicate data set name is resolved which will be created in Zed work because I have not specified in library name p equal to predict it which is used to predict the probability for Viper to event then run so now let's run this code. Before I run the code Let me explain you all the code. So we have used the procedure proc logistic details equal to library name dot data set name. d s is a key word to build a model for rivals to one that is very close to even modern response. As our dependent variable equal to these are the list of significant independent variables which we have obtained in our last result viewers, that is during stepwise selection these are the significant independent variables. That is check account duration history save account new car education character a single other install install rate amount used car foreign rent, we are creating a duplicate data set that is resolved where the predicted probabilities for y equal to one will be displayed.

So the predicted probabilities that is we're predicting the probability for y equal to one that parity priority will be displayed in the data set called resolve which is created in network as I did not specify any library name it will create interwork and then run so now let's run this code. To see here we got the table for percentage concordance percentage discordant percentage tight higher is better percentage amount of concordance better is our model that is less is the risk classification This is odds ratio estimates table this is my analysis of maximum likelihood estimates table that is maximum likelihood estimation is used to estimate the parameters for logistic regression. Okay we are building a model for Y equals to one and as we know that the number of levels for a response variable will be two because response variable is binary in nature. taking two values either zero or one zero denotes non event and one denotes even now let's see the data set that we have created that is resolved.

It is created inside work So, let's open the library work. It's open that he does it result. So, the result is a copy of the original data set that is logistic German bank. So then add it as it got popped in reserve. In addition to that, we have also got another variable that is the estimated growth This is the bank depravity. For Bibles to one that is a greater priority for each customer provide most to one arrivals treatment is distinct.

So, for each customer customer what is the estimated probability that the loan can be given to the customer or the customer will not be alone default is displayed over here this is estimated probability This is listed for all the thousand observations, the thousand observations or the thousand customers who had applied for the loan now, we will be learning this much in this video. Before we move to the next video let me recap the concepts that we have done in the last videos we are using an interest magnetics dataset where we had built a model for Y equals to one and y equals to one denotes that the customer will not be alone default default that that is loan can be given to that customer we had done supply selection for our model. To select the significant independent variables there are total independent variables out of which 14 independent variables are significant ones.

The significant variables are chosen using residual chi square test variation. Notice the model does not require any more variables and each one is a model requires more variables. This is the null hypothesis and alternative hypothesis for a suitable chi square test. We had also we also did Harshman MC goodness of fit test to check the goodness of fit for the model where we concluded that the model is a good fit because our P value is greater than our level of significance that is 0.05. Because the tables are all issue estimates we got the table for analysis of maximum likelihood estimates we also got the table of percentage of components discordance or type s were higher is the percentage of concordance better is our model that is less than his classification. We also generated a classification table for every level of probability from zero to one with a gap of 0.01.

And we got the different measures of classification table that is, correctly classified events correctly classified non events in Korea classified events if the president non events total amount of percentage correctly classified then sensitivity specificity false positive false negative for every level of probability from zero to one, we had created a data set called result where we had predicted the probability for Y equals to one for each customer. That is weird. predict the gravity for each customer, we are predicting the probability for Y equals to one for each customer means that what is the probability that the each customer can be given the loan or the customer will not be a loan default what is the probability that we had predicted and it was, it was displayed in the data set result to the plaintiff gravity for y equal to one is displayed in the data set result which is created in network now in our upcoming videos, we will be setting up a cutoff probability level to convert this estimated priority to a variable called status which is going to take binary value that is zero or one by setting up the cutoff gravity level we will be converting them estimated probability variable to a status variable which will take values zero or one and then we will be doing the confusion matrix between the observed response variable and the predicted response variable which will be a status variable and then we will be calculating the different measures of the confusion matrix manually For now, let's end this video over here.

Thank you. Good. Well, we'll see y'all for the next video.

Sign Up

Share

Share with friends, get 20% off
Invite your friends to LearnDesk learning marketplace. For each purchase they make, you get 20% off (upto $10) on your next purchase.