SQL SAS Essentials

14 minutes
Share the link to this page
Copied
  Completed
You need to have access to the item to view this lesson.
One-time Fee
$69.99
List Price:  $99.99
You save:  $30
€64.96
List Price:  €92.81
You save:  €27.84
£55.77
List Price:  £79.68
You save:  £23.90
CA$95.68
List Price:  CA$136.70
You save:  CA$41.01
A$106.02
List Price:  A$151.47
You save:  A$45.44
S$94.41
List Price:  S$134.88
You save:  S$40.47
HK$546.80
List Price:  HK$781.18
You save:  HK$234.37
CHF 63.34
List Price:  CHF 90.49
You save:  CHF 27.15
NOK kr761.11
List Price:  NOK kr1,087.35
You save:  NOK kr326.23
DKK kr485.02
List Price:  DKK kr692.92
You save:  DKK kr207.89
NZ$116.42
List Price:  NZ$166.33
You save:  NZ$49.90
د.إ257.06
List Price:  د.إ367.25
You save:  د.إ110.18
৳7,660.01
List Price:  ৳10,943.35
You save:  ৳3,283.33
₹5,835.78
List Price:  ₹8,337.18
You save:  ₹2,501.40
RM331.75
List Price:  RM473.95
You save:  RM142.20
₦86,437.65
List Price:  ₦123,487.65
You save:  ₦37,050
₨19,416.31
List Price:  ₨27,738.77
You save:  ₨8,322.46
฿2,572.74
List Price:  ฿3,675.50
You save:  ฿1,102.76
₺2,264.43
List Price:  ₺3,235.04
You save:  ₺970.61
B$356.70
List Price:  B$509.60
You save:  B$152.89
R1,295.44
List Price:  R1,850.72
You save:  R555.27
Лв127.05
List Price:  Лв181.51
You save:  Лв54.46
₩94,909.58
List Price:  ₩135,590.93
You save:  ₩40,681.35
₪259.50
List Price:  ₪370.74
You save:  ₪111.23
₱3,993.87
List Price:  ₱5,705.78
You save:  ₱1,711.90
¥10,712.31
List Price:  ¥15,303.96
You save:  ¥4,591.65
MX$1,187.89
List Price:  MX$1,697.07
You save:  MX$509.17
QR254.57
List Price:  QR363.69
You save:  QR109.12
P950.82
List Price:  P1,358.38
You save:  P407.55
KSh9,247.76
List Price:  KSh13,211.65
You save:  KSh3,963.89
E£3,352.12
List Price:  E£4,788.95
You save:  E£1,436.83
ብር4,006.43
List Price:  ብር5,723.72
You save:  ብር1,717.28
Kz58,511.64
List Price:  Kz83,591.64
You save:  Kz25,080
CLP$65,950.47
List Price:  CLP$94,219
You save:  CLP$28,268.52
CN¥506.53
List Price:  CN¥723.64
You save:  CN¥217.11
RD$4,055.76
List Price:  RD$5,794.19
You save:  RD$1,738.43
DA9,420.16
List Price:  DA13,457.95
You save:  DA4,037.79
FJ$157.70
List Price:  FJ$225.30
You save:  FJ$67.59
Q542.52
List Price:  Q775.06
You save:  Q232.54
GY$14,601.52
List Price:  GY$20,860.22
You save:  GY$6,258.69
ISK kr9,764.23
List Price:  ISK kr13,949.49
You save:  ISK kr4,185.26
DH703.98
List Price:  DH1,005.73
You save:  DH301.75
L1,236.34
List Price:  L1,766.28
You save:  L529.93
ден3,998.59
List Price:  ден5,712.52
You save:  ден1,713.92
MOP$561.77
List Price:  MOP$802.57
You save:  MOP$240.79
N$1,291.99
List Price:  N$1,845.78
You save:  N$553.78
C$2,569.36
List Price:  C$3,670.67
You save:  C$1,101.31
रु9,319.09
List Price:  रु13,313.56
You save:  रु3,994.46
S/260.54
List Price:  S/372.22
You save:  S/111.67
K269.79
List Price:  K385.44
You save:  K115.64
SAR262.50
List Price:  SAR375.02
You save:  SAR112.51
ZK1,882.68
List Price:  ZK2,689.66
You save:  ZK806.98
L323.40
List Price:  L462.03
You save:  L138.62
Kč1,628.77
List Price:  Kč2,326.92
You save:  Kč698.14
Ft25,305.79
List Price:  Ft36,152.68
You save:  Ft10,846.88
SEK kr755.02
List Price:  SEK kr1,078.64
You save:  SEK kr323.62
ARS$61,468.17
List Price:  ARS$87,815.44
You save:  ARS$26,347.26
Bs483.33
List Price:  Bs690.51
You save:  Bs207.17
COP$271,845.87
List Price:  COP$388,367.89
You save:  COP$116,522.02
₡35,672.25
List Price:  ₡50,962.55
You save:  ₡15,290.29
L1,724.16
List Price:  L2,463.20
You save:  L739.03
₲522,510.75
List Price:  ₲746,475.93
You save:  ₲223,965.17
$U2,674.97
List Price:  $U3,821.56
You save:  $U1,146.58
zł281.37
List Price:  zł401.98
You save:  zł120.60
Already have an account? Log In

Transcript

Welcome to clinical data management program using SAS. In this video we will be discussing about SQL SAS essentials. So at first the data set that we are going to use, we have to bring the data set in the SAS environment. So for that we have to create the live name statement, select name library name is CBN, then we need to give the pack so here, we copy the path. This is our path includes the double quotes and then we give semicolon, so let's run this code. So these are my data sets.

Now let's start with proc SQL. So our procedure starts with the statement, proc SQL select star, which stands for that is, which stands for selecting all the variables that is we want to display all the variables in our result view. So that's what we're doing select star asterisk stands for all the variables from CDM is my library. name dot disease semicolon and then quit. Let's run this code. over here to this is my disease data set which consists of the variables like ID gender, date of birth zip code, a Brahmin status, education, marital status, children ancestry, average commute, daily internet use available services, military service and disease, and it consists of around 2000 observations.

So over here when we do progress can select Start means we want to display all the variables in our viewer from CGM for disease, this is our source table and then quit. So this is our result viewer. We have to give the non title statement to nullify the title. So we can give title and semicolon to modify the data See now the data has gone. So all the variables and all the observations are displayed over here because we haven't applied any filtering or any other modification. So, all observations with all the variables are displayed next.

Now, suppose we want to apply some filter to our table that is we want to display only those observations where marital status is equals to married an average commute is greater than 30. So how will we proceed using the same procedure proc SQL Select star from CDM doc disease with marital status Equals married and average commute is greater than 30 and then quit. So only those observations with a marital status is married. And average commute greater than 30 days this is the average commute variable is displayed. So I've just written 30 and marital status equals to married only those observations are displayed, which fulfills these two criteria such as marital status equals to married an average commute greater than 30. Next, suppose we after applying the filter on the data as marital status equals to married or average commute, greater than 30.

We also want to use one we want to sorta result in table by daily turret using descending order. So how do we proceed? So for this we are going to use proc SQL select star. That is we want to display all the variables from CG dot disease Vish merited status equals married this commute waited all day by day didn't use descending. So we give the keyword is de sc. That is we want to sort our resulting table with respect to the variable data introduce in descending order and then quit so let's run this code SR Dale introduces sorted in descending order.

All observations the average commute greater than 30 and marital status equals to monitor displayed. Next, suppose we want good displayed the variables or disease and we want to sum up average commute for each disease and then we are going to want to order a certain table with respect to disease, then how do we proceed so we'll be using proc SQL we want the sum of average compute for each disease. So basically we have to group the data with respect to disease so we'll be using the GROUP BY clause over here. So first let's give the variables that you specify in SELECT statement we want the variable as disease to come. We want to sum of average commute as some from CDM doc disease grew by disease Having disease equals to skin cancer and then we are doing order by disease. Then basically over here we want to display the variables disease and you own the sum of the average commute for each disease and we want to display only those diseases for skin cancer.

So, after we group we are applying the having clause. That means on the group data this filter will be applied so only the disease for skin cancer will be displayed. So the sum of average commute only for the diseases cancer will be displayed. So see only the disease for skin cancer and the sum of average communities district For the last case we were not getting because there was a syntax error as we gave sericata for each and every step. So, just we have removed the semicolon. So we got the proper output.

Now suppose we want the sum of average commute for each and every disease not only for skin cancer, but for each and every disease. So how do we proceed? slay proc SQL select disease, coma some average commute some from CDM dot disease, group disease and then audible disease and then quit. So let's run this code to see for each of the diseases that some of our risk communities displayed for Alzheimer's disease for HIV for breast cancer, diabetes, endometriosis, gastritis, heart disease, hypertension and kidney disease, multiple sclerosis, prostate cancer, schizophrenia, skin cancer and etc. Now suppose that after playing the group tour filter data that is we have the table where we are getting the sum of average commute for the disease can cancel but we want a filter applied to this table that is we want the sum to be calculated only for those observations where my employment status is returned.

That means I want to display the sum of average commute of the disease was suffering from skin cancer, provided their employment status is equals to two Retired that is they are retired people or their employment status is retired. How do we proceed with it we'll be using again progress keyword select disease, comma PSA average commute as someone from CDM dot disease there. Now here we are applying the filter. This is not the end of our query. So we cannot leave semicolon where employment status equals retired. group by disease having disease equals to skin cancer ordered by disease and then quick so let's run this code here here.

We want to display those variables like disease. Some of average computer someone for the disease's skin cancer. It is grouping with respect to disease and the data is filtered. There is employment status has to be equal to retire. So let's run this code. To see here, the skin cancer patients the sum of average commute is displayed, provided their employment status is equal to retired.

So only for those patients are displayed where their employment status is recorded or retired. Previously, all the patients were displayed that's where my son was born. Now see the summons reduced, because I have also added one folder that is employment status has to be retired. The next suppose I want to display 50 observations from my data. So how will it do that profit scale out tops equals to 50 that is it will take the first 50 observations, then select Select star from CDM doc disease then quit so let's run this code only 50 observations are displayed from our data out of sequence 250 mins it limits the number of observations only 50 observations are displayed. Similarly suppose we want to display only unique diseases that is we do not want the repetitions of disease names we want the unique values of the variable disease.

So how do we proceed we'll do progress keyword, semicolon, select distinct disease that is unique values of disease disease without duplicates will be displayed from CDM dot disease semicolon and then let's run this code to see only the unique names of the diseases without their deeply in repetitions or duplicates a dispute. So there are these are the different diseases which the patient is suffering from the disease Alzheimer's disease, HIV, breast cancer, diabetes, endometriosis, gastritis, heart disease, hypertension, kidney disease, multiple sclerosis, prostate cancer specific skin cancer. So in this video, we'll be doing till here in my next video, I'll be discussing about the next part of the SQL classes and just for now, let me end this video over here. Thank you Goodbye. I'll see you for the next video.

Sign Up

Share

Share with friends, get 20% off
Invite your friends to LearnDesk learning marketplace. For each purchase they make, you get 20% off (upto $10) on your next purchase.