Data Preparation with DSTK ScriptWriter

10 minutes
Share the link to this page
Copied
  Completed
You need to have access to the item to view this lesson.
One-time Fee
$49.99
List Price:  $69.99
You save:  $20
€46.70
List Price:  €65.39
You save:  €18.68
£39.94
List Price:  £55.92
You save:  £15.98
CA$68.57
List Price:  CA$96.01
You save:  CA$27.43
A$76.53
List Price:  A$107.15
You save:  A$30.62
S$67.94
List Price:  S$95.13
You save:  S$27.18
HK$390.79
List Price:  HK$547.14
You save:  HK$156.34
CHF 45.61
List Price:  CHF 63.86
You save:  CHF 18.24
NOK kr553.82
List Price:  NOK kr775.40
You save:  NOK kr221.57
DKK kr348.39
List Price:  DKK kr487.78
You save:  DKK kr139.38
NZ$84.29
List Price:  NZ$118.01
You save:  NZ$33.72
د.إ183.60
List Price:  د.إ257.06
You save:  د.إ73.45
৳5,485.75
List Price:  ৳7,680.49
You save:  ৳2,194.74
₹4,172.64
List Price:  ₹5,842.03
You save:  ₹1,669.38
RM237.74
List Price:  RM332.86
You save:  RM95.11
₦61,737.65
List Price:  ₦86,437.65
You save:  ₦24,700
₨13,922.03
List Price:  ₨19,491.96
You save:  ₨5,569.92
฿1,847.10
List Price:  ฿2,586.09
You save:  ฿738.99
₺1,618.04
List Price:  ₺2,265.39
You save:  ₺647.34
B$259.65
List Price:  B$363.53
You save:  B$103.88
R930.41
List Price:  R1,302.64
You save:  R372.23
Лв91.35
List Price:  Лв127.90
You save:  Лв36.55
₩68,760.70
List Price:  ₩96,270.48
You save:  ₩27,509.78
₪187.33
List Price:  ₪262.29
You save:  ₪74.95
₱2,881.22
List Price:  ₱4,033.94
You save:  ₱1,152.71
¥7,761.78
List Price:  ¥10,867.12
You save:  ¥3,105.33
MX$847.89
List Price:  MX$1,187.12
You save:  MX$339.22
QR182.08
List Price:  QR254.93
You save:  QR72.84
P710.02
List Price:  P994.08
You save:  P284.06
KSh6,685.83
List Price:  KSh9,360.69
You save:  KSh2,674.86
E£2,398.89
List Price:  E£3,358.63
You save:  E£959.74
ብር2,859.67
List Price:  ብር4,003.77
You save:  ብር1,144.10
Kz41,816.63
List Price:  Kz58,546.63
You save:  Kz16,730
CLP$48,009.39
List Price:  CLP$67,216.99
You save:  CLP$19,207.60
CN¥361.91
List Price:  CN¥506.70
You save:  CN¥144.79
RD$2,909.49
List Price:  RD$4,073.53
You save:  RD$1,164.03
DA6,727
List Price:  DA9,418.34
You save:  DA2,691.34
FJ$113.07
List Price:  FJ$158.31
You save:  FJ$45.24
Q388.52
List Price:  Q543.96
You save:  Q155.44
GY$10,463.89
List Price:  GY$14,650.29
You save:  GY$4,186.39
ISK kr7,010.59
List Price:  ISK kr9,815.39
You save:  ISK kr2,804.80
DH505.48
List Price:  DH707.71
You save:  DH202.23
L884.07
List Price:  L1,237.78
You save:  L353.70
ден2,875.01
List Price:  ден4,025.24
You save:  ден1,150.23
MOP$402.80
List Price:  MOP$563.96
You save:  MOP$161.15
N$931.61
List Price:  N$1,304.33
You save:  N$372.72
C$1,835.88
List Price:  C$2,570.38
You save:  C$734.50
रु6,711.95
List Price:  रु9,397.27
You save:  रु2,685.31
S/188.15
List Price:  S/263.43
You save:  S/75.27
K192.92
List Price:  K270.11
You save:  K77.18
SAR187.48
List Price:  SAR262.49
You save:  SAR75.01
ZK1,338.41
List Price:  ZK1,873.89
You save:  ZK535.47
L232.39
List Price:  L325.37
You save:  L92.97
Kč1,173.84
List Price:  Kč1,643.47
You save:  Kč469.63
Ft18,183.27
List Price:  Ft25,458.03
You save:  Ft7,274.76
SEK kr546.32
List Price:  SEK kr764.90
You save:  SEK kr218.57
ARS$43,802.69
List Price:  ARS$61,327.27
You save:  ARS$17,524.58
Bs345.39
List Price:  Bs483.57
You save:  Bs138.18
COP$195,145.11
List Price:  COP$273,218.78
You save:  COP$78,073.66
₡25,506.16
List Price:  ₡35,710.66
You save:  ₡10,204.50
L1,238.25
List Price:  L1,733.65
You save:  L495.39
₲374,580.54
List Price:  ₲524,442.73
You save:  ₲149,862.19
$U1,916.38
List Price:  $U2,683.09
You save:  $U766.70
zł202.30
List Price:  zł283.24
You save:  zł80.93
Already have an account? Log In

Transcript

Okay, so now we will do the data preparation station using the D SDK script writer. Okay, so previously we done data understanding. So now we do the data preparation. Okay, so in data preparation, I'm going to remove my missing values, okay. Remove all the rows that has the missing values normalized, normalized data is the score for variable to formalize it in the scope or the variable tree and then select the variables. Okay, there we go the data to the variables Okay.

So now I do II Okay, import a target the variables are these I can remove Okay, so now I'm going to the prepare. So prepare remove missing values okay normalize we standard score. Okay. And any one more normalized score. Okay. These two are Tree Parable tour to entry okay?

So they're about to have a tree and the new pala name should be their own to normalize and essentially tree normalize okay okay I show you the data okay run the code okay. So you can see here. So the first one in the data preparation is the Remove missing values and the next one will be repair and normalized the robot to with the score normalize the global tree with a standard score. Okay. So the new color name will be to normalize and the rubber tree normalize. Okay, so we normalize this column, but they're about to renormalize this other variable tree.

Okay, so now we have additional columns here. He's trying to be proper to normalize essentially rubber tree and normalize Okay, so you notice Okay, so the rubber to normalize probatory normalize so the Assign should be empty. Then this one is man, Marquis, AC me okay they're about to normalize a rubber tree normalize so now I want to select all NIDA normalized values, normalize our variables. So, I come here oh hi pool view variables. And I come and prepare feature selection oh my gosh no ah prepare select variables. Okay, so that rail bus is Ah, this one is now trickier STL data.

So it was Lady normalizable variables. The son is variable zero for Bo wide Rubber to the rubber tree a robot for a robot. So, when you write your script I will suggest is you write by myself write a small small portion, then run the script and see the result and then continue on. So now I normalize the variables, I like to select these two variables, so should be universal 12345 So, now I select number five okay all by no spacing in between no spacing in the brackets. Okay. I will want to do the data and the variables okay run Cool Blender run the script okay.

So in rd I have selected the to normalize variables if you want you can export is data as a CSV file or maybe something the A CMP a CME something ACM cm e tree CSV separator comma is co equal true so we can spot these selector variables ah we can spot these selected normalize variables So let's see a variable run. Okay. Okay if I spot a to d a cmeg okay d by modified AC mi t reviewing notepad plus, plus Okay, so this is a pull up a CSV file okay. So VBS file you can do some of the law you can use these data those are modification ID D and then put it into the modeling stage or the evaluation stage okay. You may need to do some modifications so there is two comma here. So, you may want to replace two comma with a comma and replace all okay, you may want to do something like this To export CSV and then modify it accordingly to your next okay though we we have completed the data preparation stage the preparation stage okay.

Previously we do the data understanding stage now we do the data preparation stage. So I as body normalize data so that normalize data can be used in a modeling stage. This is very clear the prediction models or the classifiers okay so the classifier can use the prepared data and training data and so on. Okay.

Sign Up

Share

Share with friends, get 20% off
Invite your friends to LearnDesk learning marketplace. For each purchase they make, you get 20% off (upto $10) on your next purchase.