mirror of https://github.com/01-edu/Branch-AI.git
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
1.3 KiB
1.3 KiB
Exercise 3 One hot Encoder
The goal of this exercise is to learn how to deal with Categorical variables using the OneHot Encoder.
X_train = [['Python'], ['Java'], ['Java'], ['C++']]
-
Using
OneHotEncoder
withhandle_unknown='ignore'
, fit the One Hot Encoder and transform X_train. The expected output is:('C++',) ('Java',) ('Python',) 0 0 0 1 1 0 1 0 2 0 1 0 3 1 0 0 To get this output create a DataFrame from the transformed X_train and the attribute
categories_
. -
Transform X_test using the fitted One Hot Encoder on the train set.
X_test = [['Python'], ['Java'], ['C'], ['C++']]
The expected output is:
| | ('C++',) | ('Java',) | ('Python',) |
|---:|-----------:|------------:|--------------:|
| 0 | 0 | 0 | 1 |
| 1 | 0 | 1 | 0 |
| 2 | 0 | 0 | 0 |
| 3 | 1 | 0 | 0 |