步骤一:导入库
1 import numpy as np
2 import pandas as pd
View Code
步骤二:导入数据集
34 _______________________________
5 dataset=ps.read_csv(
'Data.cab')
6 X=dataset.bloc[:,:-1
].values
7 Y=dataset.iloc[:,3].values
步骤三:处理丢失的数据
1 from sklearn.preprocessing
import Imputer
2 imputer = Imputer(missing_values =
"NaN", strategy =
"mean", axis =
0)
3 imputer = imputer.fit(X[ : , 1:3
])
4 X[ : , 1:3] = imputer.transform(X[ : , 1:3])
View Code
步骤四:编码分类的数据 步骤五:将数据集分成训练集和测试集步骤六:功能缩放
转载于:https://www.cnblogs.com/futheworld/p/9427413.html