数据预处理第一天

it2022-05-20  67

步骤一:导入库 1 import numpy as np 2 import pandas as pd View Code 步骤二:导入数据集 34 _______________________________ 5 dataset=ps.read_csv('Data.cab') 6 X=dataset.bloc[:,:-1].values 7 Y=dataset.iloc[:,3].values

 

步骤三:处理丢失的数据 1 from sklearn.preprocessing import Imputer 2 imputer = Imputer(missing_values = "NaN", strategy = "mean", axis = 0) 3 imputer = imputer.fit(X[ : , 1:3]) 4 X[ : , 1:3] = imputer.transform(X[ : , 1:3]) View Code

 

步骤四:编码分类的数据 步骤五:将数据集分成训练集和测试集步骤六:功能缩放

转载于:https://www.cnblogs.com/futheworld/p/9427413.html


最新回复(0)