本文以干豆数据集为例,数据集下载位置如下:干豆数据集
import pandas as pd import sklearn import numpy as np
讯享网
数据读取与预处理
讯享网dry = pd.read_csv("Dry_Bean.csv")
在info返回的信息中的non-null也能看出数据集不存在缺失值。
dry.info()
讯享网<class 'pandas.core.frame.DataFrame'> RangeIndex: 13611 entries, 0 to 13610 Data columns (total 17 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 Area 13611 non-null int64 1 Perimeter 13611 non-null float64 2 MajorAxisLength 13611 non-null float64 3 MinorAxisLength 13611 non-null float64 4 AspectRation 13611 non-null float64 5 Eccentricity 13611 non-null float64 6 ConvexArea 13611 non-null int64 7 EquivDiameter 13611 non-null float64 8 Extent 13611 non-null float64 9 Solidity 13611 non-null float64 10 roundness 13611 non-null float64 11 Compactness 13611 non-null float64 12 ShapeFactor1 13611 non-null float64 13 ShapeFactor2 13611 non-null float64 14 ShapeFactor3 13611 non-null float64 15 ShapeFactor4 13611 non-null float64 16 Class 13611 non-null object dtypes: float64(14), int64(2), object(1) memory usage: 1.8+ MB
dry.head()

版权声明:本文内容由互联网用户自发贡献,该文观点仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容,请联系我们,一经查实,本站将立刻删除。
如需转载请保留出处:https://51itzy.com/kjqy/45790.html