使用Excel分析身高体重表
要使用excel分析数据首先要在文件->选项->加载项->分析工具库打开数据分析功能(默认是没有打开的) data:image/s3,"s3://crabby-images/2a3b5/2a3b527d48723c92e2af1eaf6cf994e5d74705c1" alt="在这里插入图片描述" 选择数据->数据分析->回归 data:image/s3,"s3://crabby-images/3f6f1/3f6f139bb3d6847eae622d016b69367addcd3462" alt="请添加图片描述" 选择x,y的值,勾选标准残差和线性拟合图 data:image/s3,"s3://crabby-images/de391/de391832312bf21211e63e116e06e82fef2f66bf" alt="请添加图片描述" 右键拟合图选择现价趋势线->线性->显示公式->显示R的平方值->确定 data:image/s3,"s3://crabby-images/57896/5789610fdba73f3b1bd0268357e1b2e2439d18fe" alt="请添加图片描述"
选取20组数据进行分析
data:image/s3,"s3://crabby-images/70b8d/70b8dfbab0ec30d54a72aad1720fadc63471e3d3" alt="请添加图片描述"
选取200组数据进行分析
data:image/s3,"s3://crabby-images/88917/88917ab4dcb2694a70423a10fb880f0b7ada845f" alt="请添加图片描述"
选取2000组数据进行分析
data:image/s3,"s3://crabby-images/47700/47700256c80a38aa9b746a12fd3dd03b718b6f3c" alt="在这里插入图片描述"
选取20000组数据进行分析
data:image/s3,"s3://crabby-images/9ef1f/9ef1fd344c0ffffe5cbc63343ad2a5fbd68a1a7d" alt="在这里插入图片描述"
用jupyter notebook编程使用最小二乘法分析身高体重表
在jupyter的web网页中新建python3文件
注意数据文件的编码方式要是utf-8,如果是gbk会报错
data:image/s3,"s3://crabby-images/944e2/944e21bbc511fe82e8e7c15c12761eb597327178" alt="在这里插入图片描述"
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
points = np.genfromtxt("D:\Download\weights_heights.csv",delimiter=",")
x=points[1:21,1];
y=points[1:21,2];
x_mean = np.mean(x)
y_mean = np.mean(y)
xsize = x.size
zi = (x * y).sum() - xsize * x_mean *y_mean
mu = (x ** 2).sum() - xsize * x_mean ** 2
a = zi / mu
b = y_mean - a * x_mean
a = np.around(a,decimals=2)
b = np.around(b,decimals=2)
print(f'回归线方程:y = {a}x + {b}')
y1 = a*x + b
plt.scatter(x,y)
plt.plot(x,y1,c='r')
选取20组数据进行分析
data:image/s3,"s3://crabby-images/428cb/428cb44ebf856f46c3652733d5224ae49908bb56" alt="在这里插入图片描述"
选取200组数据进行分析
data:image/s3,"s3://crabby-images/3a70e/3a70eb3c019b16064d54b8261e005c3360246954" alt="请添加图片描述"
选取2000组数据进行分析
data:image/s3,"s3://crabby-images/44c6e/44c6eec8acd2f85d48ae285c869c0b40bd28b482" alt="请添加图片描述"
选取20000组数据进行分析
data:image/s3,"s3://crabby-images/76338/76338be599e6774e08d0cf44d27c45db2effcaee" alt="请添加图片描述"
用jupyter notebook编程借助skleran分析身高体重表
import pandas;
import matplotlib;
from pandas.plotting import scatter_matrix;
data = pandas.read_csv("D:\Download\weights_heights.csv",engine='python',encoding='utf-8')
font = {
'family' : 'SimHei'
}
matplotlib.rc('font', **font)
scatter_matrix(
data[["Height", "Weight"]],
figsize=(10, 10), diagonal='kde'
)
data[["Height", "Weight"]].corr()
x = data[["Height"]]
y = data[["Weight"]]
from sklearn.linear_model import LinearRegression
lrModel = LinearRegression()
lrModel.fit(x, y)
R2=lrModel.score(x, y)
print("R的平方:",R2)
lrModel.coef_
data:image/s3,"s3://crabby-images/fa02b/fa02b17539526e153f541ab084eebceaad0e0310" alt="在这里插入图片描述"
lrModel.intercept_
data:image/s3,"s3://crabby-images/f5f0b/f5f0bda9d8d7253c7917e70c42a9515ba4470e2c" alt="在这里插入图片描述"
总结
data:image/s3,"s3://crabby-images/9ef1f/9ef1fd344c0ffffe5cbc63343ad2a5fbd68a1a7d" alt="在这里插入图片描述" data:image/s3,"s3://crabby-images/76338/76338be599e6774e08d0cf44d27c45db2effcaee" alt="请添加图片描述" data:image/s3,"s3://crabby-images/91772/917723e0b87b0e9e905a29f5784442ab8de981ce" alt="在这里插入图片描述" 综上可以看出excel的拟合误差较大,最小二乘法拟合效果与skleran分析拟合效果相近。
参考
|