EduCoder Platform: Machine Learning, Linear Regression

Level 1: Simple Linear Regression and Multiple Linear Regression

1.BC
2.ABC
3.A

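Level 2: Loss Function of Logistic Regression

Programming requirements:
The data in this exercise is univariate (a single feature). Use pandas to read the data file and label the two columns Population and Profit.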

data = pd.read_csv(path, header=  , names=[ '  ', '  ' ])

The code is as follows:

#encoding=utf8
import os
import pandas as pd

if __name__ == "__main__":
    path = os.getcwd() + '/ex1data1.txt'
    # Read the file into `data` with pandas and name the columns 'Population' and 'Profit'
    #********* begin *********#
    data = pd.read_csv(path, header=None, names=['Population', 'Profit'])
    #********* end *********#
    print(data.shape)
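
The effect of header=None and names= can be seen without the exercise file. The following is a minimal sketch, assuming the file is a header-less, comma-separated, two-column text file; the three sample rows are illustrative values only and are fed in through io.StringIO instead of a real path.

```python
import io
import pandas as pd

# Illustrative rows in the assumed layout: no header line, two comma-separated columns.
sample = io.StringIO("6.1101,17.592\n5.5277,9.1302\n8.5186,13.662\n")

# header=None tells pandas that the first row is data, not column names;
# names= then supplies the column labels.
data = pd.read_csv(sample, header=None, names=['Population', 'Profit'])

print(data.shape)          # (3, 2)
print(list(data.columns))  # ['Population', 'Profit']
```

Without header=None, pandas would treat the first data row as the header and the frame would have one row fewer.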

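Level 3: Computing the Loss Function

Programming requirements:

Based on the formula above, write the loss function computeCost(X, y, theta), which returns cost.

X: the univariate data matrix, i.e. the Population data;
y: the target data, i.e. the Profit data;
theta: the model parameters;
cost: the value of the loss function.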
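
The formula that the task statement refers to is not reproduced in this text. Judging from the solution code for this level, it is presumably the usual squared-error cost. In the notation assumed here, $m$ is the number of samples and $h_\theta(x) = \theta^{T} x$ is the model's prediction:

```latex
J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right)^{2}
```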

The code is as follows:

#encoding=utf8
import numpy as np

def computeCost(X, y, theta):
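    # Compute the loss from the formula: squared residuals, summed and divided by 2m.
    # X and theta are expected to be np.matrix objects, so `*` is matrix multiplication.
    #********* begin *********#
    inner = np.power(((X * theta.T) - y), 2)
    cost = np.sum(inner) / (2 * len(X))
    # The platform's expected output for this level is 32.0727338775.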
    #********* end *********#
    return cost
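
To check the loss computation by hand, it can be run on a tiny data set. The following is a sketch using plain ndarrays and the `@` operator instead of np.matrix; the helper name compute_cost and the toy data (y = 2x with a leading column of ones) are assumptions made for illustration, not part of the exercise.

```python
import numpy as np

def compute_cost(X, y, theta):
    # Same formula as computeCost, written for plain ndarrays:
    # X is (m, n), y is (m, 1), theta is (1, n).
    residual = X @ theta.T - y
    return float(np.sum(residual ** 2) / (2 * len(X)))

# Hypothetical toy data: a column of ones plus one feature, with y = 2x.
X = np.array([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([[2.0], [4.0], [6.0]])

# With theta = [0, 0] the residuals are -2, -4, -6, so the cost is (4 + 16 + 36) / 6.
cost_zero = compute_cost(X, y, np.array([[0.0, 0.0]]))
# With theta = [0, 2] the model fits the data exactly, so the cost is 0.
cost_exact = compute_cost(X, y, np.array([[0.0, 2.0]]))

print(cost_zero, cost_exact)
```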

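Level 4: Obtaining the Linear Model by Gradient Descent

Programming requirements:
[Image: gradient descent update formula]
Based on the formula above, write the gradient descent function gradientDescent(X, y, theta, alpha, iters), which returns theta and cost.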

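X: the univariate data matrix, i.e. the Population data;
y: the target data, i.e. the Profit data;
theta: the model parameters;
m: the number of samples;
α: the learning rate.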
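
The formula image is not preserved in this text. Consistent with the solution code for this level, the update rule is presumably the standard batch gradient descent step, applied to every $\theta_j$ simultaneously. Here $h_\theta(x) = \theta^{T} x$ and $x_j^{(i)}$ (the $j$-th feature of sample $i$) are assumed notation:

```latex
\theta_j := \theta_j - \alpha \, \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)}
```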

The code is as follows:

#encoding=utf8
import numpy as np

def computeCost(X, y, theta):
    inner = np.power(((X * theta.T) - y), 2)
    return np.sum(inner) / (2 * len(X))

def gradientDescent(X, y, theta, alpha, iters):
    temp = np.matrix(np.zeros(theta.shape))
    parameters = int(theta.ravel().shape[1])
    cost = np.zeros(iters)
    
    for i in range(iters):
        error = (X * theta.T) - y
        
        for j in range(parameters):
            #********* begin *********#
            term=np.multiply(error,X[:,j])
            temp[0,j]=theta[0,j]-((alpha/len(X))*np.sum(term))
            #********* end *********#
        theta = temp
        cost[i] = computeCost(X, y, theta)
        
    return theta, cost
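
The inner loop over j can also be written as a single matrix product. The following is a sketch of that vectorized variant using plain ndarrays; the function name gradient_descent, the synthetic data (y = 1 + 2x, noise-free) and the learning rate are assumptions chosen for illustration and are not the exercise's data or settings.

```python
import numpy as np

def gradient_descent(X, y, theta, alpha, iters):
    # Vectorized form of the per-parameter loop: every theta_j is updated
    # simultaneously from the same error vector.
    m = len(X)
    cost = np.zeros(iters)
    for i in range(iters):
        error = X @ theta.T - y                      # shape (m, 1)
        theta = theta - (alpha / m) * (error.T @ X)  # shape (1, n)
        cost[i] = np.sum((X @ theta.T - y) ** 2) / (2 * m)
    return theta, cost

# Hypothetical noise-free data generated from y = 1 + 2x.
x = np.linspace(0.0, 1.0, 20).reshape(-1, 1)
X = np.hstack([np.ones_like(x), x])
y = 1.0 + 2.0 * x

theta, cost = gradient_descent(X, y, np.zeros((1, 2)), alpha=0.5, iters=5000)
print(theta)              # close to [[1. 2.]]
print(cost[0], cost[-1])  # the cost decreases towards 0
```

Because the error vector is computed once per iteration before any parameter changes, this variant performs the same simultaneous update as the loop version.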

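Level 5: Building the Complete Linear Regression Model

Programming requirements:

Building on the previous three levels, assemble a complete linear regression model from start to finish. The three parts to write are data loading, the loss function, and the gradient descent function.

The code is as follows: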

#encoding=utf8

import os
import numpy as np
import pandas as pd

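# Load and preprocess the data
path = os.getcwd() + '/ex1data1.txt'
#********* begin *********#
# header=None marks the file as having no header row (header=-1 is rejected by current pandas)
data = pd.read_csv(path, header=None, names=['Population', 'Profit'])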
#********* end *********#
data.insert(0, 'Ones', 1)
cols = data.shape[1]
X = data.iloc[:,0:cols-1]
y = data.iloc[:,cols-1:cols]

# Initialize the parameters
X = np.matrix(X.values)
y = np.matrix(y.values)
theta = np.matrix(np.array([0,0]))
alpha = 0.01
iters = 1000

# Define the loss function
def computeCost(X, y, theta):
    #********* begin *********#
    inner = np.power(((X * theta.T) - y), 2)
    cost = np.sum(inner) / (2 * len(X))
    #********* end *********#
    return cost

# Define the gradient descent function
def gradientDescent(X, y, theta, alpha, iters):
    temp = np.matrix(np.zeros(theta.shape))
    parameters = int(theta.ravel().shape[1])
    cost = np.zeros(iters)
    
    for i in range(iters):
        error = (X * theta.T) - y
        
        for j in range(parameters):
            #********* begin *********#
            term=np.multiply(error,X[:,j])
            temp[0,j]=theta[0,j]-((alpha/len(X))*np.sum(term))

            #********* end *********#            
        theta = temp
        cost[i] = computeCost(X, y, theta)        
    return theta, cost

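# Run gradient descent to obtain the final parameters of the linear model
g, cost = gradient_descent_result = gradientDescent(X, y, theta, alpha, iters)

# The label below means "Model parameters:"; it is left unchanged in case the platform compares the output text.
print("模型参数为:", g)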
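
Once the parameters are available, they can be used to predict the profit for a given population. The following is a minimal sketch; the helper predict and the values in theta_example are hypothetical and only illustrate the shape convention (intercept first, then slope, matching the 'Ones' and 'Population' columns of X).

```python
import numpy as np

def predict(theta, population):
    # theta has shape (1, 2): [intercept, slope].
    x = np.array([[1.0, population]])
    return float((x @ np.asarray(theta).T)[0, 0])

# Hypothetical parameters, for illustration only; substitute the fitted g.
theta_example = np.array([[-3.0, 1.0]])
prediction = predict(theta_example, 7.0)
print(prediction)  # -3.0 + 1.0 * 7.0 = 4.0
```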


Added: 2021-12-05 12:17:23 Updated: 2021-12-05 12:18:27
