[Artificial Intelligence] Deep Learning and Object Detection Tutorial Series 16-300: Training a First YOLOv5 Model on the Global Wheat Dataset

@Author: Runsen

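Earlier detection systems repurpose classifiers or localizers to perform detection, applying the model to an image at multiple locations and scales.

YOLO takes a completely different approach. It applies a single neural network to the full image. The network divides the image into regions and predicts bounding boxes and probabilities for each region. These bounding boxes are weighted by the predicted probabilities.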
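
To make the idea of "dividing the image into regions" concrete, here is a minimal sketch that maps a box centre to the grid cell containing it. The function name `responsible_cell`, the 8x8 grid, and the 1024-pixel image size are assumptions chosen for illustration; YOLOv5's real detection head works on several feature-map scales with anchors and is more involved than this.

```python
def responsible_cell(x_center, y_center, image_size=1024, grid=8):
    """Return (row, col) of the grid cell that contains a box centre.

    Coordinates are in pixels. The grid size is a hypothetical value
    used only for this illustration.
    """
    cell = image_size / grid  # side length of one cell in pixels
    # Clamp so that a centre lying exactly on the far edge stays in the last cell.
    col = min(int(x_center // cell), grid - 1)
    row = min(int(y_center // cell), grid - 1)
    return row, col


print(responsible_cell(300, 700))  # (5, 2)
```

With 128-pixel cells, a centre at x=300, y=700 falls in column 2 and row 5.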

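The YOLO model has several advantages over classifier-based systems. It looks at the whole image at test time, so its predictions are informed by the global context of the image. It also makes its predictions with a single network evaluation, whereas systems like R-CNN need thousands of evaluations for one image. As reported by the YOLO authors, this makes it more than 1000 times faster than R-CNN and 100 times faster than Fast R-CNN.

That is enough theory. Let's look at how to use YOLOv5 with a custom dataset and apply it to the Global Wheat Detection challenge.

The YOLOv5 repository represents Ultralytics' open-source research into future object detection methods. All code and models are created by Ultralytics.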

https://github.com/ultralytics/yolov5/wiki/Train-Custom-Data

!git clone https://github.com/ultralytics/yolov5.git
!mv yolov5/* ./

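All required dependencies are listed in the requirements.txt file, so they can all be installed in one step (typically with pip install -r requirements.txt).

Installing the COCO API (pycocotools) with pip install "git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI" failed for me, so I switched to a Gitee mirror instead.

The script below reads train.csv and splits the images between a training set and a validation set inside the convertor folder.
We create a folder named convertor, and all files are stored in it using the following layout.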

convertor (main directory)
- fold0
  - labels (contains all the box dimensions)
    - train2017
    - val2017
  - images (contains images)
    - train2017
    - val2017
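
The validation split used by the script that follows can be sketched in isolation: fold k of 5 takes one contiguous fifth of the image ids as validation data, and the rest is used for training. The helper name `split_fold` is hypothetical, and sorting the ids first is an assumption added here so that the split is the same on every run.

```python
def split_fold(image_ids, fold=0, n_folds=5):
    """Split unique image ids into (train, val) lists for one fold."""
    ids = sorted(set(image_ids))  # sorted for a reproducible order
    start = len(ids) * fold // n_folds
    stop = len(ids) * (fold + 1) // n_folds
    val = ids[start:stop]
    val_set = set(val)  # set membership is O(1)
    train = [i for i in ids if i not in val_set]
    return train, val


# Ten made-up image ids, fold 0 of 5: the first two go to validation.
train_ids, val_ids = split_fold(["img%02d" % i for i in range(10)], fold=0)
print(val_ids)         # ['img00', 'img01']
print(len(train_ids))  # 8
```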

import os
import shutil as sh

import numpy as np
import pandas as pd
from tqdm.auto import tqdm


def convertTrainLabel():
    # Each row of train.csv is one box; 'bbox' is a string "[x, y, w, h]" in pixels.
    df = pd.read_csv('train.csv')
    bboxs = np.stack(df['bbox'].apply(lambda x: np.fromstring(x[1:-1], sep=',')))
    for i, column in enumerate(['x', 'y', 'w', 'h']):
        df[column] = bboxs[:, i]
    df.drop(columns=['bbox'], inplace=True)
    # YOLO labels use the box centre rather than the top-left corner.
    df['x_center'] = df['x'] + df['w'] / 2
    df['y_center'] = df['y'] + df['h'] / 2
    df['classes'] = 0  # single class: wheat head
    df = df[['image_id', 'x', 'y', 'w', 'h', 'x_center', 'y_center', 'classes']]

    # Sort the ids so the split is reproducible (set order is not).
    index = sorted(set(df.image_id))

    source = 'train'
    for fold in [0]:
        # One fifth of the images is held out for validation.
        val_index = set(index[len(index) * fold // 5:len(index) * (fold + 1) // 5])
        for name, mini in tqdm(df.groupby('image_id')):
            path2save = 'val2017' if name in val_index else 'train2017'
            label_dir = 'convertor/fold{}/labels/{}'.format(fold, path2save)
            image_dir = 'convertor/fold{}/images/{}'.format(fold, path2save)
            os.makedirs(label_dir, exist_ok=True)
            os.makedirs(image_dir, exist_ok=True)
            # One label file per image, one line per box: class x_center y_center w h
            with open(os.path.join(label_dir, name + '.txt'), 'w') as f:
                row = mini[['classes', 'x_center', 'y_center', 'w', 'h']].astype(float).values
                row = row / 1024  # normalise to [0, 1]; the images are 1024x1024 (class 0 stays 0)
                row = row.astype(str)
                for j in range(len(row)):
                    f.write(' '.join(row[j]) + '\n')
            sh.copy('global-wheat-detection/{}/{}.jpg'.format(source, name),
                    os.path.join(image_dir, name + '.jpg'))


convertTrainLabel()
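
As a worked example of the label conversion, the sketch below turns one bbox string in the train.csv style ("[x, y, w, h]", top-left corner plus size, in pixels) into one YOLO label line (class, x_center, y_center, width, height, all normalised to the image size). The function name `bbox_to_yolo` and the sample box are made up for illustration. Unlike the script above, it writes the class as an integer and uses a fixed number of decimals.

```python
def bbox_to_yolo(bbox, image_size=1024, class_id=0):
    """Convert a "[x, y, w, h]" pixel box into a YOLO label line."""
    x, y, w, h = (float(v) for v in bbox.strip("[]").split(","))
    x_center = (x + w / 2) / image_size  # centre, normalised to [0, 1]
    y_center = (y + h / 2) / image_size
    return "{} {:.6f} {:.6f} {:.6f} {:.6f}".format(
        class_id, x_center, y_center, w / image_size, h / image_size)


# A made-up box: top-left corner (512, 256), 128 px wide, 64 px high.
print(bbox_to_yolo("[512.0, 256.0, 128.0, 64.0]"))
# 0 0.562500 0.281250 0.125000 0.062500
```

The centre is (512 + 64, 256 + 32) = (576, 288), and dividing by 1024 gives 0.5625 and 0.28125.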

Training script

python train.py --img 512 --batch 4 --epochs 10 --data data/wheet0.yaml --cfg data/yolov5x.yaml --name yolov5x_fold0
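
The training command above points --data at data/wheet0.yaml, whose contents the article does not show. The sketch below prints what such a dataset file could look like. The keys train, val, nc and names are the ones YOLOv5 dataset files use, but the paths (taken from the convertor/fold0 layout) and the class name 'wheat' are assumptions, not the author's actual file. YOLOv5 locates the label files by swapping images for labels in each image path, which is why only the image folders are listed.

```python
def dataset_yaml(root="convertor/fold0"):
    """Build the text of a hypothetical single-class YOLOv5 dataset file."""
    lines = [
        "train: {}/images/train2017".format(root),  # training images
        "val: {}/images/val2017".format(root),      # validation images
        "nc: 1",                                    # number of classes
        "names: ['wheat']",                         # class name (assumed)
    ]
    return "\n".join(lines) + "\n"


print(dataset_yaml())
```

Saving the printed text as data/wheet0.yaml would give the training command a dataset definition that matches the folders produced by the conversion script.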

Test (inference) script

python ./detect.py --weights ./weights/last_yolov5x_fold0.pt --img 512 --conf 0.4 --source ./convertor/fold0/images/val2017

Link: https://pan.baidu.com/s/1cApWw5uPVLZk0kFIGrzVVA
Extraction code: e39k

Added: 2021-08-13 12:01:20 Updated: 2021-08-13 12:04:44
