YOLOv8.3 无人机航拍小目标检测实战与优化策略

YOLOv8.3 无人机航拍小目标检测实战与优化策略 | 极客日志

ssh root@<服务器 IP> -p 2222

dataset/
├── images/
│   ├── train/
│   └── val/
├── labels/
│   ├── train/
│   └── val/
└── data.yaml

train: /root/dataset/images/train
val: /root/dataset/images/val
nc: 8
names: ['pedestrian', 'people', 'bicycle', 'car', 'van', 'truck', 'tricycle', 'awning-tricycle']

from ultralytics import YOLO
# 加载 COCO 预训练的 YOLOv8n 模型
model = YOLO("yolov8n.pt")
# 可选：查看模型结构信息
model.info()

# 开始训练
results = model.train(
    data="/root/dataset/data.yaml",
    epochs=150,
    imgsz=640,
    batch=16,
    optimizer='AdamW',
    lr0=0.001,
    augment=True,
    mosaic=0.5,
    copy_paste=0.3,
    close_mosaic=10,
    device=0 # 使用 GPU 0
)

参数	推荐值	作用
`imgsz`	640 或 1280	提高输入分辨率有助于捕捉小目标细节
`mosaic`	0.5~1.0	多图拼接增强小目标出现频率
`copy_paste`	0.3	将小目标复制粘贴至新位置，缓解样本稀疏问题
`close_mosaic`	10	前几轮关闭马赛克以稳定收敛

results = model.train(..., imgsz=1280, ...)

# 示例：新增 P2 层预测（160×160）
class DetectSmall(Detect):
    def __init__(self, nc=80, ch=()):
        super().__init__(nc, ch)
        self.stride = torch.tensor([4., 8., 16., 32.]) # 新增 strides[0]=4
        self.no = nc + 5
        self.nl = len(self.stride)
        self.reg_max = 16
        self.device = None
        self.export = False
        self.assigner = TaskAlignedAssigner(topk=10, num_classes=self.nc, alpha=0.5, beta=6.0)
        self.bbox_loss = E2TV2Loss(reg_max=self.reg_max)
        self.dfl_loss = DistributionFocalLoss(loss_weight=1.0)

import cv2
import numpy as np

def copy_paste_small_objects(image, labels, paste_ratio=0.3):
    h, w = image.shape[:2]
    new_image = image.copy()
    for label in labels:
        cls, x_c, y_c, bw, bh = label
        if bw * w < 32 and bh * h < 32: # 判断是否为小目标
            if np.random.rand() < paste_ratio:
                offset_x = np.random.randint(-w//4, w//4)
                offset_y = np.random.randint(-h//4, h//4)
                new_x = np.clip(x_c + offset_x/w, 0, 1)
                new_y = np.clip(y_c + offset_y/h, 0, 1)
                # 实现图像块复制粘贴逻辑...
    return new_image, augmented_labels

# 加载最佳权重
model = YOLO("/root/runs/detect/train/weights/best.pt")
# 执行推理
results = model("/root/dataset/images/val/IMG_0001.jpg", conf=0.4, iou=0.5)
# 显示结果
for r in results:
    im_array = r.plot()
    im = Image.fromarray(im_array[..., ::-1]) # BGR to RGB
    im.show()

模型配置	[email protected]	小目标 [email protected]	推理速度 (FPS)	显存占用 (GB)
YOLOv8n (640)	0.421	0.287	142	3.2
YOLOv8n (1280)	0.463	0.352	68	7.1
YOLOv8n + Copy-Paste	0.478	0.376	65	7.3
YOLOv8s (1280)	0.491	0.394	52	8.0

YOLOv8.3 无人机航拍小目标检测实战与优化策略

YOLOv8.3 无人机航拍小目标检测实战与优化策略

1. 引言

2. YOLOv8.3 核心特性与适用性分析

2.1 YOLOv8 架构演进回顾

2.2 小目标检测的核心难点

3. 基于 YOLOv8 镜像的开发环境搭建

3.1 镜像简介与优势

3.2 环境启动与连接方式

Jupyter Notebook 使用方式

SSH 远程连接方式

4. 航拍小目标检测实战流程

4.1 数据集准备与标注规范

数据组织结构

4.2 模型初始化与训练脚本

关键参数说明

5. 小目标检测优化策略

5.1 输入分辨率提升

5.2 引入小目标专用检测头（Custom Head）

5.3 数据增强策略调优

6. 推理与结果可视化

7. 性能评估与对比分析

8. 总结

更多推荐文章

相关免费在线工具

YOLOv8.3 无人机航拍小目标检测实战与优化策略

YOLOv8.3 无人机航拍小目标检测实战与优化策略

1. 引言

2. YOLOv8.3 核心特性与适用性分析

2.1 YOLOv8 架构演进回顾

2.2 小目标检测的核心难点

3. 基于 YOLOv8 镜像的开发环境搭建

3.1 镜像简介与优势

3.2 环境启动与连接方式

Jupyter Notebook 使用方式

SSH 远程连接方式

4. 航拍小目标检测实战流程

4.1 数据集准备与标注规范

数据组织结构

4.2 模型初始化与训练脚本

关键参数说明

5. 小目标检测优化策略

5.1 输入分辨率提升

5.2 引入小目标专用检测头（Custom Head）

5.3 数据增强策略调优

6. 推理与结果可视化

7. 性能评估与对比分析

8. 总结

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具