cv/yolo笔记

yolo笔记

YOLO 技术: [输入: 416*416]

多个scale融合: [目的: 加强YOLO算法对小目标检测的精确度]
- v2:
  - 类似ResNet的passthrough layer:
  - 连接前层26*26和本层13*13的feature map
- v3:
  - 类似FPN的upsample和融合:
  - 融合 3个scale, 13*13, 26*26, 52*52的feature map
bounding box的数量: [目的: 加强YOLO算法对小目标检测的精确度]
- v2:
  - grid cell大小: 32*32
  - grid cell数量: 13*13
  - 预测5个bounding box / 每grid cell
  - bounding box数量: (13*13) * 5
- v3:
  - grid cell大小: 32*32, 16*16, 8*8
  - grid cell数量: 13*13, 26*26, 52*52
  - 预测3个bounding box / 每grid cell
  - bounding box数量: (13*13 + 26*26 + 52*52) * 3
bounding box初始尺寸的初始化: [目的: 加快训练网络速度]
- 方法: (v2, v3)
  - k-means聚类
- COCO集上结果:
  - (10*13); (16*30); (33*23); (30*61); (62*45); (59*119); (116*90); (156*198); (373*326)