Overview

Following the previous object detection challenge, this challenge applies segmentation.

Objective

Build a model that segments trash in photos.
Input : images containing trash objects
- segmentation annotation : COCO format
Output : a category value for each pixel coordinate
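
To make the output format concrete, here is a minimal sketch of turning per-class scores into one category value per pixel. It assumes the model produces a score for every class at every pixel, laid out as scores[class][y][x]; that layout, the function name scores_to_label_map, and the toy numbers are illustrative assumptions, not part of the challenge specification.

```python
def scores_to_label_map(scores):
    """Convert scores[c][y][x] into label_map[y][x] holding the best class index.

    Assumption: `scores` is a nested list with one (height x width) grid per class.
    """
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    label_map = []
    for y in range(height):
        row = []
        for x in range(width):
            # Pick the class whose score is highest at this pixel.
            best = max(range(num_classes), key=lambda c: scores[c][y][x])
            row.append(best)
        label_map.append(row)
    return label_map


# Toy example: 3 classes on a 2x2 image (made-up numbers).
toy_scores = [
    [[0.90, 0.10], [0.20, 0.30]],  # class 0
    [[0.05, 0.70], [0.10, 0.30]],  # class 1
    [[0.05, 0.20], [0.70, 0.40]],  # class 2
]
toy_labels = scores_to_label_map(toy_scores)
```

For the real task the grid would be 512 x 512 with 11 classes, and an array library would normally replace the nested loops.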

Data

Image size : (512, 512)
input image, target image, class (11 classes)
- class : Background, General trash, Paper, Paper pack, Metal, Glass, Plastic, Styrofoam, Plastic bag, Battery, Clothing
- The json file contains no annotation for the background, so a background (0) class has to be added
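
The background handling above can be sketched as follows: start from a mask filled with 0 (background) and paint each annotated object with its class index. This is a pure-Python illustration under stated assumptions: each object is given as a class name plus a flat polygon [x1, y1, x2, y2, ...], and the helper names point_in_polygon and build_mask are hypothetical.

```python
CLASSES = [
    "Background", "General trash", "Paper", "Paper pack", "Metal", "Glass",
    "Plastic", "Styrofoam", "Plastic bag", "Battery", "Clothing",
]


def point_in_polygon(px, py, polygon):
    """Even-odd rule test; `polygon` is a flat [x1, y1, x2, y2, ...] list (assumed)."""
    inside = False
    n = len(polygon) // 2
    for i in range(n):
        j = (i + 1) % n
        x1, y1 = polygon[2 * i], polygon[2 * i + 1]
        x2, y2 = polygon[2 * j], polygon[2 * j + 1]
        # Only edges that straddle the horizontal line y = py can be crossed.
        if (y1 > py) != (y2 > py):
            x_cross = x1 + (py - y1) * (x2 - x1) / (y2 - y1)
            if px < x_cross:
                inside = not inside
    return inside


def build_mask(height, width, objects):
    """Return mask[y][x]; pixels covered by no object stay 0 (Background)."""
    mask = [[0] * width for _ in range(height)]
    for class_name, polygon in objects:
        value = CLASSES.index(class_name)  # Background is index 0
        for y in range(height):
            for x in range(width):
                # Test the pixel center against the polygon.
                if point_in_polygon(x + 0.5, y + 0.5, polygon):
                    mask[y][x] = value
    return mask


# Toy example: one "Paper" square on a 4x4 image.
toy_mask = build_mask(4, 4, [("Paper", [1, 1, 3, 1, 3, 3, 1, 3])])
```

The per-pixel loop is far too slow for 512 x 512 images; in practice a rasterizer such as pycocotools' COCO.annToMask would likely do the painting, with the same idea of writing class indices onto a zero-initialized mask.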

Annotation file (COCO format)
- images
  - id: unique image id within the file, e.g. 1
  - height: 512
  - width: 512
  - file_name: e.g. batch01_vt/002.jpg
- annotations
  - id: unique annotation id within the file, e.g. 1
  - segmentation: coordinates outlining the object's mask
  - bbox: coordinates of the box containing the object (xmin, ymin, w, h)
  - area: size of the region occupied by the object
  - category_id: id of the class the object belongs to
  - image_id: unique id of the image the annotation belongs to
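
A short sketch of reading this structure: group annotations by image_id and convert a bbox from (xmin, ymin, w, h) to corner form. The JSON string below is a made-up two-annotation sample, not real challenge data, and the function names group_annotations and xywh_to_xyxy are hypothetical.

```python
import json
from collections import defaultdict

# Made-up sample following the fields listed above (COCO spells the key "file_name").
SAMPLE_JSON = """
{
  "images": [
    {"id": 1, "height": 512, "width": 512, "file_name": "batch01_vt/002.jpg"}
  ],
  "annotations": [
    {"id": 1, "image_id": 1, "category_id": 2,
     "segmentation": [[10, 10, 60, 10, 60, 40, 10, 40]],
     "bbox": [10, 10, 50, 30], "area": 1500},
    {"id": 2, "image_id": 1, "category_id": 5,
     "segmentation": [[100, 100, 120, 100, 120, 130, 100, 130]],
     "bbox": [100, 100, 20, 30], "area": 600}
  ]
}
"""


def group_annotations(coco):
    """Map each image_id to the list of annotations that reference it."""
    by_image = defaultdict(list)
    for ann in coco["annotations"]:
        by_image[ann["image_id"]].append(ann)
    return by_image


def xywh_to_xyxy(bbox):
    """Convert (xmin, ymin, w, h) to (xmin, ymin, xmax, ymax)."""
    xmin, ymin, w, h = bbox
    return [xmin, ymin, xmin + w, ymin + h]


coco = json.loads(SAMPLE_JSON)
grouped = group_annotations(coco)
for image in coco["images"]:
    anns = grouped[image["id"]]
    boxes = [xywh_to_xyxy(a["bbox"]) for a in anns]
    print(image["file_name"], len(anns), boxes)
```

Grouping once up front avoids rescanning the full annotation list for every image when building per-image targets.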

EDA

Experiment Results

Submission

Installation Issue

nvidia-Apex
ViT-Adapter > Usage
Deformable DETR install