json 정보

info

year:2021
version:"1.0"
description:"Recycle Trash"
contributor:"Upstage"
url:null
date_created:"2021-02-02 01:10:00"

licenses

id:0
name:"CC BY 4.0"
url:"<https://creativecommons.org/licenses/by/4.0/deed.ast>"

images - 0번째 정보만 출력

width:1024
height:1024
file_name:"train/0000.jpg"
license:0
flickr_url:null
coco_url:null
date_captured:"2020-12-26 14:44:23"
id:0

annotations - 0번째 정보만 출력

image_id:0
category_id:0
area:257301.66
bbox:
	0:197.6 # x_min
	1:193.7 # y_min
	2:547.8 # w
	3:469.7 # h
iscrowd:0
id:0

categories - 0번째 정보만 출력

id:0
name:"General trash"
supercategory:"General trash"

json을 dataframe으로 변환하기

train.json 파일을 불러와서 train_df로 만들어 줍니다.

image_id	class_name	class_id	x_min	y_min	x_max	y_max
0	train/0000.jpg	General trash	0	197.6	193.7	745.4
1	train/0001.jpg	Metal	3	0.0	407.4	57.6
2	train/0001.jpg	Plastic bag	7	0.0	455.6	144.6
3	train/0001.jpg	Glass	4	722.3	313.4	996.6
4	train/0001.jpg	Plastic	5	353.2	671.0	586.9
5	train/0001.jpg	Plastic	5	3.7	448.5	781.9
6	train/0001.jpg	General trash	0	425.3	681.9	641.7
7	train/0001.jpg	Plastic bag	7	92.4	601.7	231.6
8	train/0001.jpg	General trash	0	622.4	686.5	695.2
9	train/0002.jpg	Metal	3	267.9	165.2	899.5
10	train/0003.jpg	Paper pack	2	462.2	369.4	696.1
11	train/0003.jpg	Styrofoam	6	773.3	3.0	961.7
12	train/0004.jpg	Paper	1	567.5	462.2	732.7
13	train/0004.jpg	Paper	1	859.4	411.7	1023.6
14	train/0004.jpg	Paper	1	362.0	349.7	492.0

train_df 중 image_id가 train/0001.jpg인 데이터들만 따로 분리하였습니다.

	image_id	class_name	class_id	x_min	y_min	x_max	y_max
1	train/0001.jpg	Metal	3	0.0	407.4	57.6	588.0
2	train/0001.jpg	Plastic bag	7	0.0	455.6	144.6	637.2
3	train/0001.jpg	Glass	4	722.3	313.4	996.6	565.3
4	train/0001.jpg	Plastic	5	353.2	671.0	586.9	774.4
5	train/0001.jpg	Plastic	5	3.7	448.5	781.9	690.5
6	train/0001.jpg	General trash	0	425.3	681.9	641.7	861.7
7	train/0001.jpg	Plastic bag	7	92.4	601.7	231.6	654.8
8	train/0001.jpg	General trash	0	622.4	686.5	695.2	780.7

이 때, train/0001.jpg에서 몇 개의 gt bbox가 있는지 확인해보았습니다.

Untitled

→ 여기서, 어두운 화면에서도 general trash도 annotation되어 있기 때문에 dataset augmentation에서 밝기를 올려야 할 필요가 있어 보입니다.

모든 이미지가 (1024, 1024) 고정이라 최대, 최소, 평균 모두 1024로 나옵니다.

Minimum height: 1024 Maximum height: 1024 Average height: 1024.0 Minimum width: 1024 Maximum width: 1024 Average width: 1024.0