spytensor
diff --git a/‎README.md‎
Lines changed: 235 additions & 2 deletions b/‎README.md‎
diff --git a/‎csv2coco.py‎
Lines changed: 149 additions & 0 deletions b/‎csv2coco.py‎
@@ -1,2 +1,235 @@
-# prepare_detection_dataset
-convert dataset to coco/voc format
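+**Background**
+
+Getting started is always the hardest part. I wrote my earlier image-recognition blog tutorials for people who had learned plenty of theory but did not know where to begin on a real project. I later moved on to object detection and learned a great deal about detection, both knowledge and hands-on practice, from Ye (烨兄) and Yaguang (亚光兄). Now that I finally have some free time, I am writing a summary series on detection: partly to share what I know so that we can learn together, and partly to help others avoid detours and get started quickly.
+
+Code for this part: [GitHub](https://github.com/spytensor/prepare_detection_dataset)
+
+<h4 id="1">1. Overview</h4>
+
+Part one of the series covers converting between several common dataset formats, using `csv` as a universal intermediate format. The following conversions are covered: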
+
+- csv to coco
+- csv to voc
+- labelme to coco
+- labelme to voc
+
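+<h4 id="2">2. Standard formats</h4>
+
+Before using the conversion scripts, make sure you understand the following formats.
+
+<h5 id="2.1">2.1 csv</h5>
+
+Do not run the scripts on a file just because it happens to be a `csv` file. If it does not match the layout below, either modify the code or modify the annotation file.
+
+The conversion scripts expect the csv data in the following form: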
+
+- `csv/`
+    - `labels.csv`
+    - `images/`
+        - `image1.jpg`
+        - `image2.jpg`
+        - `...`
+
+Each line of `labels.csv` has the form:
+
+`/path/to/image,xmin,ymin,xmax,ymax,label`
+
+For example:
+
+```
+/mfs/dataset/face/0d4c5e4f-fc3c-4d5a-906c-105.jpg,450,154,754,341,face
+/mfs/dataset/face/0ddfc5aea-fcdac-421-92dad-144.jpg,143,154,344,341,face
+...
+```
+Note: use absolute paths for the images.
+
+<h5 id="2.2">2.2 voc</h5>
+
+The standard voc layout is:
+
+- `VOC2007/`
+    - `Annotations/`
+        - `0d4c5e4f-fc3c-4d5a-906c-105.xml`
+        - `0ddfc5aea-fcdac-421-92dad-144.xml`
+        - `...`
+    - `ImageSets/`
+        - `Main/`
+            - `train.txt`
+            - `test.txt`
+            - `val.txt`
+            - `trainval.txt`
+    - `JPEGImages/`
+        - `0d4c5e4f-fc3c-4d5a-906c-105.jpg`
+        - `0ddfc5aea-fcdac-421-92dad-144.jpg`
+        - `...`
+
+<h5 id="2.3">2.3 coco</h5>
+
+The test set is not used here.
+
+- `coco/`
+    - `annotations/`
+        - `instances_train2017.json`
+        - `instances_val2017.json`
+    - `images/`
+        - `train2017/`
+            - `0d4c5e4f-fc3c-4d5a-906c-105.jpg`
+            - `...`
+        - `val2017/`
+            - `0ddfc5aea-fcdac-421-92dad-144.jpg`
+            - `...`
+
+<h5 id="2.4">2.4 labelme</h5>
+
+
+- `labelme/`
+    - `0d4c5e4f-fc3c-4d5a-906c-105.json`
+    - `0d4c5e4f-fc3c-4d5a-906c-105.jpg`
+    - `0ddfc5aea-fcdac-421-92dad-144.json`
+    - `0ddfc5aea-fcdac-421-92dad-144.jpg`
+
+JSON file format
+(the `imageData` field is too long to show here):
+
+```json
+{
+  "version": "3.6.16",
+  "flags": {},
+  "shapes": [
+    {
+      "label": "helmet",
+      "line_color": null,
+      "fill_color": null,
+      "points": [
+        [
+          131,
+          269
+        ],
+        [
+          388,
+          457
+        ]
+      ],
+      "shape_type": "rectangle"
+    }
+  ],
+  "lineColor": [
+    0,
+    255,
+    0,
+    128
+  ],
+  "fillColor": [
+    255,
+    0,
+    0,
+    128
+  ],
+  "imagePath": "004ffe6f-c3e2-3602-84a1-ecd5f437b113.jpg",
+  "imageData": "",
+  "imageHeight": 1080,
+  "imageWidth": 1920
+}
+```
+
+<h4 id="3">3. How to use the conversion scripts</h4>
+
+<h5 id="3.1">3.1 csv2coco</h5>
+
+First, change the following settings in `csv2coco.py`:
+
+```
+classname_to_id = {"person": 1}  # for your dataset classes
+csv_file = "labels.csv"  # annotations file path
+image_dir = "images/"    # original image path
+saved_coco_path = "./"   # path to save converted coco dataset
+```
+
+Then run `python csv2coco.py`.
+
+The script creates the folders and copies the images into place automatically. When it finishes, you get:
+
+- `coco/`
+    - `annotations/`
+        - `instances_train2017.json`
+        - `instances_val2017.json`
+    - `images/`
+        - `train2017/`
+            - `0d4c5e4f-fc3c-4d5a-906c-105.jpg`
+            - `...`
+        - `val2017/`
+            - `0ddfc5aea-fcdac-421-92dad-144.jpg`
+            - `...`
+
+<h5 id="3.2">3.2 csv2voc</h5>
+
+First, change the following settings in `csv2voc.py`:
+
+```
+csv_file = "labels.csv"
+saved_path = "./VOC2007/" # path to save converted voc dataset
+image_save_path = "./JPEGImages/"   # converted voc images path
+image_raw_parh = "images/"          # original image path
+```
+
+Then run `python csv2voc.py`.
+
+As before, the script creates the folders and copies the images into place automatically. When it finishes, you get:
+
+
+- `VOC2007/`
+    - `Annotations/`
+        - `0d4c5e4f-fc3c-4d5a-906c-105.xml`
+        - `0ddfc5aea-fcdac-421-92dad-144.xml`
+        - `...`
+    - `ImageSets/`
+        - `Main/`
+            - `train.txt`
+            - `test.txt`
+            - `val.txt`
+            - `trainval.txt`
+    - `JPEGImages/`
+        - `0d4c5e4f-fc3c-4d5a-906c-105.jpg`
+        - `0ddfc5aea-fcdac-421-92dad-144.jpg`
+        - `...`
+
+<h5 id="3.3">3.3 labelme2coco</h5>
+
+First, change the following settings in `labelme2coco.py`:
+
+```
+classname_to_id = {"person": 1}  # for your dataset classes
+labelme_path = "labelme/"  # path for labelme dataset
+saved_coco_path = "./"     # path for saved coco dataset
+```
+Then run `python labelme2coco.py`. The output layout is the same as for `csv2coco`.
+
+<h5 id="3.4">3.4 labelme2voc</h5>
+
+First, change the following settings in `labelme2voc.py`:
+
+```
+labelme_path = "labelme/"  # path for labelme dataset
+saved_coco_path = "./"     # path for saved coco dataset
+```
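+Then run `python labelme2voc.py`. The output layout is the same as for `csv2voc`.
+
+<h4 id="4">4. csv as a universal intermediate format</h4>
+
+As you can see, none of the conversions above go *to* csv. One reason is that it is very simple; the other is that mainstream detection frameworks rarely accept this format as input. Here is how to write annotation information to a `csv` file: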
+
+```python
+# Each entry: [image path, "xmin ymin xmax ymax label"] (placeholder values)
+info = [["filename0", "xmin ymin xmax ymax label0"],
+        ["filename1", "xmin ymin xmax ymax label1"]]
+with open("csv_labels.csv", "w") as csv_labels:
+    for filename, bboxes in info:
+        bbox = bboxes.split(" ")
+        label = bbox[-1]
+        csv_labels.write(",".join([filename] + bbox[:4] + [label]) + "\n")
+```
+
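+Pretty simple, right? If you do not know how to read the annotation information out of your original label files, there is not much I can do for you: go learn some programming, lol.
+
+<h4 id="5">5. Next post</h4>
+How to do data augmentation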
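For section 4 of the README, the same idea can also be sketched with Python's standard `csv` module, which takes care of quoting if an image path ever contains a comma. The row layout follows the `/path/to/image,xmin,ymin,xmax,ymax,label` form from section 2.1. The function names `write_labels_csv` and `read_labels_csv` and the sample records are made up for illustration and are not part of the repository.

```python
import csv


def write_labels_csv(records, csv_path):
    """Write (image_path, xmin, ymin, xmax, ymax, label) records, one per row."""
    with open(csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        for image_path, xmin, ymin, xmax, ymax, label in records:
            writer.writerow([image_path, int(xmin), int(ymin), int(xmax), int(ymax), label])


def read_labels_csv(csv_path):
    """Read the rows back as lists of strings (no header row is expected)."""
    with open(csv_path, newline="") as f:
        return list(csv.reader(f))


# Hypothetical sample records, not taken from any real dataset
sample_records = [
    ("/data/images/a.jpg", 450, 154, 754, 341, "face"),
    ("/data/images/b.jpg", 143, 154, 344, 341, "face"),
]
```

Calling `write_labels_csv(sample_records, "labels.csv")` would produce a header-less file, which matches the `header=None` assumption that `csv2coco.py` makes when it reads the annotations.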
@@ -0,0 +1,149 @@
+# -*- coding: utf-8 -*-
+'''
+@time: 2019/01/11 11:28
+spytensor
+'''
+
+import os
+import json
+import numpy as np
+import pandas as pd
+import glob
+import cv2
+import os
+import shutil
+from IPython import embed
+from sklearn.model_selection import train_test_split
+np.random.seed(41)
+
+# 0 is reserved for background
+classname_to_id = {"person": 1}
+
+class Csv2CoCo:
+
+    def __init__(self,image_dir,total_annos):
+        self.images = []
+        self.annotations = []
+        self.categories = []
+        self.img_id = 0
+        self.ann_id = 0
+        self.image_dir = image_dir
+        self.total_annos = total_annos
+
+    def save_coco_json(self, instance, save_path):
+        json.dump(instance, open(save_path, 'w'), ensure_ascii=False, indent=2)  # indent=2 for readable output
+
+    # Build the COCO structure from the csv annotations
+    def to_coco(self, keys):
+        self._init_categories()
+        for key in keys:
+            self.images.append(self._image(key))
+            shapes = self.total_annos[key]
+            for shape in shapes:
+                bboxi = []
+                for cor in shape[:-1]:
+                    bboxi.append(int(cor))
+                label = shape[-1]
+                annotation = self._annotation(bboxi,label)
+                self.annotations.append(annotation)
+                self.ann_id += 1
+            self.img_id += 1
+        instance = {}
+        instance['info'] = 'spytensor created'
+        instance['licenses'] = ['license']
+        instance['images'] = self.images
+        instance['annotations'] = self.annotations
+        instance['categories'] = self.categories
+        return instance
+
+    # Build the categories
+    def _init_categories(self):
+        for k, v in classname_to_id.items():
+            category = {}
+            category['id'] = v
+            category['name'] = k
+            self.categories.append(category)
+
+    # Build a COCO image entry
+    def _image(self, path):
+        image = {}
+        print(path)
+        img = cv2.imread(self.image_dir + path)
+        image['height'] = img.shape[0]
+        image['width'] = img.shape[1]
+        image['id'] = self.img_id
+        image['file_name'] = path
+        return image
+
+    # Build a COCO annotation entry
+    def _annotation(self, shape,label):
+        # label = shape[-1]
+        points = shape[:4]
+        annotation = {}
+        annotation['id'] = self.ann_id
+        annotation['image_id'] = self.img_id
+        annotation['category_id'] = int(classname_to_id[label])
+        annotation['segmentation'] = self._get_seg(points)
+        annotation['bbox'] = self._get_box(points)
+        annotation['iscrowd'] = 0
+        annotation['area'] = float((points[2] - points[0]) * (points[3] - points[1]))  # box area, w * h
+        return annotation
+
+    # COCO bbox format: [x1, y1, w, h]
+    def _get_box(self, points):
+        min_x = points[0]
+        min_y = points[1]
+        max_x = points[2]
+        max_y = points[3]
+        return [min_x, min_y, max_x - min_x, max_y - min_y]
+    # segmentation
+    def _get_seg(self, points):
+        min_x = points[0]
+        min_y = points[1]
+        max_x = points[2]
+        max_y = points[3]
+        h = max_y - min_y
+        w = max_x - min_x
+        a = []
+        a.append([min_x,min_y, min_x,min_y+0.5*h, min_x,max_y, min_x+0.5*w,max_y, max_x,max_y, max_x,max_y-0.5*h, max_x,min_y, max_x-0.5*w,min_y])
+        return a
+   
+
+if __name__ == '__main__':
+    csv_file = "train.csv"
+    image_dir = "images/"
+    saved_coco_path = "./"
+    # Collect the csv annotations, grouped by image file name
+    total_csv_annotations = {}
+    annotations = pd.read_csv(csv_file,header=None).values
+    for annotation in annotations:
+        key = annotation[0].split(os.sep)[-1]
+        value = np.array([annotation[1:]])
+        if key in total_csv_annotations.keys():
+            total_csv_annotations[key] = np.concatenate((total_csv_annotations[key],value),axis=0)
+        else:
+            total_csv_annotations[key] = value
+    # Split the data by key (image file name)
+    total_keys = list(total_csv_annotations.keys())
+    train_keys, val_keys = train_test_split(total_keys, test_size=0.2)
+    print("train_n:", len(train_keys), 'val_n:', len(val_keys))
+    # Create the required folders
+    if not os.path.exists('%scoco/annotations/'%saved_coco_path):
+        os.makedirs('%scoco/annotations/'%saved_coco_path)
+    if not os.path.exists('%scoco/images/train2017/'%saved_coco_path):
+        os.makedirs('%scoco/images/train2017/'%saved_coco_path)
+    if not os.path.exists('%scoco/images/val2017/'%saved_coco_path):
+        os.makedirs('%scoco/images/val2017/'%saved_coco_path)
+    # Convert the training set to COCO json
+    l2c_train = Csv2CoCo(image_dir=image_dir,total_annos=total_csv_annotations)
+    train_instance = l2c_train.to_coco(train_keys)
+    l2c_train.save_coco_json(train_instance, '%scoco/annotations/instances_train2017.json'%saved_coco_path)
+    for file in train_keys:
+        shutil.copy(image_dir+file,"%scoco/images/train2017/"%saved_coco_path)
+    for file in val_keys:
+        shutil.copy(image_dir+file,"%scoco/images/val2017/"%saved_coco_path)
+    # Convert the validation set to COCO json
+    l2c_val = Csv2CoCo(image_dir=image_dir,total_annos=total_csv_annotations)
+    val_instance = l2c_val.to_coco(val_keys)
+    l2c_val.save_coco_json(val_instance, '%scoco/annotations/instances_val2017.json'%saved_coco_path)
+