| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| GluonCV 0.8.0 Release.tar.gz | 2020-08-10 | 30.2 MB | |
| GluonCV 0.8.0 Release.zip | 2020-08-10 | 30.6 MB | |
| README.md | 2020-08-10 | 2.8 kB | |
| Totals: 3 Items | 60.8 MB | 0 | |
GluonCV 0.8.0 Release Note
Highlights
GluonCV v0.8.0 features the popular depth estimation model Monodepth2, semantic segmentation models (DANet and FastSCNN), StyleGAN, and multiple usability improvements.
Monodepth2 (thanks @KuangHaofei )
We provide GluonCV implementation of Monodepth2 and the results are fully reproducible. To try out on your own images, please see our demo tutorial. To train a Monodepth2 model on your own dataset, please see our dive deep tutorial.
Following table shows its performance on the KITTI dataset. | Name | Modality | Resolution | Abs. Rel. Error | delta < 1.25 | Hashtag | | -- | -- | -- | -- | -- | -- | | monodepth2_resnet18_kitti_stereo_640x192 1 | Stereo | 640x192 | 0.114 | 0.856 | 92871317 |

More Semantic Segmentation Models (thanks @xdeng7 and @ytian8 )
We include two new semantic segmentation models in this release, one is DANet, the other is FastSCNN.
Following table shows their performance on the Cityscapes validation set. | Model | Pre-Trained Dataset | Dataset | pixAcc | mIoU | |---------------------------|--------|-----|--------|-------| | danet_resnet50_citys | ImageNet | Cityscapes | 96.3 | 78.5 | | danet_resnet101_citys | ImageNet | Cityscapes | 96.5 | 80.1 | | fastscnn_citys | - | Cityscapes | 95.1 | 72.3 |
Our FastSCNN is an improved version from a recent paper using semi-supervised learning. To our best knowledge, 72.3 mIoU is the highest-scored implementation of FastSCNN and one of the best real-time semantic segmentation models.
StyleGAN (thanks @xdeng7 )

A GluonCV implementation of StyleGAN "A Style-Based Generator Architecture for Generative Adversarial Networks": https://github.com/dmlc/gluon-cv/tree/master/scripts/gan/stylegan
Bug fixes and Improvements
- We now officially deprecated python2 support, the minimum required python 3 version is 3.6. (#1399)
- Fixed Faster-RCNN training script (#1249)
- Allow SRGAN to be hybridized (#1281)
- Fix market1501 dataset (#1227)
- Added Visdrone dataset (#1267)
- Improved video action recognition task's
train.py(#1339) - Added jetson object detection tutorial (#1346)
- Improved guide for contributing new algorithms to GluonCV (#1354)
- Fixed amp parameter that required in class ForwardBackwardTask (#1404)