object contour detection with a fully convolutional encoder decoder network

Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection than previous methods. Object contour detection is fundamental for numerous vision tasks. We compared the model performance to two encoder-decoder networks; U-Net as a baseline benchmark and to U-Net++ as the current state-of-the-art segmentation fully convolutional network. Indoor segmentation and support inference from rgbd images. The above proposed technologies lead to a more precise and clearer F-measures, in, D.Eigen and R.Fergus, Predicting depth, surface normals and semantic labels Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour . and the loss function is simply the pixel-wise logistic loss. Drawing detailed and accurate contours of objects is a challenging task for human beings. [57], we can get 10528 and 1449 images for training and validation. building and mountains are clearly suppressed. It is apparently a very challenging ill-posed problem due to the partial observability while projecting 3D scenes onto 2D image planes. 27 Oct 2020. It turns out that the CEDNMCG achieves a competitive AR to MCG with a slightly lower recall from fewer proposals, but a weaker ABO than LPO, MCG and SeSe. In this paper, we scale up the training set of deep learning based contour detection to more than 10k images on PASCAL VOC[14]. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. The detection accuracies are evaluated by four measures: F-measure (F), fixed contour threshold (ODS), per-image best threshold (OIS) and average precision (AP). Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. 300fps. Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. Operation-level vision-based monitoring and documentation has drawn significant attention from construction practitioners and researchers. View 10 excerpts, cites methods and background, IEEE Transactions on Pattern Analysis and Machine Intelligence. Note that we use the originally annotated contours instead of our refined ones as ground truth for unbiased evaluation. In general, contour detectors offer no guarantee that they will generate closed contours and hence dont necessarily provide a partition of the image into regions[1]. Together there are 10582 images for training and 1449 images for validation (the exact 2012 validation set). Microsoft COCO: Common objects in context. Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. Kontschieder et al. The number of people participating in urban farming and its market size have been increasing recently. This work was partially supported by the National Natural Science Foundation of China (Project No. Therefore, each pixel of the input image receives a probability-of-contour value. Efficient inference in fully connected CRFs with gaussian edge Early approaches to contour detection[31, 32, 33, 34] aim at quantifying the presence of boundaries through local measurements, which is the key stage of designing detectors. COCO and can match state-of-the-art edge detection on BSDS500 with fine-tuning. invasive coronary angiograms, Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks, MSDPN: Monocular Depth Prediction with Partial Laser Observation using View 2 excerpts, references background and methods, 2015 IEEE International Conference on Computer Vision (ICCV). Download the pre-processed dataset by running the script, Download the VGG16 net for initialization by running the script, Test the learned network by running the script, Download the pre-trained model by running the script. [21] developed a method, called DeepContour, in which a contour patch was an input of a CNN model and the output was treated as a compact cluster which was assigned by a shape label. trongan93/viplab-mip-multifocus dataset (ODS F-score of 0.788), the PASCAL VOC2012 dataset (ODS F-score of Different from previous . Ming-Hsuan Yang. It can be seen that the F-score of HED is improved (from, This is a recurring payment that will happen monthly, If you exceed more than 500 images, they will be charged at a rate of $5 per 500 images. We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. Work fast with our official CLI. Unlike skip connections We fine-tuned the model TD-CEDN-over3 (ours) with the NYUD training dataset. INTRODUCTION O BJECT contour detection is a classical and fundamen-tal task in computer vision, which is of great signif-icance to numerous computer vision applications, including segmentation [1], [2], object proposals [3], [4], object de- The above mentioned four methods[20, 48, 21, 22] are all patch-based but not end-to-end training and holistic image prediction networks. CEDN works well on unseen classes that are not prevalent in the PASCAL VOC training set, such as sports. Machine Learning (ICML), International Conference on Artificial Intelligence and hierarchical image segmentation,, P.Arbelez, J.Pont-Tuset, J.T. Barron, F.Marques, and J.Malik, Directly using contour coordinates to describe text regions will make the modeling inadequate and lead to low accuracy of text detection. Caffe: Convolutional architecture for fast feature embedding. In the future, we will explore to find an efficient fusion strategy to deal with the multi-annotation issues, such as BSDS500. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection . Figure8 shows that CEDNMCG achieves 0.67 AR and 0.83 ABO with 1660 proposals per image, which improves the second best MCG by 8% in AR and by 3% in ABO with a third as many proposals. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding . Therefore, we apply the DSN to provide the integrated direct supervision from coarse to fine prediction layers. curves, in, Q.Zhu, G.Song, and J.Shi, Untangling cycles for contour grouping, in, J.J. Kivinen, C.K. Williams, N.Heess, and D.Technologies, Visual boundary J.J. Kivinen, C.K. Williams, and N.Heess. These observations urge training on COCO, but we also observe that the polygon annotations in MS COCO are less reliable than the ones in PASCAL VOC (third example in Figure9(b)). sign in We notice that the CEDNSCG achieves similar accuracies with CEDNMCG, but it only takes less than 3 seconds to run SCG. M.R. Amer, S.Yousefi, R.Raich, and S.Todorovic. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. The same measurements applied on the BSDS500 dataset were evaluated. This is why many large scale segmentation datasets[42, 14, 31] provide contour annotations with polygons as they are less expensive to collect at scale. Recovering occlusion boundaries from a single image. The most of the notations and formulations of the proposed method follow those of HED[19]. A complete decoder network setup is listed in Table. In our module, the deconvolutional layer is first applied to the current feature map of the decoder network, and then the output results are concatenated with the feature map of the lower convolutional layer in the encoder network. Learn more. Compared with CEDN, our fine-tuned model presents better performances on the recall but worse performances on the precision on the PR curve. Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. generalizes well to unseen object classes from the same super-categories on MS Ren et al. Edge-preserving interpolation of correspondences for optical flow, in, M.R. Amer, S.Yousefi, R.Raich, and S.Todorovic, Monocular extraction of booktitle = "Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016", Object contour detection with a fully convolutional encoder-decoder network, Chapter in Book/Report/Conference proceeding, 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016. search. Therefore, the traditional cross-entropy loss function is redesigned as follows: where refers to a class-balancing weight, and I(k) and G(k) denote the values of the k-th pixel in I and G, respectively. An immediate application of contour detection is generating object proposals. Being fully convolutional . Dive into the research topics of 'Object contour detection with a fully convolutional encoder-decoder network'. The encoder network consists of 13 convolutional layers which correspond to the first 13 convolutional layers in the VGG16 network designed for object classification. This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. 12 presents the evaluation results on the testing dataset, which indicates the depth information, which has a lower F-score of 0.665, can be applied to improve the performances slightly (0.017 for the F-score). color, and texture cues. View 7 excerpts, references results, background and methods, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Fig. The encoder-decoder network with such refined module automatically learns multi-scale and multi-level features to well solve the contour detection issues. We believe the features channels of our decoder are still redundant for binary labeling addressed here and thus also add a dropout layer after each relu layer. 0 benchmarks Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection . Please Long, R.Girshick, Different from previous low-level edge detection, our algorithm focuses on detecting higher . network is trained end-to-end on PASCAL VOC with refined ground truth from Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. Detection, SRN: Side-output Residual Network for Object Reflection Symmetry The training set is denoted by S={(Ii,Gi)}Ni=1, where the image sample Ii refers to the i-th raw input image and Gi refers to the corresponding ground truth edge map of Ii. Note that we did not train CEDN on MS COCO. Our proposed method in this paper absorbs the encoder-decoder architecture and introduces a novel refined module to enforce the relationship of features between the encoder and decoder stages, which is the major difference from previous networks. Skip connections between encoder and decoder are used to fuse low-level and high-level feature information. [41] presented a compositional boosting method to detect 17 unique local edge structures. We also propose a new joint loss function for the proposed architecture. For simplicity, we consider each image independently and the index i will be omitted hereafter. Fig. Deepedge: A multi-scale bifurcated deep network for top-down contour Previous algorithms efforts lift edge detection to a higher abstract level, but still fall below human perception due to their lack of object-level knowledge. This is a tensorflow implimentation of Object Contour Detection with a Fully Convolutional Encoder-Decoder Network (https://arxiv.org/pdf/1603.04530.pdf) . Dense Upsampling Convolution. evaluating segmentation algorithms and measuring ecological statistics. Being fully convolutional, our CEDN network can operate note = "Funding Information: J. Yang and M.-H. Yang are supported in part by NSF CAREER Grant #1149783, NSF IIS Grant #1152576, and a gift from Adobe. Thus the improvements on contour detection will immediately boost the performance of object proposals. interpretation, in, X.Ren, Multi-scale improves boundary detection in natural images, in, S.Zheng, A.Yuille, and Z.Tu, Detecting object boundaries using low-, mid-, loss for contour detection. This study proposes an end-to-end encoder-decoder multi-tasking CNN for joint blood accumulation detection and tool segmentation in laparoscopic surgery to maintain the operating room as clean as possible and, consequently, improve the . convolutional feature learned by positive-sharing loss for contour H. Lee is supported in part by NSF CAREER Grant IIS-1453651. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network . jimeiyang/objectContourDetector CVPR 2016 We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. @inproceedings{bcf6061826f64ed3b19a547d00276532. For simplicity, the TD-CEDN-over3, TD-CEDN-all and TD-CEDN refer to the results of ^Gover3, ^Gall and ^G, respectively. The encoder is used as a feature extractor and the decoder uses the feature information extracted by the encoder to recover the target region in the image. BN and ReLU represent the batch normalization and the activation function, respectively. The main idea and details of the proposed network are explained in SectionIII. In this paper, we address object-only contour detection that is expected to suppress background boundaries (Figure1(c)). Like other methods, a standard non-maximal suppression technique was applied to obtain thinned contours before evaluation. Bertasius et al. All these methods require training on ground truth contour annotations. //Arxiv.Org/Pdf/1603.04530.Pdf ) excerpts, references results, background and methods, a standard non-maximal suppression technique was applied to thinned... Function for the proposed architecture in the PASCAL VOC training set, such as BSDS500 J.J. Kivinen C.K! Construction practitioners and researchers fine-tuned the model TD-CEDN-over3 ( ours ) with the multi-annotation issues, such as.. Ieee Transactions on Pattern Analysis and Machine Intelligence MS coco J.J. Kivinen,.. Task for human beings obtain thinned contours before evaluation image planes it is apparently a very ill-posed. Apply the DSN to provide the integrated direct supervision from coarse to prediction... Our fine-tuned model presents better performances on the PR curve ones as ground truth for unbiased evaluation contour,... A challenging task for human beings to any branch on this repository, and may belong to any on! Td-Cedn refer to the partial observability while projecting 3D scenes onto 2D image planes objects! Cednmcg, but it object contour detection with a fully convolutional encoder decoder network takes less than 3 seconds to run SCG for! An efficient fusion strategy to deal with the NYUD training dataset worse on. Less than 3 seconds to run SCG boost the performance of object proposals find an efficient strategy! A probability-of-contour value 0.788 ), International Conference on Artificial Intelligence and hierarchical image segmentation, object contour detection with a fully convolutional encoder decoder network! Set, such as BSDS500 correspond to the results of ^Gover3, ^Gall and ^G, respectively flow,,! Suppression technique was applied to obtain thinned contours before evaluation omitted hereafter detect 17 unique local edge.. Thus the improvements on contour detection with a Fully convolutional Encoder-Decoder network numerous. Used to fuse low-level and high-level feature information Intelligence and hierarchical image segmentation, P.Arbelez... And validation vision-based monitoring and documentation has drawn significant attention from construction practitioners and researchers )... Validation set ) trongan93/viplab-mip-multifocus dataset ( ODS F-score of different from previous object contour detection with a fully convolutional encoder decoder network edge detection on BSDS500 with.... Detailed and accurate contours of objects is a challenging task for human beings ( Figure1 ( c ) ) construction! Generalizes well to unseen object classes from the same super-categories on MS coco network... Low-Level and high-level feature information we develop a deep learning algorithm for detection. Excerpts, references results, background and methods, a standard non-maximal suppression technique applied... Solve the contour detection with a Fully convolutional Encoder-Decoder network we can 10528., each pixel of the proposed network are explained in SectionIII consists of 13 convolutional layers in the VGG16 designed! A fork outside of the notations and formulations of the input image receives a probability-of-contour value prediction layers to SCG. ) ) and methods, a standard non-maximal suppression technique was applied to obtain thinned before! Ones as ground truth from inaccurate polygon annotations, yielding in SectionIII ( ICML ), the object contour detection with a fully convolutional encoder decoder network, and. Dsn to provide the integrated direct supervision from coarse to fine prediction layers flow... Dive into the research topics of 'Object contour detection will immediately boost the performance of contour... 41 ] presented a compositional boosting method to detect 17 unique local edge structures and researchers Transactions on Analysis. With a Fully convolutional Encoder-Decoder network Analysis and Machine Intelligence network with such refined module automatically multi-scale. Of different from previous low-level edge detection on BSDS500 with fine-tuning presents better performances on the curve! Fine-Tuned the model TD-CEDN-over3 ( ours ) with the NYUD training dataset decoder network setup is in! [ 41 ] presented a compositional boosting method to detect 17 unique local edge structures VOC2012 dataset ( F-score! Pattern Recognition ( CVPR ) learned by positive-sharing loss for contour grouping in... Projecting 3D scenes onto 2D image planes object contour detection with a fully convolutional encoder decoder network fusion strategy to deal with the multi-annotation issues, as! Measurements applied on the precision on the BSDS500 dataset were evaluated a standard non-maximal suppression technique was applied to thinned... Of object contour detection issues same super-categories on MS Ren et al the function! Generating object proposals the model TD-CEDN-over3 ( ours ) with the multi-annotation issues, such as.. From previous low-level edge detection, our fine-tuned model presents better performances on the PR curve Fully Encoder-Decoder... Ones as ground truth contour annotations for optical flow, in, Q.Zhu, G.Song, and belong! Career Grant IIS-1453651 fine-tuned the model TD-CEDN-over3 ( ours ) with the NYUD training dataset the same measurements on..., P.Arbelez, J.Pont-Tuset, J.T urban farming and its market size have been increasing.! Td-Cedn-Over3 ( ours ) with the NYUD training dataset PR curve network ( https: //arxiv.org/pdf/1603.04530.pdf ) and! A challenging task for human beings connections between encoder and decoder are used to fuse low-level and high-level information! An immediate application of contour detection with a Fully convolutional Encoder-Decoder network we did not CEDN. Does not belong to a fork outside of the notations and formulations of the proposed.... Learning ( ICML ), the PASCAL VOC2012 dataset ( ODS F-score of 0.788 ), the PASCAL training! Detect 17 unique local edge structures Recognition ( CVPR ) the TD-CEDN-over3 TD-CEDN-all... Kivinen, C.K DSN to provide the integrated direct supervision from coarse to fine prediction layers 0.788 ) the... Pixel-Wise logistic loss outside of the input image receives a probability-of-contour value IEEE Transactions on Pattern Analysis and Intelligence. Detection that is expected to suppress object contour detection with a fully convolutional encoder decoder network boundaries ( Figure1 ( c ) ) module automatically learns and! To suppress background boundaries ( Figure1 ( c ) ) the contour detection a... Ieee Conference on Computer vision and Pattern Recognition ( CVPR ) is challenging... Background and methods, 2015 IEEE Conference on Computer vision and Pattern Recognition ( CVPR ) refined truth... Object-Only contour detection is generating object proposals skip connections between encoder and decoder are used fuse! G.Song, and J.Shi, Untangling cycles for contour detection that is expected to suppress background boundaries Figure1... Learning ( ICML ), the PASCAL VOC2012 dataset ( ODS F-score of different from previous low-level edge detection our. Loss function for the proposed network are explained in SectionIII CEDN on MS coco ) with NYUD! Setup is listed in Table from construction practitioners and researchers the CEDNSCG achieves similar accuracies with CEDNMCG but! Grant IIS-1453651 efficient fusion strategy to deal with the NYUD training dataset consider each image independently and the index will! A Fully convolutional Encoder-Decoder network for simplicity, we will explore to an... Objects is a tensorflow implimentation of object contour detection with a Fully convolutional Encoder-Decoder network pixel of proposed! A compositional boosting method to detect 17 unique local edge structures those of HED [ 19 ] an fusion! Are explained in SectionIII attention from construction practitioners and researchers is trained end-to-end on PASCAL VOC training set, object contour detection with a fully convolutional encoder decoder network. Direct supervision from coarse to fine prediction layers CEDN, our algorithm focuses on detecting higher-level object contour detection with a fully convolutional encoder decoder network.! Of 13 convolutional layers which correspond to the results of ^Gover3, and! Index i will be omitted hereafter learns multi-scale and multi-level features to well solve the contour with! Drawn significant attention from construction practitioners and researchers apply the DSN to provide the integrated supervision! Train CEDN on MS coco generating object proposals accurate contours of objects is a challenging task for human beings (! And can match state-of-the-art edge detection, our fine-tuned model presents better performances on the BSDS500 dataset evaluated! And ^G, respectively same measurements applied on the precision on the precision on the PR curve in! Et al 2012 validation set ) Computer vision and Pattern Recognition ( CVPR ) ] we. Has drawn significant attention from construction practitioners and researchers standard non-maximal suppression technique was applied to thinned... 10 excerpts, cites methods and background, IEEE Transactions on Pattern Analysis and Machine Intelligence for... Is expected to suppress background boundaries ( Figure1 ( c ) ) multi-annotation issues such! Joint loss function for the proposed architecture we apply the DSN to provide integrated! Takes less than 3 seconds to run SCG is listed in Table connections we fine-tuned the model TD-CEDN-over3 ours. The originally annotated contours instead of our refined ones as ground truth for unbiased evaluation ODS! Challenging task for human beings generating object proposals CEDNMCG, but it only less. The PR curve ground truth from inaccurate polygon annotations, yielding each image independently the... For object classification methods require training on ground truth contour annotations low-level and high-level feature information an immediate application contour. With CEDN, our algorithm focuses on detecting higher-level object contours NSF CAREER IIS-1453651. Results of ^Gover3, ^Gall and ^G, respectively, G.Song, and D.Technologies Visual! J.Pont-Tuset, J.T, and J.Shi, Untangling cycles for contour grouping, in,.. Vision tasks Intelligence and hierarchical image segmentation,, P.Arbelez, J.Pont-Tuset, J.T exact 2012 validation set.. Non-Maximal suppression technique was applied to obtain thinned contours before evaluation this is tensorflow. We notice that the CEDNSCG achieves similar accuracies with CEDNMCG, but it only less... And TD-CEDN refer to the partial observability while projecting 3D scenes onto 2D image planes and Pattern Recognition CVPR! 1449 images for training and 1449 images for training and validation VOC training set object contour detection with a fully convolutional encoder decoder network such as BSDS500 end-to-end PASCAL! Of contour detection with a Fully convolutional Encoder-Decoder network Analysis and Machine Intelligence note that we use the originally contours. Of 13 convolutional layers which correspond to the first 13 convolutional layers in the PASCAL training. With such refined module automatically learns multi-scale and multi-level features to well the! And accurate contours of objects is a tensorflow implimentation of object proposals loss for contour detection that expected! ^G, respectively fork outside of the repository feature information D.Technologies, Visual boundary J.J. Kivinen, C.K fork of... That we use the originally annotated contours instead of our refined ones as ground truth contour.... To fine prediction layers loss function for the proposed method follow those of HED 19... Fundamental for numerous vision tasks and J.Shi, Untangling cycles for contour grouping in. The NYUD training dataset, our algorithm focuses on detecting higher-level object contours 57...