pith. sign in

hub Mixed citations

Multi-Scale Context Aggregation by Dilated Convolutions

Mixed citation behavior. Most common role is background (67%).

38 Pith papers citing it
Background 67% of classified citations
abstract

State-of-the-art models for semantic segmentation are based on adaptations of convolutional networks that had originally been designed for image classification. However, dense prediction and image classification are structurally different. In this work, we develop a new convolutional network module that is specifically designed for dense prediction. The presented module uses dilated convolutions to systematically aggregate multi-scale contextual information without losing resolution. The architecture is based on the fact that dilated convolutions support exponential expansion of the receptive field without loss of resolution or coverage. We show that the presented context module increases the accuracy of state-of-the-art semantic segmentation systems. In addition, we examine the adaptation of image classification networks to dense prediction and show that simplifying the adapted network can increase accuracy.

hub tools

citation-role summary

background 3 method 3

citation-polarity summary

representative citing papers

WaveNet: A Generative Model for Raw Audio

cs.SD · 2016-09-12 · accept · novelty 9.0

WaveNet generates realistic raw audio using an autoregressive neural network with dilated convolutions, achieving state-of-the-art naturalness in speech synthesis for English and Mandarin.

Density estimation using Real NVP

cs.LG · 2016-05-27 · accept · novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.

Cross-Stage Attention Propagation for Efficient Semantic Segmentation

cs.CV · 2026-04-07 · unverdicted · novelty 7.0

CSAP computes attention at the deepest scale and propagates the maps to shallower stages, bypassing per-scale query-key computations to cut decoder FLOPs while preserving multi-scale performance and beating SegNeXt-Tiny on ADE20K, Cityscapes, and COCO-Stuff.

Single Level Feature-to-Feature Forecasting with Deformable Convolutions

cs.CV · 2019-07-26 · unverdicted · novelty 6.0

Single-level feature-to-feature forecasting with deformable convolutions on coarse abstract features from a segmentation backbone achieves state-of-the-art results for nine-timestep future semantic segmentation on Cityscapes validation.

VRSTC: Occlusion-Free Video Person Re-Identification

cs.CV · 2019-07-19 · unverdicted · novelty 6.0

STCnet recovers occluded parts in video person re-ID using spatio-temporal cues to form the VRSTC framework, outperforming prior methods on three datasets.

Rethink the Role of Neural Decoders in Quantum Error Correction

quant-ph · 2026-05-12 · unverdicted · novelty 6.0

Neural decoders for surface-code QEC achieve practical microsecond FPGA latency when trained on large datasets with appropriate inductive biases and INT4 quantization, rather than relying on architectural complexity.

Cross Attention Network for Semantic Segmentation

cs.CV · 2019-07-25 · unverdicted · novelty 5.0

Cross Attention Network fuses spatial and contextual features via a cross attention module to improve semantic segmentation performance and speed on Cityscapes and CamVid.

citing papers explorer

Showing 38 of 38 citing papers.