Low-Latency Task-Oriented Image Transmission with Opportunistic Spectrum Access

Beatriz Soret; Jo\~ao Henrique Inacio de Souza; Mateus P. Mota; Mattia Merluzzi; Petar Popovski

arxiv: 2607.01921 · v1 · pith:QC6FF3TDnew · submitted 2026-07-02 · 💻 cs.IT · cs.AI· math.IT

Low-Latency Task-Oriented Image Transmission with Opportunistic Spectrum Access

Jo\~ao Henrique Inacio de Souza , Mattia Merluzzi , Mateus P. Mota , Beatriz Soret , Petar Popovski This is my paper

Pith reviewed 2026-07-03 05:22 UTC · model grok-4.3

classification 💻 cs.IT cs.AImath.IT

keywords task-oriented communicationopportunistic spectrum accessVQ-VAElatent representationsimage classificationlow-latency transmissioncross-layer latency modelclassification accuracy

0 comments

The pith

Sending VQ-VAE latent representations over idle spectrum channels cuts image classification latency by 79 times and 3.3 times with accuracy drops of only 5.7 percent and 2.4 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to show that a task-oriented system can replace conventional separate source and channel coding by encoding images into discrete latent representations with a vector-quantized variational autoencoder and transmitting those representations over opportunistically accessed idle licensed channels using ordinary digital modulation. The receiver then uses its AI model to extract the information needed for the downstream task directly from the compressed stream. A cross-layer model incorporates the times for compression, block errors, retransmissions, and random channel access to predict end-to-end latency. Readers would care because the approach promises to keep task performance high even when spectrum is scarce and channels fade, something traditional bit-perfect delivery struggles to achieve.

Core claim

The transmission framework with opportunistic spectrum access sends discrete latent representations learned via a vector-quantized variational autoencoder over idle licensed channels using standard digital modulation, allowing the AI-powered receiver to reconstruct task-related information from the heavily compressed data, which results in at least 79- and 3.3-fold latency reductions with only 5.7% and 2.4% drops in classification accuracy compared to benchmarks using conventional source and channel coding.

What carries the argument

Discrete latent representations produced by a vector-quantized variational autoencoder and transmitted digitally over opportunistically accessed idle channels, so the receiver can perform the task without recovering the original image.

If this is right

The scheme delivers at least 79-fold and 3.3-fold lower latency than conventional coding while losing only 5.7 percent and 2.4 percent accuracy on image classification.
Task execution remains reliable under limited spectrum and fading channels because the system transmits only the compressed features needed for the task.
The cross-layer latency model accounts for compression time, block errors, retransmissions, and stochastic idle-channel access in a single calculation.
Conventional separate source and channel coding produces higher latency under identical spectrum and channel constraints.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same latent-representation approach could be tested on video or sensor data streams to check whether the latency gains generalize beyond still images.
Spectrum regulators might need new rules for how many devices can opportunistically use the same idle bands when many are sending compressed task data rather than full files.
The framework implies that future networks could allocate resources according to task accuracy targets instead of bit-error rates, changing how schedulers decide which packets to send first.

Load-bearing premise

The AI model at the receiver can still recover enough task-relevant information from the heavily compressed latent data.

What would settle it

A measurement showing that classification accuracy falls by more than 5.7 percent or 2.4 percent at the reported latency levels, or that the latency reduction falls below 79-fold or 3.3-fold when the same images, channels, and task are used.

Figures

Figures reproduced from arXiv: 2607.01921 by Beatriz Soret, Jo\~ao Henrique Inacio de Souza, Mateus P. Mota, Mattia Merluzzi, Petar Popovski.

**Figure 2.** Figure 2: Proposed framework for task-oriented, low-latency image transmission with opportunistic spectrum access, leveraging low-dimensional latent [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 3.** Figure 3: Latency as a function of (a) SNR, (b) channel non-access probability, and (c) the number of channels. Curves denote the median, while shaded [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 5.** Figure 5: PSNR of the successfully decoded images as a function of SNR. [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

**Figure 6.** Figure 6: Examples of compressed images by the CTx and received images by the CRx considering the proposed framework and the benchmark transmission [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

read the original abstract

Communication systems designed for reliable data reconstruction, rather than task-oriented communication, typically rely on separate source and channel coding and incur high latency under limited spectrum availability and fading channels. To address this, we propose a transmission framework with opportunistic spectrum access, in which the transmitter sends discrete latent representations learned via a vector-quantized variational autoencoder (VQ-VAE) over idle licensed channels using standard digital modulation. The AI-powered receiver is still able to reconstruct task-related information from the heavily compressed data. We develop a cross-layer latency model that accounts for compression, block errors, retransmissions, and stochastic channel access. Results on latency-accuracy trade-offs show that the proposed scheme achieves at least 79- and 3.3-fold latency reductions with only 5.7% and 2.4% drops in classification accuracy compared to benchmarks using conventional source and channel coding. The framework enables low-latency communication and reliable task execution even under limited spectrum availability and challenging channel conditions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims 79x and 3.3x latency reductions for classification by sending VQ-VAE latents over opportunistic licensed channels, but the abstract does not confirm the benchmarks use the same access model.

read the letter

The one thing to know is that the authors report 79-fold and 3.3-fold latency cuts for a classification task using VQ-VAE latents sent over opportunistically accessed channels, with accuracy drops of only 5.7% and 2.4%. The abstract does not make clear whether the conventional benchmarks use the same access method.

What stands out as new is the end-to-end setup that links the VQ-VAE compression directly to a stochastic access model and a latency calculation that includes retransmissions. The paper does a solid job of explaining why task-oriented transmission can beat reconstruction-oriented approaches under spectrum limits.

The soft spot is the attribution of the gains. The stress-test note is right: if the benchmarks do not also use opportunistic access, then the big latency numbers are driven by the channel access rather than the AI receiver or the compression. The abstract gives no sign that the benchmarks match on that point. There is also zero information on the actual experiments, which makes the claims hard to evaluate.

This paper would be useful for people who work on practical implementations of semantic or task-based comms in constrained spectrum. A reader who wants to see how a cross-layer model can be built for these systems could get some ideas from it.

It deserves a serious referee to sort out the benchmark details and check the full experimental section. I would send it to review.

Referee Report

1 major / 1 minor

Summary. The manuscript proposes a task-oriented image transmission framework that encodes images into discrete latent representations using a vector-quantized variational autoencoder (VQ-VAE), transmits these over opportunistically accessed idle licensed channels with standard modulation, and employs an AI-powered receiver to extract task-relevant information. A cross-layer latency model is derived that incorporates compression overhead, block errors, retransmissions, and stochastic channel access. Numerical results are reported showing at least 79-fold and 3.3-fold latency reductions with 5.7% and 2.4% drops in classification accuracy relative to benchmarks that use conventional source and channel coding.

Significance. If the reported latency-accuracy trade-offs are shown to arise from the VQ-VAE component under controlled conditions, the work would provide concrete evidence that semantic compression combined with opportunistic access can substantially lower latency for task execution in spectrum-constrained fading channels, advancing goal-oriented communication techniques.

major comments (1)

[Abstract] Abstract: the latency-reduction claims (79-fold and 3.3-fold) are presented as resulting from the proposed scheme versus 'benchmarks using conventional source and channel coding,' yet the abstract supplies no indication that the benchmarks employ the identical opportunistic spectrum access model, idle-channel statistics, block-error model, or retransmission policy. Without this control, the quantitative gains cannot be attributed to the VQ-VAE compression and receiver reconstruction, which is the central premise of the task-oriented approach.

minor comments (1)

[Abstract] Abstract, second paragraph: the phrase 'the AI-powered receiver is still able to reconstruct task-related information' is imprecise; the manuscript should state the exact task metric (e.g., top-1 classification accuracy on a named dataset) used to quantify the 5.7% and 2.4% drops.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive comment on the abstract. The concern is valid regarding explicit control of comparison conditions, and we address it directly below.

read point-by-point responses

Referee: [Abstract] Abstract: the latency-reduction claims (79-fold and 3.3-fold) are presented as resulting from the proposed scheme versus 'benchmarks using conventional source and channel coding,' yet the abstract supplies no indication that the benchmarks employ the identical opportunistic spectrum access model, idle-channel statistics, block-error model, or retransmission policy. Without this control, the quantitative gains cannot be attributed to the VQ-VAE compression and receiver reconstruction, which is the central premise of the task-oriented approach.

Authors: We agree that the abstract should explicitly indicate the controlled comparison. In the full manuscript, the cross-layer latency model (accounting for compression overhead, block errors, retransmissions, and stochastic channel access) is applied identically to both the proposed VQ-VAE scheme and the conventional source-channel coding benchmarks; the same idle-channel statistics and retransmission policy are used throughout Section IV and the numerical results. The reported latency reductions are therefore attributable to the semantic compression and task-oriented receiver. We will revise the abstract to read 'compared to benchmarks using conventional source and channel coding under the same opportunistic spectrum access model and channel conditions.' revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation self-contained

full rationale

The paper develops a cross-layer latency model incorporating compression, errors, retransmissions and stochastic access, then reports simulation-based latency-accuracy trade-offs against external benchmarks. No equations or parameters are defined in terms of the target results, no fitted quantities are relabeled as predictions, and no load-bearing self-citations or uniqueness theorems are invoked in the supplied text. The quantitative claims rest on explicit modeling and comparison rather than reducing to the inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies insufficient detail to enumerate free parameters, axioms, or invented entities; VQ-VAE is treated as an off-the-shelf component.

pith-pipeline@v0.9.1-grok · 5720 in / 1063 out tokens · 33736 ms · 2026-07-03T05:22:15.117130+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 1 canonical work pages

[1]

Proakis and Masoud Salehi , publisher=

John G. Proakis and Masoud Salehi , publisher=. Digital Communications , edition=. 2008 , address=

2008
[2]

Searching for

Howard, Andrew and Sandler, Mark and Chen, Bo and Wang, Weijun and Chen, Liang-Chieh and Tan, Mingxing and Chu, Grace and Vasudevan, Vijay and Zhu, Yukun and Pang, Ruoming and Adam, Hartwig and Le, Quoc , booktitle=. Searching for. 2019 , month=

2019
[3]

Kingma and Jimmy Ba , booktitle=

Diederik P. Kingma and Jimmy Ba , booktitle=. Adam: A Method for Stochastic Optimization , year=
[4]

Strinati, Emilio Calvanese and Di Lorenzo, Paolo and Sciancalepore, Vincenzo and Aijaz, Adnan and Kountouris, Marios and Gündüz, Deniz and Popovski, Petar and Sana, Mohamed and Stavrou, Photios A. and Soret, Beatriz and Cordeschi, Nicola and Scardapane, Simone and Merluzzi, Mattia and Zanzi, Lanfranco and Renato, Mauro Boldi and Quek, Tony and Pietro, Nic...

2024
[5]

2024 , month=

Zhang, Anbang and Guo, Shuaishuai , journal=IEEE_J_COML, title=. 2024 , month=

2024
[6]

2023 , month=

Xie, Songjie and Ma, Shuai and Ding, Ming and Shi, Yuanming and Tang, Mingjian and Wu, Youlong , journal=IEEE_J_JSAC, title=. 2023 , month=

2023
[7]

2023 , month=

Hu, Qiyu and Zhang, Guangyi and Qin, Zhijin and Cai, Yunlong and Yu, Guanding and Li, Geoffrey Ye , journal=IEEE_J_WCOM, title=. 2023 , month=

2023
[8]

Variational image compression with a scale hyperprior , booktitle =

Johannes Ball. Variational image compression with a scale hyperprior , booktitle =
[9]

Duan, Yifan and She, Changyang and Zhao, Guodong and Quek, Tony Q. S. , booktitle=. Delay Analysis and Computing Offloading of URLLC in Mobile Edge Computing Systems , year=
[10]

and Ropokis, George A

Filippou, Miltiades C. and Ropokis, George A. and Gesbert, David and Ratnarajah, Tharmalingam , journal=. Joint Sensing and Reception Design of SIMO Hybrid Cognitive Radio Systems , year=
[11]

Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits , year=

Dakdouk, Hiba and Sana, Mohamed and Merluzzi, Mattia , booktitle=. Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits , year=
[12]

Rate and Channel Adaptation in Cognitive Radio Networks Under Time-Varying Constraints , year=

Qureshi, Muhammad Anjum and Tekin, Cem , journal=. Rate and Channel Adaptation in Cognitive Radio Networks Under Time-Varying Constraints , year=
[13]

Image Segmentation Semantic Communication over Internet of Vehicles , year=

Pan, Qiang and Tong, Haonan and Lv, Jie and Luo, Tao and Zhang, Zhilong and Yin, Changchuan and Li, Jianfeng , booktitle=. Image Segmentation Semantic Communication over Internet of Vehicles , year=
[14]

Prompt-Assisted Semantic Interference Cancelation on Moderate Interference Channels , year=

Meng, Zian and Li, Qiang and Pandharipande, Ashish and Ge, Xiaohu , journal=. Prompt-Assisted Semantic Interference Cancelation on Moderate Interference Channels , year=
[15]

Goal-oriented spectrum sharing: Trading edge inference power for data streaming performance,

Goal-oriented Spectrum Sharing: Trading Edge Inference Power for Data Streaming Performance , author=. 2025 , month=. 2503.11552 , archivePrefix=

work page arXiv 2025
[16]

Proceedings of the 31st International Conference on Neural Information Processing Systems , pages =

van den Oord, Aaron and Vinyals, Oriol and Kavukcuoglu, Koray , title =. Proceedings of the 31st International Conference on Neural Information Processing Systems , pages =. 2017 , monht =

2017
[17]

and Maguire, G.Q

Mitola, J. and Maguire, G.Q. , journal=. Cognitive radio: making software radios more personal , year=
[18]

, journal=IEEE_J_JSAC, title=

Haykin, S. , journal=IEEE_J_JSAC, title=. 2005 , month=

2005
[19]

2006 , month =

NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey , journal =. 2006 , month =

2006

[1] [1]

Proakis and Masoud Salehi , publisher=

John G. Proakis and Masoud Salehi , publisher=. Digital Communications , edition=. 2008 , address=

2008

[2] [2]

Searching for

Howard, Andrew and Sandler, Mark and Chen, Bo and Wang, Weijun and Chen, Liang-Chieh and Tan, Mingxing and Chu, Grace and Vasudevan, Vijay and Zhu, Yukun and Pang, Ruoming and Adam, Hartwig and Le, Quoc , booktitle=. Searching for. 2019 , month=

2019

[3] [3]

Kingma and Jimmy Ba , booktitle=

Diederik P. Kingma and Jimmy Ba , booktitle=. Adam: A Method for Stochastic Optimization , year=

[4] [4]

Strinati, Emilio Calvanese and Di Lorenzo, Paolo and Sciancalepore, Vincenzo and Aijaz, Adnan and Kountouris, Marios and Gündüz, Deniz and Popovski, Petar and Sana, Mohamed and Stavrou, Photios A. and Soret, Beatriz and Cordeschi, Nicola and Scardapane, Simone and Merluzzi, Mattia and Zanzi, Lanfranco and Renato, Mauro Boldi and Quek, Tony and Pietro, Nic...

2024

[5] [5]

2024 , month=

Zhang, Anbang and Guo, Shuaishuai , journal=IEEE_J_COML, title=. 2024 , month=

2024

[6] [6]

2023 , month=

Xie, Songjie and Ma, Shuai and Ding, Ming and Shi, Yuanming and Tang, Mingjian and Wu, Youlong , journal=IEEE_J_JSAC, title=. 2023 , month=

2023

[7] [7]

2023 , month=

Hu, Qiyu and Zhang, Guangyi and Qin, Zhijin and Cai, Yunlong and Yu, Guanding and Li, Geoffrey Ye , journal=IEEE_J_WCOM, title=. 2023 , month=

2023

[8] [8]

Variational image compression with a scale hyperprior , booktitle =

Johannes Ball. Variational image compression with a scale hyperprior , booktitle =

[9] [9]

Duan, Yifan and She, Changyang and Zhao, Guodong and Quek, Tony Q. S. , booktitle=. Delay Analysis and Computing Offloading of URLLC in Mobile Edge Computing Systems , year=

[10] [10]

and Ropokis, George A

Filippou, Miltiades C. and Ropokis, George A. and Gesbert, David and Ratnarajah, Tharmalingam , journal=. Joint Sensing and Reception Design of SIMO Hybrid Cognitive Radio Systems , year=

[11] [11]

Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits , year=

Dakdouk, Hiba and Sana, Mohamed and Merluzzi, Mattia , booktitle=. Analyzing and Enhancing Queue Sampling for Energy-Efficient Remote Control of Bandits , year=

[12] [12]

Rate and Channel Adaptation in Cognitive Radio Networks Under Time-Varying Constraints , year=

Qureshi, Muhammad Anjum and Tekin, Cem , journal=. Rate and Channel Adaptation in Cognitive Radio Networks Under Time-Varying Constraints , year=

[13] [13]

Image Segmentation Semantic Communication over Internet of Vehicles , year=

Pan, Qiang and Tong, Haonan and Lv, Jie and Luo, Tao and Zhang, Zhilong and Yin, Changchuan and Li, Jianfeng , booktitle=. Image Segmentation Semantic Communication over Internet of Vehicles , year=

[14] [14]

Prompt-Assisted Semantic Interference Cancelation on Moderate Interference Channels , year=

Meng, Zian and Li, Qiang and Pandharipande, Ashish and Ge, Xiaohu , journal=. Prompt-Assisted Semantic Interference Cancelation on Moderate Interference Channels , year=

[15] [15]

Goal-oriented spectrum sharing: Trading edge inference power for data streaming performance,

Goal-oriented Spectrum Sharing: Trading Edge Inference Power for Data Streaming Performance , author=. 2025 , month=. 2503.11552 , archivePrefix=

work page arXiv 2025

[16] [16]

Proceedings of the 31st International Conference on Neural Information Processing Systems , pages =

van den Oord, Aaron and Vinyals, Oriol and Kavukcuoglu, Koray , title =. Proceedings of the 31st International Conference on Neural Information Processing Systems , pages =. 2017 , monht =

2017

[17] [17]

and Maguire, G.Q

Mitola, J. and Maguire, G.Q. , journal=. Cognitive radio: making software radios more personal , year=

[18] [18]

, journal=IEEE_J_JSAC, title=

Haykin, S. , journal=IEEE_J_JSAC, title=. 2005 , month=

2005

[19] [19]

2006 , month =

NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey , journal =. 2006 , month =

2006