Low dimension suffices for near-max retrieval margins
Is Dimensionality a Barrier for Retrieval Models?
Dimension O(m^{-2} log n) nearly matches the infinite-dimension margin for any relevance matrix A.
full image
Machine Learning
Papers on all aspects of machine learning research (supervised, unsupervised, reinforcement learning, bandit problems, and so on) including also robustness, explanation, fairness, and methodology. cs.LG is also an appropriate primary category for applications of machine learning methods.
Is Dimensionality a Barrier for Retrieval Models?
Dimension O(m^{-2} log n) nearly matches the infinite-dimension margin for any relevance matrix A.
full image
Disentanglement Beyond Generative Models with Riemannian ICA
The construction drops ICA's global generative requirement while recovering sources consistently across manifold representations.
full image
Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers
Random-walk maps lose information permanently while spectral maps preserve it but hinder local tasks, creating provable depth gaps between 2
full image
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
LACUNA places PII in known weights so researchers can measure whether methods erase knowledge at the source or only change outputs.
full image
Program-as-Weights: A Programming Paradigm for Fuzzy Functions
A 0.6B interpreter running compiler-generated LoRA adapters matches a 32B model on fuzzy text tasks at 1/50th the memory, entirely offline.
full image
Online Safety Monitoring for LLMs
Risk-calibrated thresholding on external verifier signals performs competitively on reasoning and red teaming tasks.
full image
Dual-channel tests show relational pressures create decision divergence absent from isolated prompts
full image
DemoPSD: Disagreement-Modulated Policy Self-Distillation
Blending teacher and student distributions by per-token disagreement preserves exploration and improves cross-domain performance on scientif
full image
Matrix optimizers reach higher accuracy in fewer steps, with largest gains under partial force labels.
full image
Controllable Sim Agents with Behavior Latents
CNeVA matches top imitation models on Waymo data while enabling monotone per-channel steering without reward hacks.
full image
OrbitQuant: Data-Agnostic Quantization for Image and Video Diffusion Transformers
A randomized permuted block-Hadamard transform creates a data-independent basis that serves every timestep and modality.
full image
Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation
The approach raises specialized-task accuracy while avoiding the out-of-domain drop and calibration problems of earlier output-only methods.
full image
Understanding the Robustness of Distributed Self-Supervised Learning Frameworks Against Non-IID Data
Theoretical analysis shows decentralized SSL robustness grows with network connectivity, placing federated learning on equal footing.
full image
Optimal Stabilizer Testing and Learning with Limited Quantum Memory
The usual constant-copy tester vanishes; learning costs Θ(n²/k) non-adaptively, so testing and learning match when memory is fractional
full image
Extreme Adaptive Transformer for Time Series Forecasting
An added attention component for rare peaks yields better 3-day predictions than standard transformers on four hydrologic datasets.
full image
QFedAgent: Quantum-Enhanced Personalized Federated Learning for Multi-Agent Activity Recognition
Replaces 33K classical parameters yet reaches 97.7% accuracy on non-IID wearable activity data.
full image
Neuron-Aware Active Few-Shot Learning for LLMs
By tracking internal patterns for diversity and low consensus, NeuFS cuts annotation cost while beating output-entropy and embedding baselin
full image
LIME: Learning Intent-aware Camera Motion from Egocentric Video
LIME mines language intents and view gains from egocentric recordings to train robots on choosing next viewpoints.
full image
Q-GAIN: A Python Package for Machine Learning and Physically Informed Analysis Applications
Q-GAIN supplies modular tools for classification and object detection, shown on solitons and vortices in cold-atom data.
full image
Fixed SAM masks let the distributional objective align objects instead of scenes, raising tracking, classification, segmentation and re-iden
full image
Fast Multi-dimensional Refusal Subspaces via RFM-AGOP
The method works on both reasoning and non-reasoning models and beats alternatives on ablation tests.
WattGPU: Predicting Inference Power and Latency on Unseen GPUs and LLMs
Public metadata alone enables matching models to hardware without profiling and halves to quarters baseline errors in server scenarios.
full image
DecompRL: Solving Harder Problems by Learning Modular Code Generation
Decomposing into sub-functions creates up to k^n candidates while cutting GPU token cost by about 50 times.
Bringing Agentic Search to Earth Observation Data Discovery
Zero-shot LLM stage added to neural-BM25 fusion improves retrieval without extra training on NASA EO queries.
full image
Transformer Geometry Observatory TGO-II: Representational Similarity Observatory
CKA, SVCCA and dimensionality measures show manifold expansion without loss of token interactions across training.
full image
The Dual Nature of LLM Persona: Aggregated Tendencies and Frame-Dependent Geometry
Aggregate trait scores resist frame changes while correlation structure drops 42% on mismatch and recovers with alignment.
full image
Stable Self-Modulating Quantum Fast-Weight Programmers with Bounded Memory Gates
Tanh on old-state memory removes divergence in long sequences and improves robustness on forecasting tasks.
full image
Self-Gating Attention for Efficient Time Series Forecasting
Shared learnable matrix plus residual replaces query-key projections while matching accuracy on nine real-world datasets.
full image
HNSW with Accuracy Guarantees Using Graph Spanners -- A Technical Report
A lightweight statistical check triggers exact fallback only when the heuristic result may be unreliable, preserving average speed.
full image
On the Role of Directionality in Structural Generalization
CCG parser beats prior best on SLOG directional tests; larger encoders then close the recursion gap by addressing a separate weakness.
full image
One More Time: Revisiting Neural Quantum States from a Reinforcement Learning Perspective
PWO reframes energy minimization as policy gradient to improve stability over Adam and earlier optimizers on spin lattices.
full image
Optimizing Visual Generative Models via Distribution-wise Rewards
Evaluating full sample distributions rather than individuals cuts FID-50K from 8.30 to 5.77 on SiT while preserving diversity.
full image
Generalization in offline RL: The structure is more important than the amount of pessimism
In contextual MDPs, matching the optimal solution's symmetries matters more than how conservative the value function is.
full image
Dendritic In-Context Learning in a Single-Layer Spiking Neural Network
Subthreshold dynamics match leaky online Widrow-Hoff updates, allowing stable ICL in one layer without plasticity or depth.
full image
HERMES: A Multi-Granularity Labeling Substrate for Pre-training Data Mixtures
Equal coverage rule boosts 16-task average by 0.0253 at one prefix length but loses edge when pools shrink 5x.
full image
Aggregation with Exponential Weights is Optimal in Expectation
The bound holds for large constant temperatures on bounded Lipschitz strongly convex losses without Bernstein assumptions
full image
Purified OPSD: On-Policy Self-Distillation Without Losing How to Think
Isolating question-conditioned corrections via reference-only teachers and PMI improves four models on two datasets without losing natural r
full image
MLP and GNN branches stay separate until the final step, enabling direct inspection of each contribution after pretraining on larger data.
full image
Prediction Sets for Counterfactual Decisions: Coverage, Optimality, and Conformal Prediction
Equivalence to risk-averse optimization produces explicit optimal sets and a conformal method with finite-sample coverage guarantees.
full image
Self-explainable Operator Learning for Discovering Spatial Patterns in Functional Data
Localized contributions from input subdomains directly link regions to output patterns in blood-flow and aerodynamics problems.
full image
Fourier Preconditioning for Neural Feature Learning
For stationary signals the transform packs dependence into dominant modes, cutting truncation error without extra training cost.
full image
Online Resource Allocation with Continuous Random Consumption: Regret under Degeneracy
The exponent p on size-weighted mass near cutoffs sets the regret rate for continuous online allocation without non-degenerate fluids.
An Optimisation Framework for the Well-Conditioned Training of Physics-Informed Neural Networks
Doubly-sketched Gauss-Newton with adaptive ratio improves accuracy five to eight orders of magnitude over prior methods while staying faster
full image
Privacy-Preserving and Verifiable Approximate Distributed Coded Computing
GPBACC plus aggregation and group testing limits leaks and isolates adversaries across federated and decentralized settings.
full image
Bayesian Sparse Low-Rank Adaptation for Large Language Model Uncertainty Estimation
Stochastic masking shifts Bayesian uncertainty estimation to the lightweight adapter ranks, keeping reasoning accuracy intact.
full image
In five expert scenarios, 52% of weight-5 criteria were missed by every tested frontier model.
Dynamic Neural Graph Encoding of Inference Processes in Deep Weight Space
Encoding weights as evolving graphs preserves layer-by-layer computation order and raises accuracy over prior static methods.
full image
Tight Lower Bounds for the Multi-Secretary Problem via Bellman Certificates
Mixture of two separated distributions at critical capacity forces quadratic-log gap to offline optimum.
After fixing missing values and imbalance, the model ranks biomarkers while comparing classifiers on standard accuracy measures.
full image
Probing Chemical Language Models: Effects of Pre-training and Fine-tuning
Upper layers improve most; fine-tuning then adjusts task-relevant substructures per chemical rules.
full image
ART for Diffusion Sampling: Continuous-Time Control and Actor-Critic Learning
ART-RL optimizes a sampling-clock speed via actor-critic RL to produce timestep grids that beat fixed schedules at the same budget and trans
full image
Models trained on the modular physics-based degradations generalize better to actual footage and the 81k-frame benchmark reveals current fai
full image
Observer-level accuracy enables automated penile tissue volumetry in 34,412 UK Biobank participants.
full image
Evaluation across 19 drivers shows the rolling-volatility method matches a simple threshold while violating its core exchangeability assumpt
Ask the Right Comparison:Bias-Aware Bayesian Active Top-k Ranking with LLM Judges
Explicit verbosity and position covariates plus targeted queries correct naive aggregation on real LLM benchmarks.
full image
Pathway propagation inside Gaussian process kernels captures both measurements and topology while reporting prediction confidence on small m
full image
WBMM: Windowed Batch Matrix Multiplication for Efficient Large Receptive Field Convolution
By partitioning inputs into windows and using bias tables for weights, it turns irregular memory access into regular batched matrix multipli
full image
Fourier Neural Operators for Rayleigh-B\'enard Convection
Compact 314k-parameter model runs in 7 ms and generalizes to finer meshes, yet accuracy stays bounded by training resolution.
SUNTA: Hierarchical Video Prediction with Surprise-based Chunking
Hierarchical models set boundaries by internal inconsistency and keep accuracy where fixed-chunk baselines collapse after 10 steps.
full image
HaloGuard 1.0: An Open Weights Constitutional Classifier for Multilingual AI Safety
HaloGuard 1.0 beats 27B baselines on seven prompt-safety tests while holding low error rates across 46 languages.
full image
Evidence-State Rewards for Long-Context Reasoning
Maven credits add, link and drop moves by how they change an editable memory, yielding sufficient evidence with fewer distractors.
full image
kNNGuard: Turning LLM Hidden Activations into a Training-Free Configurable Guardrail
A 50-prompt bank and multi-layer nearest-neighbor search classify unsafe inputs without any model training or slow inference.
full image
SA-HGNN: Sample-Adaptive Hyperbolic Graph Neural Network for EEG-Based Depression Recognition
Sample-adaptive graphs and hyperbolic convolutions model connectivity hierarchies that Euclidean networks miss, improving accuracy on public
full image
Structure-aware partitioning and curriculum robust training cut leakage and improve calibration on correlated domains.
full image
A Memory Efficient Unified Algorithm for Online Learning of Linear Dynamical Systems
It delivers sublinear regret when instability is limited to k modes and proves fewer filters cannot work.
Fast and Accurate Anomaly Detection in Time Series
Unsupervised detector beats state-of-the-art on 343 datasets by testing wavelet coefficients without labels or tuning.
full image
Cross-Platform Control for Autonomous Surface Vehicles via Adaptive Reinforcement Learning
Adaptive method with latent dynamics inference cuts position error up to 58% versus non-adaptive baselines on real platforms.
full image
Born Discrete, Made Smooth: Variational Formulation of Shallow Neural Networks
A continuum variational problem on parameter densities turns training convex and yields the minimizer directly from a linear system with exp
full image
Scalable and Distributed Silhouette Approximation
The first provable method needs only O(nkε^{-2}ln(nk/δ)) distances and extends to constant-round MapReduce and MPC implementations.
full image
Liquid Latent State Dynamics for Interpretable Turbofan Degradation Modeling
The model lowers sensor forecast RMSE to 0.2266 and raises degradation-state correlation to 0.596 on C-MAPSS data.
full image
Fixed-protocol tests on CIFAR and Tiny ImageNet place EfficientNet-B0 on every Pareto frontier and show MobileNetV3-Small beating its succes
full image
Assessing VLM Reliability for Medical Image Quality Evaluation Under Corruption and Bias
Benchmark across 16 models and seven modalities links image degradation and metadata to quality-score shifts.
Object Aligner approximates bijections with color refinement so LLM outputs can be scored without label sensitivity
full image
Chronos-2 leads on 200 real feeders; new metric links accuracy directly to grid cost versus failure-risk trade-offs
full image
Towards a Phonology-Informed Evaluation of Multilingual TTS
A classifier trained on human speech flags when synthesized output loses the ATR distinctions that mark grammatical forms.
full image
Autorelevance function and other feature relevance measures for univariate time series
Shapley-based measures with one-step forecast replacement for missing lags identify expected patterns across ARMA and neural models.
A More Accurate Algorithm Comparison through A/B Testing using Offline Evaluation Methods
Sample mean lacks positive correlation that cuts critical errors; new stepwise estimator matches accuracy with half the data
full image
Statistical Properties of k-means Clustering for Data Missing Completely at Random
Recovery of the true centers holds under a missing-probability and separation condition, provided centers differ in every dimension.
full image
Hybrid quantum-classical neural network for sentiment analysis
They match classical models on COVID-19 tweet sentiment but show stronger generalization to SMS spam detection.
Conditional Co-Ablation: Recovering Self-Repair Backups in Transformer Circuits
Scoring the growth in ablation effect after primary removal exposes repair components missed by isolated-unit tests.
full image