Multi-distribution functionals reduce to integrals of coincidence divergences
Monotonicity under data processing and additivity on independent products force every such functional to an integral over four strata
full image
Statistics
Monotonicity under data processing and additivity on independent products force every such functional to an integral over four strata
full image
When Stronger Triggers Backfire: A High-Dimensional Theory of Backdoor Attacks
Proportional-regime analysis shows attack success peaks then falls while clean performance improves with training trigger strength.
full image
Contradiction Graphs Determine VC Dimension
Vertices are realizable label sequences of length m; edges mark label disagreements on shared points, fixing whether dimension meets or tops
full image
Online Safety Monitoring for LLMs
Risk-calibrated thresholding on external verifier signals performs competitively on reasoning and red teaming tasks.
full image
Leakage-free wavelet classifier shows the two image types are equivalent yet use separate phase and magnitude channels.
full image
Inference for Group Interaction Experiments
In a sparse-sampling regime, standard cluster methods account for dependencies from interference and group formation.
full image
The Dual Nature of LLM Persona: Aggregated Tendencies and Frame-Dependent Geometry
Aggregate trait scores resist frame changes while correlation structure drops 42% on mismatch and recovers with alignment.
full image
Merging of Bayes and quasi-Bayes empirical Bayes procedures for Poisson compound decisions
Concentration rates of marginal PMFs produce matching regret decay, so the faster quasi-Bayesian method performs equivalently in the multidi
full image
Cross-Audit Projection for Model Risk Prediction
Resampling audit plus asymptotic projection corrects over-optimism in binary classification risk estimates without sacrificing leading accur
full image
Instrumented difference-in-differences under case-control sampling
After modeling retrospective sampling bias, instruments whose direct outcome effect does not change over time identify trend effects in case
full image
Accounting for center differences reveals when local adoption decisions still need more data to confirm net benefit.
full image
MATCH: Multiplier-Assisted Tests for Conditional Hypotheses in Non-Euclidean Data
Sample splitting and random multipliers on held-out losses produce Gaussian limits without tangent coordinates or residual terms.
full image
Goodness of Fit Tests Based on Joint Densities of Multiple Sample Statistics
Simulations show the procedures match or exceed classical and Zhang methods for continuous nulls with known parameters.
full image
Aggregation with Exponential Weights is Optimal in Expectation
The bound holds for large constant temperatures on bounded Lipschitz strongly convex losses without Bernstein assumptions
full image
MLP and GNN branches stay separate until the final step, enabling direct inspection of each contribution after pretraining on larger data.
full image
Prediction Sets for Counterfactual Decisions: Coverage, Optimality, and Conformal Prediction
Equivalence to risk-averse optimization produces explicit optimal sets and a conformal method with finite-sample coverage guarantees.
full image
Conformal Bayes for Two-Sided Censored Gaussian Regression under Label Shift
Mixed atom-density calibration weights yield smaller valid sets than source-score methods in two-sided censored Gaussian regression.
full image
A note on "The volume of random simplices from elliptical distributions in high dimension"
Central and stable limits for log-volumes of high-dimensional random simplices now hold under relaxed assumptions on the population matrix.
Quaternion Nondecimated Wavelet Descriptors for Multiclass Breast Histology Classification
Color-coupled nondecimated transforms produce balanced accuracy on BACH data without pretrained networks or external data.
full image
Evaluation across 19 drivers shows the rolling-volatility method matches a simple threshold while violating its core exchangeability assumpt
Sequential Structure-Sensitive Residual Diagnostics for PDE Inverse Problems
Sequential e-process rejects bad fits early using spatial residual patterns with anytime-valid error control.
full image
Three regimes of instrument strength arise from unit sizes in growing panels, dictating rates and valid inference methods.
full image
Grouped Orthogonal Arrays from Orthogonal Arrays and Difference Schemes
Provides new designs with more groups and larger sizes for experiments assuming negligible cross-group interactions.
Resolution of the Detection Threshold Conjecture for Random Geometric Graphs in the d>n Regime
Proves conjecture by showing total variation distance to Erdős–Rényi vanishes when d ≫ (nh(p))^3 and d > n.
Born Discrete, Made Smooth: Variational Formulation of Shallow Neural Networks
A continuum variational problem on parameter densities turns training convex and yields the minimizer directly from a linear system with exp
full image
Simulation of 431 tasks found only 30 met R-hat ≤ 1.01 and ESS ≥ 400, so report full diagnostics instead of fit success alone.
Plausibility: Exact inference in R
R package implements the framework for regression models and supports exact tests on data examples.
full image
Moment-Based Selection of Multiresponse Linear Mixed-Effects Models
It reduces the problem to convex optimization using cross-moment identities and establishes finite-sample guarantees under sub-Weibull error
full image
Continuous-item formulation recovers skill at r=0.96 and improves Brier score by 0.33 over expert curves on synthetic cohort of 80 riders.
full image
Autorelevance function and other feature relevance measures for univariate time series
Shapley-based measures with one-step forecast replacement for missing lags identify expected patterns across ARMA and neural models.
Statistical Properties of k-means Clustering for Data Missing Completely at Random
Recovery of the true centers holds under a missing-probability and separation condition, provided centers differ in every dimension.
full image
L2-perturbation theory converts existing covariance kernel rates into optimal sup-norm and normality results for the associated eigenfunctio
full image
Beyond Laplace: Closed-form wrapped Gaussian posterior approximations on statistical manifolds
Contrast functions approximate maps on statistical manifolds, removing geodesic solvers and curvature calculations.
full image
Regularized Variational and Spectral Log-Density-Ratio Estimation in the Gaussian Location Model
In the Gaussian location model, the risk ordering reverses with the observation-to-dimension ratio under ridge regularization.
full image
Pattern-Calibrated Multimodal Prediction under Blockwise Missingness
Bounds decompose error into overlap size, calibration gap, and representation error, showing when borrowing beats local fitting.
full image
Role-Aware Neural Convex Divergence Heads for Asymmetric Representation Learning
Source and target projections before convex neural divergences model asymmetric relations like entailment while keeping scores nonnegative.
full image
Distributed sources cannot be uniquely recovered from static patterns, but a transcription site allows physics-informed methods to infer the
full image
Full Bayesian Reinforcement Learning via LF-IBIS
LF-IBIS approximates posteriors over parameters and policies from simulation data alone to support uncertainty-aware decisions.
full image
An Adaptive Glicko-2 Rating Framework for Probabilistic Football Forecasting and Season Simulation
Dynamic ratings plus ordered-logit probabilities feed Monte Carlo simulations of remaining league fixtures.
full image
From Subgroups to Population Composition: A Transportability Approach to Effect Heterogeneity
Modeling effects in hypothetical populations with shifted prevalences ranks characteristics by their link to differential vulnerability.
full image
The construction supplies infinite series for the copula and density whose low-order truncations already match target dependence in numerica
full image
Structured CTBN model on 33,558 UK Biobank participants identifies cardiometabolic and inflammatory disease modules.
Learning Effective Soliton Dynamics from Scattering Data
Weak-form identification inside the inverse scattering framework yields low-dimensional models that hold in perturbed regimes.
An unsupervised kernel norm monitoring for fault detection in a time series photovoltaic system
KNM maps normal data windows into kernel space to flag sensor and shading issues in solar systems better than standard baselines.
full image
The Agentic Garden of Forking Paths
Different personas cause agents to reach opposing conclusions from the same dataset, showing selective reporting among valid paths is the co
full image
The benefit reverses when privacy is weaker and the usual tension with robustness returns.
full image
How to Allocate Your Tokens? Scaling Laws with Training Steps and Batch Size
Splitting training data into steps and batch size lets the scaling law fit with fewer experiments while predicting best token use.
full image
Sensitivity Analysis and Optimization of Stochastic Epidemic Models under Parameter Uncertainty
Unbiased estimators for stochastic models show weaker herd-immunity effects and more cautious intervention levels when parameters are drawn
full image
Sampling for Region-Aggregated Spatial Scan Statistics
Uniform sampling from each area's geometry and even value spreading recovers most detection ability lost by using centroids alone.
full image
Conditional Inference Trees and Forests for Feature Selection
Benchmarks on 30 datasets show competitive top-k performance with bias-controlled split selection.
full image
The MJ statistic checks whether at least one candidate model is correct and supplies a selection rule, all from null-model estimates alone.
full image
Beyond Degree: Rooted Motif Signatures for Latent Position Identifiability in Graphon Models
In generic finite-rank graphons, higher-order rooted patterns recover unique connectivity profiles even when degrees are identical.
full image
From Approximation to Emergence: A Theory of Deep Learning
The account organizes results on optimization, scaling, transformers, and alignment by what each explains and what each leaves out.
full image
Decision-Aware Training for Sample-Based Generative Models
Sample-based generative models learn to penalize downstream decision costs while retaining full probabilistic output.
full image
Characterizing and Identifying Separable Graphical Models
Missing edges always admit separating sets, enabling canonical representations and an identification algorithm for equivalence classes
Function-Counting Theory for Low-Dimensional Data Structures
Extending Cover's counting theory shows how manifold structure shapes classification capacity and generalization.
full image
Deep Multitask Learning for Mixed-Type Outcomes with Shared Sparsity
Shared first layer and group Lasso yield excess risk bounds plus consistent selection of common predictors.
full image
The doubly robust estimator uses an instrumental variable and martingale residuals, without needing a no-switching subset, and applies to mu
The algorithm simulates conditional mean changes to quantify how node manipulations alter psychological network distributions.
Beyond the Flow: A Bayesian Latent Clustering Framework for Shared Micro-mobility Users in Venice
Users are grouped from raw trip sequences rather than summaries, separating localized, commuter, and tourist patterns.
full image
The smallest balanced induced subgraph's densest part determines when exact recovery from a random graph becomes possible with high probabil
full image
Hierarchical Variational Kalman Filtering
Reformulated inference cuts iterations and supports higher-order trackers that become zero-phase over full history.
full image
Transfert learning and adaptive LASSO quantile
Two L1 penalties from a source database estimator deliver sparsity and faster computation than standard adaptive LASSO.
Formulation yields interpretable link and faster variable selection than joint model in mixed-scale data
full image
Convolutional Symmetric AutoEncoders: enhancing latent stability via differential geometry
Extending representation consistency to convolutional layers cuts reconstruction errors and boosts robustness on advection, Burgers and Kura
full image
Approximate full-conformal multi-task regression with reproducing kernels
The construction yields a computable region guaranteed to contain the exact full-conformal one, with a volume bound when task covariances ar
full image
Optimal scaling of MCMC algorithms: exploiting the symmetry of the Metropolis-Hastings formula
Symmetry in the acceptance rule lets gradient-based proposals use variance O(1/d^μ) with μ arbitrarily small instead of the MALA rate of 1/3
full image
Active-GRPO: Adaptive Imitation and Self-Improving Reasoning for Molecular Optimization
The method switches from imitation to self-reinforcement per instance and replaces the reference with better policy candidates to exceed pri
full image
Framework tests stability of relationships when moving from structural models to machine learning adjustments on construct scores
full image
Sparse mixtures of learned prototypes keep accuracy within 2.5 points and localize influence to training neighborhoods.
full image
Ghost in the Kernel: In-Context Learning with Efficient Transformers via Domain Generalization
Domain generalization analysis shows how linear attention achieves in-context learning and informs activation choices for large-model linear
full image
Neural Network-Based Estimation of Time-Dependent Parameters in AR(p) Processes
The method keeps an explicit parametric structure while estimating changing coefficients and producing intervals under two noise distributio
full image
Distributed Prediction under Heterogeneity with Unidentifiable Parameter
Trace-similarity penalty and invex relaxation deliver model-free bounds with lower communication cost.
full image
The distribution-based method matches existing widths while reflecting skewness that asymptotic approaches miss in small samples.
full image
Tutorial workflow covers eligibility alignment, matching and selective strategies with R packages to improve efficiency while keeping infere
full image
The reconstruction defect from the backward-coherence penalty acts as a change detector that precedes the standard index in several climates
full image
Joint normality couples drift and scale via third moment of Lévy noise while switching rates stay uncorrelated
full image
From Spectral Methods to Sample Complexity Bounds for Fourier Neural Operators
Bounds hold uniformly over families of equations when operators admit spectral discretizations, with rates set by smoothness and dimension.
LASSO and correlation analysis on five-state data rank predictors and highlight key links to adverse health outcomes
full image
Post-selection inference for network structure
Two methods ensure coverage when communities or markets are identified using the observed connections themselves.
full image