SessionBound: Turning Enterprise Task Approval into Budgeted Database Sessions

Minmin Wu

arxiv: 2607.00751 · v1 · pith:6OJXKCN2new · submitted 2026-07-01 · 💻 cs.DB · cs.CR

SessionBound: Turning Enterprise Task Approval into Budgeted Database Sessions

Minmin Wu This is my paper

Pith reviewed 2026-07-02 03:12 UTC · model grok-4.3

classification 💻 cs.DB cs.CR

keywords AI agentsdatabase authorizationtask approvalsession managementquery budgetingenterprise securityaccess controlPostgreSQL

0 comments

The pith

SessionBound converts approved enterprise tasks into short-lived budgeted database sessions that AI agents must respect.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents SessionBound as a method to bridge the gap between high-level task approvals by managers and low-level SQL execution by AI agents. A control plane handles task templates, approvals, budgets, and signed tokens, while the database runtime binds each token to a session and directly enforces limits on views, rows, fields, operations, queries, and disclosures. Agents can generate any SQL they like, but the system blocks anything outside the token's boundary without using an LLM to judge safety. This matters for enterprises that need auditable, bounded data access for analysis and compliance work while still allowing agents flexibility. A PostgreSQL implementation shows the approach works across 24 validation scenarios with low absolute latency.

Core claim

SessionBound turns approved enterprise tasks into short-lived, budgeted, and auditable database sessions for AI agents. A control plane defines task templates, accepts task applications, records approvals, assigns budgets, and issues signed task tokens. A database runtime, SessionBoundDB, binds a token to a session and enforces safe views, row scope, denied fields, operation limits, query budgets, disclosure budgets, and receipts. The database does not rely on an LLM to decide whether a query is safe. The agent may generate SQL freely, but each attempt must stay inside the approved boundary.

What carries the argument

The signed task token from the control plane, which SessionBoundDB binds to a session to enforce all defined budgets and scopes at runtime.

If this is right

AI agents can generate open-ended SQL while remaining confined to the pre-approved task boundary.
Safety enforcement happens entirely inside the database without calling an LLM to evaluate queries.
Every session produces a receipt that links execution back to the original task approval.
Budgets can separately cap query count, data volume disclosed, and operation types per session.
The prototype adds 1.4-1.5 ms median latency on small queries compared to raw PostgreSQL.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This token-binding pattern could apply to other agent tools such as API endpoints or file systems by issuing similar scoped execution contexts.
Enterprises might reduce manual query review volume if budgets prove sufficient to cover typical task variations.
If many tasks need frequent budget increases, the system could shift from one-time approvals to more iterative template updates.
Measuring how often real tasks hit budget limits would test whether static templates match dynamic analysis needs.

Load-bearing premise

Predefined task templates and static budgets can accurately and completely capture the intended scope of an approved business task without needing runtime LLM judgment or dynamic re-approval.

What would settle it

A real approved task whose legitimate data needs require queries or operations that fall outside the static budgets and scopes, so the system blocks work the approver intended to allow.

Figures

Figures reproduced from arXiv: 2607.00751 by Minmin Wu.

read the original abstract

Enterprise AI agents are useful for internal analysis, audit, compliance review, and operational investigation, but they create a difficult authorization problem. A manager or data owner may approve a business task, while the agent later generates open-ended SQL below the application layer. Existing systems help identify agents, delegate authority, govern data products, or enforce database policy, but they do not directly turn an approved enterprise task into a bounded database execution context. SessionBound fills this gap. It turns approved enterprise tasks into short-lived, budgeted, and auditable database sessions for AI agents. A control plane defines task templates, accepts task applications, records approvals, assigns budgets, and issues signed task tokens. A database runtime, SessionBoundDB, binds a token to a session and enforces safe views, row scope, denied fields, operation limits, query budgets, disclosure budgets, and receipts. The database does not rely on an LLM to decide whether a query is safe. The agent may generate SQL freely, but each attempt must stay inside the approved boundary. A PostgreSQL prototype passed a 24-scenario validation suite. Microbenchmarks show p50 SessionBound execution around 1.4--1.5 ms versus raw PostgreSQL p50 around 0.052--0.074 ms on small synthetic queries: high relative overhead, but low absolute latency.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SessionBound gives a working Postgres prototype that binds signed task tokens to multi-budget session enforcement without LLM query checks, but the static template assumption needs more scrutiny for real tasks.

read the letter

The core idea is straightforward: a control plane approves a task, issues a token with budgets and scopes, and the database runtime enforces those limits on any SQL the agent produces. The agent can generate queries freely as long as they stay inside the pre-approved row predicates, denied fields, operation counts, and disclosure limits. No runtime LLM judgment is involved.

What stands out is the concrete binding mechanism at the database layer itself, combining token validation with view restrictions, numeric budgets, and audit receipts. The PostgreSQL prototype handled a 24-scenario suite and kept absolute latency low (around 1.4 ms p50), which matters for practical deployment even with the relative slowdown.

The main soft spot is the reliance on static task templates defined ahead of time. If a business task has boundaries that depend on runtime data values or conditional aggregates, the fixed predicates and budgets may either over-restrict or leave gaps. The abstract does not show how the validation suite tested such cases, and the evaluation details on coverage or failure modes are thin. Microbenchmarks are reported but without deeper workload characterization.

This is aimed at enterprise teams handling AI agents on internal databases who need auditable, non-LLM enforcement. It deserves peer review because it ships a functional implementation that directly tackles an authorization gap, even if the template completeness claim would benefit from more evidence.

Referee Report

2 major / 1 minor

Summary. The paper presents SessionBound, a system architecture that converts approved enterprise tasks into short-lived, budgeted, and auditable database sessions for AI agents. A control plane handles task templates, applications, approvals, budget assignment, and signed task tokens; the SessionBoundDB runtime binds tokens to sessions and enforces constraints including safe views, row scopes, denied fields, operation limits, query budgets, disclosure budgets, and receipts. The database enforces boundaries without relying on an LLM for safety decisions, allowing agents to generate arbitrary SQL as long as it stays within the pre-approved boundary. A PostgreSQL prototype is reported to have passed a 24-scenario validation suite, with microbenchmarks showing p50 latency of 1.4-1.5 ms versus 0.052-0.074 ms for raw PostgreSQL.

Significance. If the central claims hold, the work provides a practical mechanism for task-based authorization in enterprise database settings that supports AI agents while maintaining auditability and avoiding runtime LLM-based policy decisions. The explicit design separating control-plane approval from runtime enforcement, combined with the prototype implementation and absolute-latency microbenchmarks, demonstrates feasibility for low-overhead enforcement. This could strengthen compliance and security for internal analysis and operational tasks.

major comments (2)

[Abstract and Evaluation] Abstract and Evaluation section: the claim that static task templates plus budgets (row scope, denied fields, query/disclosure budgets) can completely delimit an approved business task without runtime LLM judgment or re-approval is load-bearing, yet the 24-scenario validation suite provides no details on coverage of data-dependent or conditional boundaries (e.g., scopes defined by runtime aggregates or dynamic sensitivity labels). Without such coverage or an explicit scope limitation, the central claim that arbitrary SQL stays inside the boundary cannot be assessed.
[Abstract] Abstract: the statement that the PostgreSQL prototype 'passed a 24-scenario validation suite' is presented without reporting test coverage, failure modes, how budgets were chosen, or the specific constraints exercised; this directly affects soundness assessment of the enforcement claims.

minor comments (1)

[Abstract] The microbenchmark description would benefit from explicit query sizes, table cardinalities, and whether the overhead includes token validation or view materialization.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting the need for greater transparency in the evaluation claims. We address each major comment below and will revise the manuscript accordingly to improve assessability of the central claims.

read point-by-point responses

Referee: [Abstract and Evaluation] Abstract and Evaluation section: the claim that static task templates plus budgets (row scope, denied fields, query/disclosure budgets) can completely delimit an approved business task without runtime LLM judgment or re-approval is load-bearing, yet the 24-scenario validation suite provides no details on coverage of data-dependent or conditional boundaries (e.g., scopes defined by runtime aggregates or dynamic sensitivity labels). Without such coverage or an explicit scope limitation, the central claim that arbitrary SQL stays inside the boundary cannot be assessed.

Authors: The SessionBound mechanisms are designed exclusively for boundaries that can be expressed statically at approval time via task templates, row scopes, denied fields, and budgets; data-dependent boundaries such as those based on runtime aggregates or dynamic sensitivity labels are outside the supported scope and would require separate runtime policy mechanisms. We will revise the Abstract and Evaluation sections to state this scope limitation explicitly. We will also expand the validation description to enumerate the 24 scenarios and confirm they exercise only the supported static boundary types, allowing the claim to be assessed within its intended limits. revision: yes
Referee: [Abstract] Abstract: the statement that the PostgreSQL prototype 'passed a 24-scenario validation suite' is presented without reporting test coverage, failure modes, how budgets were chosen, or the specific constraints exercised; this directly affects soundness assessment of the enforcement claims.

Authors: We agree that the current presentation lacks necessary detail on the validation suite. In the revision we will update the Abstract with a concise qualifier and add a dedicated Evaluation subsection that reports the scenario selection criteria, parameterization of budgets and scopes, the specific constraints exercised in each scenario, and any failure modes observed during prototype development. The suite is intended as targeted validation of the enforcement primitives rather than exhaustive coverage of all possible SQL. revision: yes

Circularity Check

0 steps flagged

No circularity: system design with no derivations or fitted predictions

full rationale

The paper describes a system architecture (control plane for task templates/approvals/budgets, SessionBoundDB runtime enforcing views/row scopes/budgets) and a PostgreSQL prototype validated on 24 scenarios. No equations, no parameter fitting, no predictions that reduce to inputs, and no self-citation chains appear in the provided text. The central claim is an engineering construction whose correctness is evaluated by prototype behavior rather than by any derivation that could be circular. This matches the default expectation for non-mathematical system papers.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 2 invented entities

The design rests on the domain assumption that task approvals can be translated into static, enforceable templates and budgets that remain valid for the duration of the session; the system introduces new constructs (task tokens, SessionBoundDB) whose correctness depends on correct template definition rather than runtime verification.

axioms (1)

domain assumption Enterprise tasks can be accurately represented by predefined templates with associated budgets and scopes that do not require dynamic LLM-based adjustment.
The control plane and database enforcement both depend on the templates being sufficient; this is stated in the description of how approvals are recorded and tokens are issued.

invented entities (2)

task token no independent evidence
purpose: Signed credential that binds an approved task to a database session and carries the budget and scope limits.
New construct introduced to link the control-plane approval to runtime enforcement.
SessionBoundDB no independent evidence
purpose: Database runtime layer that intercepts queries and enforces the token-defined boundaries without LLM involvement.
Core new component described as the enforcement point.

pith-pipeline@v0.9.1-grok · 5760 in / 1520 out tokens · 62500 ms · 2026-07-02T03:12:43.709220+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

11 extracted references

[1]

Now Publishers, 2014

Cynthia Dwork and Aaron Roth.The Algorith- mic Foundations of Differential Privacy, volume9 ofFoundations and Trends in Theoretical Com- puter Science. Now Publishers, 2014

2014
[2]

D. Hardt. The OAuth 2.0 Authorization Frame- work. RFC 6749, 2012

2012
[3]

Authenticated Delegation and Authorized AI Agents, 2025

Tobin South, Samuele Marro, Thomas Hard- jono, Robert Mahari, Cedric Deslandes Whitney, Dazza Greenwood, Alan Chan, and Alex Pent- land. Authenticated Delegation and Authorized AI Agents, 2025

2025
[4]

Sharma, Linxi Jiang, Zhiqiang Lin, and Shuo Chen

Reshabh K. Sharma, Linxi Jiang, Zhiqiang Lin, and Shuo Chen. PAuth - Precise Task-Scoped Authorization For Agents, 2026

2026
[5]

Marco Tonnarelli, Filippo Scaramuzza, Simon Harrer, and Linus W. Dietz. Data Product MCP: Chat with your Enterprise Data, 2026

2026
[6]

Model Context Protocol.Model Context Protocol Specification, 2025

2025
[7]

Or- acle, 2026

Oracle.What Is Oracle Deep Data Security. Or- acle, 2026

2026
[8]

Oracle, 2026

Oracle.Oracle Deep Data Security Technical Brief. Oracle, 2026

2026
[9]

Korn, Abhishek Parmar, Christina Richards, and Mengzhi Wang

Ruoming Pang, Ramon Caceres, Mike Burrows, Zhifeng Chen, Pratik Dave, Nathan Germer, Alexander Golynski, Kevin Graney, Nina Kang, Lea Kissner, Jeffrey L. Korn, Abhishek Parmar, Christina Richards, and Mengzhi Wang. Zanz- ibar: Google’s Consistent, Global Authorization 6 System. In2019 USENIX Annual Technical Conference (USENIX ATC 19), pages 33–46. USENIX...

2019
[10]

PostgreSQL, 2026

PostgreSQL Global Development Group.Row Security Policies. PostgreSQL, 2026

2026
[11]

N. R. Adam and J. C. Wortmann. Security- Control Methods for Statistical Databases: A Comparative Study.ACM Computing Surveys, 21(4):515–556, 1989. 7

1989

[1] [1]

Now Publishers, 2014

Cynthia Dwork and Aaron Roth.The Algorith- mic Foundations of Differential Privacy, volume9 ofFoundations and Trends in Theoretical Com- puter Science. Now Publishers, 2014

2014

[2] [2]

D. Hardt. The OAuth 2.0 Authorization Frame- work. RFC 6749, 2012

2012

[3] [3]

Authenticated Delegation and Authorized AI Agents, 2025

Tobin South, Samuele Marro, Thomas Hard- jono, Robert Mahari, Cedric Deslandes Whitney, Dazza Greenwood, Alan Chan, and Alex Pent- land. Authenticated Delegation and Authorized AI Agents, 2025

2025

[4] [4]

Sharma, Linxi Jiang, Zhiqiang Lin, and Shuo Chen

Reshabh K. Sharma, Linxi Jiang, Zhiqiang Lin, and Shuo Chen. PAuth - Precise Task-Scoped Authorization For Agents, 2026

2026

[5] [5]

Marco Tonnarelli, Filippo Scaramuzza, Simon Harrer, and Linus W. Dietz. Data Product MCP: Chat with your Enterprise Data, 2026

2026

[6] [6]

Model Context Protocol.Model Context Protocol Specification, 2025

2025

[7] [7]

Or- acle, 2026

Oracle.What Is Oracle Deep Data Security. Or- acle, 2026

2026

[8] [8]

Oracle, 2026

Oracle.Oracle Deep Data Security Technical Brief. Oracle, 2026

2026

[9] [9]

Korn, Abhishek Parmar, Christina Richards, and Mengzhi Wang

Ruoming Pang, Ramon Caceres, Mike Burrows, Zhifeng Chen, Pratik Dave, Nathan Germer, Alexander Golynski, Kevin Graney, Nina Kang, Lea Kissner, Jeffrey L. Korn, Abhishek Parmar, Christina Richards, and Mengzhi Wang. Zanz- ibar: Google’s Consistent, Global Authorization 6 System. In2019 USENIX Annual Technical Conference (USENIX ATC 19), pages 33–46. USENIX...

2019

[10] [10]

PostgreSQL, 2026

PostgreSQL Global Development Group.Row Security Policies. PostgreSQL, 2026

2026

[11] [11]

N. R. Adam and J. C. Wortmann. Security- Control Methods for Statistical Databases: A Comparative Study.ACM Computing Surveys, 21(4):515–556, 1989. 7

1989