pith. sign in

hub Canonical reference

Agent skills: A data-driven analysis of claude skills for extending large language model functionality

Canonical reference. 80% of citing Pith papers cite this work as background.

28 Pith papers citing it
Background 80% of classified citations

hub tools

citation-role summary

background 5

citation-polarity summary

years

2026 28

roles

background 5

polarities

background 4 unclear 1

representative citing papers

FermiLink: A Unified Agent Framework for Multidomain Autonomous Scientific Simulations

physics.chem-ph · 2026-04-03 · conditional · novelty 8.0

FermiLink is a unified AI agent framework that automates multidomain scientific simulations via separated package knowledge bases and a four-layer progressive disclosure mechanism, reproducing 56% of target figures in benchmarks and generating research-grade results on unpublished problems.

Skill Coverage: A Test Adequacy Metric for Agent Skills

cs.AI · 2026-06-09 · unverdicted · novelty 6.0

Skill coverage is a binary test adequacy metric that extracts observable behavior constraints from skill documents and judges whether trajectories provide sufficient evidence to cover each constraint, revealing 39.90-43.98% coverage on SkillsBench.

FederatedSkill: Federated Learning for Agentic Skill Evolution

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

FederatedSkill aggregates client semantic skill diffs via a server evolution agent to enable strictly personalized skill evolution, reporting up to 44.4% higher success rates and 37.5% lower compute cost than self-evolving baselines across 20 task families.

SkillGuard: A Permission Framework for Agent Skills

cs.CR · 2026-06-02 · unverdicted · novelty 6.0

SkillGuard presents a dual-plane permission framework for agent skills that achieves 99.76% taxonomy coverage and reduces attack success rates in evaluations on 315 skills.

CODESKILL: Learning Self-Evolving Skills for Coding Agents

cs.AI · 2026-05-25 · unverdicted · novelty 6.0

CODESKILL trains an LLM policy via RL on hybrid rewards to extract and maintain multi-granularity skills from agent trajectories, raising pass rates 9.69 points over no-skill baselines on three coding benchmarks while keeping the skill bank compact.

Skill Retrieval Augmentation for Agentic AI

cs.CL · 2026-04-27 · unverdicted · novelty 6.0 · 3 refs

Introduces SRA paradigm and SRA-Bench benchmark (5,400 tasks, 26,262 skills) showing retrieval improves performance but LLMs fail to selectively incorporate retrieved skills.

Unsupervised Skill Discovery for Agentic Data Analysis

cs.AI · 2026-06-04 · unverdicted · novelty 5.0

DataCOPE uses verifier-guided contrastive distillation from agent trajectories to discover skills, yielding average gains of 9.71% on report-style and 32.30% on reasoning-style data analysis tasks across four model settings.

citing papers explorer

Showing 28 of 28 citing papers.