Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani, James Henderson · 2021 · DOI 10.18653/v1/2021.acl-long.47

3 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

cs.LG · 2026-07-02 · conditional · novelty 6.0

A 4B compiler model generates LoRA adapters from natural-language specs, enabling a frozen 0.6B interpreter to match Qwen3-32B performance on fuzzy text tasks at 50× less memory.

SCOPE: Structured Prototype-Guided Adaptation for EEG Foundation Models with Limited Labels

cs.LG · 2026-02-19 · unverdicted · novelty 6.0

SCOPE uses cohort-level external supervision, confidence-aware pseudo-labels, and a lightweight prototype-conditioned adapter (ProAdapter) to adapt frozen EEG foundation models in label-limited settings, reporting consistent gains across 50 experimental configurations.

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

cs.CL · 2023-05-23 · conditional · novelty 6.0

UltraChat supplies 1.5 million high-quality multi-turn dialogues that, when used to fine-tune LLaMA, produce UltraLLaMA, which outperforms prior open-source chat models including Vicuna.

citing papers explorer

Showing 3 of 3 citing papers.

Program-as-Weights: A Programming Paradigm for Fuzzy Functions
cs.LG · 2026-07-02 · conditional · none · ref 19

SCOPE: Structured Prototype-Guided Adaptation for EEG Foundation Models with Limited Labels
cs.LG · 2026-02-19 · unverdicted · none · ref 10

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
cs.CL · 2023-05-23 · conditional · none · ref 188
