Mostafa Dehghani
Identifiers
- name variant Mostafa Dehghani 0.60 · backfill
Papers (58)
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #1873
- Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #557
- Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #483
- Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution cs.CV · 2023 · author #1
- PaLI-X: On Scaling up a Multilingual Vision and Language Model cs.CV · 2023 · author #12
- PaLM 2 Technical Report cs.CL · 2023 · author #40
- End-to-End Spatio-Temporal Action Localisation with Video Transformers cs.CV · 2023 · author #4
- Scaling Vision Transformers to 22 Billion Parameters cs.CV · 2023 · author #1
- Dual PatchNorm cs.CV · 2023 · author #2
- Adaptive Computation with Elastic Input Sequence cs.LG · 2023 · author #5
- Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints cs.LG · 2022 · author #8
- Scaling Instruction-Finetuned Language Models cs.LG · 2022 · author #9
- Transcending Scaling Laws with 0.1% Extra Compute cs.CL · 2022 · author #16
- $\Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells cs.LG · 2022 · author #5
- Intersection of Parallels as an Early Stopping Criterion cs.LG · 2022 · author #3
- Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling? cs.LG · 2022 · author #2
- Confident Adaptive Language Modeling cs.CL · 2022 · author #4
- Beyond Transfer Learning: Co-finetuning for Action Localisation cs.CV · 2022 · author #6
- Simple Open-Vocabulary Object Detection with Vision Transformers cs.CV · 2022 · author #9
- UL2: Unifying Language Learning Paradigms cs.CL · 2022 · author #2
- Retrieval-Enhanced Machine Learning cs.LG · 2022 · author #3
- Transformer Memory as a Differentiable Search Index cs.CL · 2022 · author #3
- VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling cs.CV · 2021 · author #4
- PolyViT: Co-training Vision Transformers on Images, Videos and Audio cs.CV · 2021 · author #7
- Discrete Representations Strengthen Vision Transformer Robustness cs.CV · 2021 · author #3
- The Efficiency Misnomer cs.LG · 2021 · author #1
- SCENIC: A JAX Library for Computer Vision Research and Beyond cs.CV · 2021 · author #1
- Exploring the Limits of Large Scale Pre-training cs.LG · 2021 · author #2
- Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers cs.CL · 2021 · author #2
- The Benchmark Lottery cs.LG · 2021 · author #1
- TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? cs.CV · 2021 · author #4
- Gradual Domain Adaptation in the Wild:When Intermediate Distributions are Absent cs.LG · 2021 · author #4
- Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks cs.CL · 2021 · author #3
- Are Pre-trained Convolutions Better than Pre-trained Transformers? cs.CL · 2021 · author #2
- ViViT: A Video Vision Transformer cs.CV · 2021 · author #2
- OmniNet: Omnidirectional Representations from Transformers cs.CV · 2021 · author #2
- Long Range Arena: A Benchmark for Efficient Transformers cs.LG · 2020 · author #2
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale cs.CV · 2020 · author #7
- Efficient Transformers: A Survey cs.LG · 2020 · author #2
- IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression cs.LG · 2020 · author #3
- Transferring Inductive Biases through Knowledge Distillation cs.LG · 2020 · author #2
- MetNet: A Neural Weather Model for Precipitation Forecasting cs.LG · 2020 · author #4
- HiTR: Hierarchical Topic Model Re-estimation for Measuring Topical Diversity of Documents cs.CL · 2018 · author #2
- Universal Transformers cs.CL · 2018 · author #1
- Learning to Rank from Samples of Variable Quality cs.IR · 2018 · author #1
- Neural Networks for Information Retrieval cs.IR · 2018 · author #4
- Learning to Learn from Weak Supervision by Full Supervision stat.ML · 2017 · author #1
- Words are Malleable: Computing Semantic Shifts in Political and Media Discourse cs.CL · 2017 · author #2
- Fidelity-Weighted Learning cs.LG · 2017 · author #1
- Avoiding Your Teacher's Mistakes: Training Neural Networks with Controlled Weak Supervision cs.LG · 2017 · author #1
- On Search Powered Navigation cs.IR · 2017 · author #1
- Learning to Attend, Copy, and Generate for Session-Based Query Suggestion cs.IR · 2017 · author #1
- Share your Model instead of your Data: Privacy Preserving Mimic Learning for Ranking cs.IR · 2017 · author #1
- Neural Networks for Information Retrieval cs.IR · 2017 · author #4
- Neural Ranking Models with Weak Supervision cs.IR · 2017 · author #1
- Hierarchical Re-estimation of Topic Models for Measuring Topical Diversity cs.IR · 2017 · author #2
- On Horizontal and Vertical Separation in Hierarchical Text Classification cs.IR · 2016 · author #1
- Generalized Group Profiling for Content Customization cs.IR · 2016 · author #1
Mentions
- 2305.10403 #40 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2307.06304 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2301.13195 #5 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2302.01327 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2304.12160 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2210.07998 #5 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2205.05131 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2212.05055 #8 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2302.05442 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2210.11416 #9 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2210.11399 #16 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2207.07061 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2202.06991 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2208.09529 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2207.10551 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2205.06230 #9 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2207.03807 #6 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2205.01230 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2106.11297 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2111.10493 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2110.12894 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2009.06732 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2109.10686 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2105.03322 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2112.05692 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2111.12993 #7 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2103.15691 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2110.11403 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2110.02095 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2107.07002 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2106.06080 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2106.04489 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2010.11929 #7 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2006.12459 #3 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2103.01075 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2011.04006 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2006.00555 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2003.12140 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1807.03819 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1810.05436 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1806.08694 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1711.02799 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1801.02178 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1711.00313 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1711.11383 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1711.05603 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1708.03418 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1711.00310 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1707.07605 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1707.04242 #4 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1704.08803 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1701.04273 #2 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1609.00514 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 1609.00511 #1 · arxiv_oai · confidence 0.70 Mostafa Dehghani
- 2305.18565 #12 · arxiv_oai · confidence 0.70 Mostafa Dehghani
Frequent Coauthors
- Yi Tay 20 shared papers
- Neil Houlsby 13 shared papers
- Anurag Arnab 12 shared papers
- Donald Metzler 12 shared papers
- Jaap Kamps 12 shared papers
- Dara Bahri 8 shared papers
- Josip Djolonga 8 shared papers
- Alexey Gritsenko 7 shared papers
- Basil Mustafa 7 shared papers
- Denny Zhou 7 shared papers
- Hosein Azarbonyad 7 shared papers
- Mario Lu\v{c}i\'c 7 shared papers
- Siamak Shakeri 7 shared papers
- Aliaksei Severyn 6 shared papers
- Maarten de Rijke 6 shared papers
- Maarten Marx 6 shared papers
- Matthias Minderer 6 shared papers
- Samira Abnar 6 shared papers
- Slav Petrov 6 shared papers
- Xavier Garcia 6 shared papers