Florian Metze
Identifiers
- name variant Florian Metze 0.60 · backfill
Papers (79)
- Beyond Words: Towards Effective Modeling of Non-Verbal Vocalizations in ASR eess.AS · 2026 · author #10
- Enhancing Conversational TTS with Cascaded Prompting and ICL-Based Online Reinforcement Learning eess.AS · 2026 · author #7
- Error-aware Quantization through Noise Tempering cs.LG · 2022 · author #4
- Normalized Contrastive Learning for Text-Video Retrieval cs.IR · 2022 · author #5
- Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models cs.CL · 2022 · author #4
- SQuAT: Sharpness- and Quantization-Aware Training for BERT cs.LG · 2022 · author #4
- CTC Alignments Improve Autoregressive Translation cs.CL · 2022 · author #5
- ASR2K: Speech Recognition for Around 2000 Languages without Audio cs.CL · 2022 · author #2
- Masked Autoencoders that Listen cs.SD · 2022 · author #7
- LegoNN: Building Modular Encoder-Decoder Models cs.CL · 2022 · author #6
- On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization cs.CL · 2022 · author #4
- Robustness of Neural Architectures for Audio Event Detection cs.SD · 2022 · author #4
- AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification cs.SD · 2022 · author #4
- On Adversarial Robustness of Large-scale Audio Visual Learning cs.SD · 2022 · author #5
- Speech Summarization using Restricted Self-Attention cs.CL · 2021 · author #4
- VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding cs.CV · 2021 · author #6
- Differentiable Allophone Graphs for Language-Universal Speech Recognition cs.CL · 2021 · author #4
- Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding cs.CL · 2021 · author #5
- Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers cs.CV · 2021 · author #5
- VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding cs.CV · 2021 · author #7
- Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks cs.CL · 2021 · author #4
- Self-supervised object detection from audio-visual correspondence cs.CV · 2021 · author #5
- Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning cs.CV · 2021 · author #5
- Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models cs.CV · 2021 · author #5
- NoiseQA: Challenge Set Evaluation for User-Centric Question Answering cs.CL · 2021 · author #4
- Audio-Visual Event Recognition through the lens of Adversary cs.CV · 2020 · author #5
- Multimodal Speech Recognition with Unstructured Audio Masking cs.CL · 2020 · author #3
- On Long-Tailed Phenomena in Neural Machine Translation cs.CL · 2020 · author #4
- Support-set bottlenecks for video-text representation learning cs.CV · 2020 · author #4
- Fine-Grained Grounding for Multimodal Speech Recognition cs.CL · 2020 · author #3
- Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations stat.ML · 2020 · author #7
- How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language cs.CV · 2020 · author #6
- Contextual RNN-T For Open Domain ASR eess.AS · 2020 · author #5
- AlloVera: A Multilingual Allophone Database cs.CL · 2020 · author #8
- ASR Error Correction and Domain Adaptation Using Machine Translation eess.AS · 2020 · author #5
- Universal Phone Recognition with a Multilingual Allophone System cs.CL · 2020 · author #11
- Towards Zero-shot Learning for Automatic Phonemic Transcription cs.CL · 2020 · author #6
- Looking Enhances Listening: Recovering Missing Speech Using Images cs.CL · 2020 · author #3
- Gun Source and Muzzle Head Detection cs.CV · 2020 · author #3
- Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models cs.CL · 2019 · author #4
- On Compositionality in Neural Machine Translation cs.CL · 2019 · author #3
- Adversarial Music: Real World Audio Adversary Against Wake-word Detection System cs.CR · 2019 · author #6
- Multitask Learning For Different Subword Segmentations In Neural Machine Translation cs.CL · 2019 · author #3
- On Leveraging the Visual Modality for Neural Machine Translation cs.CL · 2019 · author #5
- On Dimensional Linguistic Properties of the Word Embedding Space cs.CL · 2019 · author #4
- SANTLR: Speech Annotation Toolkit for Low Resource Languages cs.CL · 2019 · author #5
- Multilingual Speech Recognition with Corpus Relatedness Sampling cs.CL · 2019 · author #4
- Cross-Attention End-to-End ASR for Two-Party Conversations eess.AS · 2019 · author #3
- Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions cs.CL · 2019 · author #3
- Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion cs.CL · 2019 · author #3
- Multimodal Abstractive Summarization for How2 Videos cs.CL · 2019 · author #4
- Grounding Object Detections With Transcriptions cs.MM · 2019 · author #3
- Acoustic-to-Word Models with Conversational Context Information cs.CL · 2019 · author #2
- The ARIEL-CMU Systems for LoReHLT18 cs.CL · 2019 · author #18
- Phoneme Level Language Models for Sequence Based Low Resource ASR cs.CL · 2019 · author #4
- Learned In Speech Recognition: Contextual Acoustic Word Embeddings cs.CL · 2019 · author #3
- Learning from Multiview Correlations in Open-Domain Videos cs.LG · 2018 · author #4
- Multimodal Grounding for Sequence-to-Sequence Speech Recognition cs.CL · 2018 · author #5
- How2: A Large-scale Dataset for Multimodal Language Understanding cs.CL · 2018 · author #7
- Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling cs.SD · 2018 · author #2
- A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling cs.SD · 2018 · author #3
- Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset cs.CV · 2018 · author #6
- Dialog-context aware end-to-end speech recognition cs.CL · 2018 · author #2
- Domain Robust Feature Extraction for Rapid Low Resource ASR Development cs.CL · 2018 · author #3
- Acoustic-to-Word Recognition with Sequence-to-Sequence Models eess.AS · 2018 · author #2
- Hierarchical Multi Task Learning With CTC cs.CL · 2018 · author #2
- End-to-End Multimodal Speech Recognition eess.AS · 2018 · author #3
- Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks cs.SD · 2018 · author #3
- Sequence-based Multi-lingual Low Resource Speech Recognition cs.CL · 2018 · author #3
- Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop cs.CL · 2018 · author #5
- A Light-Weight Multimodal Framework for Improved Environmental Audio Tagging cs.SD · 2017 · author #4
- Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection cs.SD · 2017 · author #5
- Subword and Crossword Units for CTC Acoustic Models cs.CL · 2017 · author #3
- Visual Features for Context-Aware Speech Recognition cs.CL · 2017 · author #4
- Annotating High-Level Structures of Short Stories and Personal Anecdotes cs.CL · 2017 · author #4
- Comparison of Decoding Strategies for CTC Acoustic Models cs.CL · 2017 · author #3
- A Comparison of deep learning methods for environmental sound cs.SD · 2017 · author #3
- Robust end-to-end deep audiovisual speech recognition cs.CL · 2016 · author #2
- EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding cs.CL · 2015 · author #3
Mentions
- 2206.03318 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 2207.06405 #7 · arxiv_oai · confidence 0.70 Florian Metze
- 2212.11790 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2212.05603 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2210.15734 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2205.11686 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2210.07171 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2210.05200 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2209.02842 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 2203.13448 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2205.03268 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2104.06401 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2203.12122 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2110.06263 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2103.10211 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2106.05392 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2109.14084 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 2105.09996 #7 · arxiv_oai · confidence 0.70 Florian Metze
- 2107.11628 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2106.15065 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2001.11120 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 2009.05739 #7 · arxiv_oai · confidence 0.70 Florian Metze
- 2105.00573 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2103.08849 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2008.08143 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 2102.08345 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2010.02824 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2011.07430 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2010.08642 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 2010.04924 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2010.02384 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 2006.03411 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1910.02211 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 2004.08031 #8 · arxiv_oai · confidence 0.70 Florian Metze
- 2003.07692 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 2002.11800 #11 · arxiv_oai · confidence 0.70 Florian Metze
- 2002.11781 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 2002.05639 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1907.00477 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1911.01497 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1911.00126 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 1911.03782 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1910.12368 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1910.02754 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1908.01067 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1908.01060 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1906.06147 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1907.10726 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1906.11604 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1906.07901 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1905.08796 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1811.08890 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1902.08899 #18 · arxiv_oai · confidence 0.70 Florian Metze
- 1902.07613 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1811.03865 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1902.06833 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1810.09052 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1810.09050 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1807.07104 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1811.00347 #7 · arxiv_oai · confidence 0.70 Florian Metze
- 1807.10984 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1809.00241 #6 · arxiv_oai · confidence 0.70 Florian Metze
- 1807.09597 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1808.02171 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1712.06855 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1804.09713 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1804.01146 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1712.09673 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1802.07420 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1712.09680 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1710.06917 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1802.05092 #5 · arxiv_oai · confidence 0.70 Florian Metze
- 1712.00489 #4 · arxiv_oai · confidence 0.70 Florian Metze
- 1708.04469 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1703.06902 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 1611.06986 #2 · arxiv_oai · confidence 0.70 Florian Metze
- 1507.08240 #3 · arxiv_oai · confidence 0.70 Florian Metze
- 2607.01563 #10 · arxiv_oai · confidence 0.70 Florian Metze
- 1507.08240 #3 · backfill · confidence 0.70 Florian Metze
Frequent Coauthors
- Siddharth Dalmia 19 shared papers
- Ramon Sanabria 14 shared papers
- Shruti Palaskar 12 shared papers
- Xinjian Li 12 shared papers
- Alan W Black 11 shared papers
- Juncheng Li 9 shared papers
- Po-Yao Huang 8 shared papers
- Shuhui Qu 8 shared papers
- Shinji Watanabe 7 shared papers
- Graham Neubig 6 shared papers
- Juncheng B Li 6 shared papers
- Vikas Raunak 6 shared papers
- Alan W. Black 5 shared papers
- David R. Mortensen 5 shared papers
- Tejas Srinivasan 5 shared papers
- Yun Wang 5 shared papers
- Alexander Hauptmann 4 shared papers
- Andrea Vedaldi 4 shared papers
- Brian Yan 4 shared papers
- Christoph Feichtenhofer 4 shared papers