Aryaman Arora » Acknowledgements
    
    2025
    Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
Ken Ziyu Liu, Christopher A. Choquette-Choo, Matthew Jagielski, Peter Kairouz, Sanmi Koyejo, Percy Liang, Nicolas Papernot
    In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties
Nathan Roll, Calbert Graham, Yuka Tatsumi, Kim Tien Nguyen, Meghan Sumner, Dan Jurafsky
    Soft production preferences emerge from a bottleneck on memory
Neil Rathi, Richard Futrell, Dan Jurafsky
    2024
    Self-refining diffusion samplers: Enabling parallelization via parareal iterations
Nikil Roashan Selvam, Amil Merchant, Stefano Ermon
    The Semantic Hub Hypothesis: Language models share semantic representations across languages and modalities
Zhaofeng Wu, Xinyan Velocity Yu, Dani Yogatama, Jiasen Lu, Yoon Kim
    Attacking vision-language computer agents via pop-ups
Yanzhe Zhang, Tao Yu, Diyi Yang
    Mechanistic?
Naomi Saphra, Sarah Wiegreffe
    From insights to actions: The impact of interpretability and analysis research on NLP
Marius Mosbach, Vagrant Gautam, Tomás Vergara-Browne, Dietrich Klakow, Mor Geva
    Demystifying Verbatim Memorization in Large Language Models
Jing Huang, Diyi Yang, Christopher Potts
    Hydragen: High-Throughput LLM Inference with Shared Prefixes
Jordan Juravsky, Bradley Brown, Ryan Ehrlich, Daniel Y. Fu, Christopher Ré, Azalia Mirhoseini
    Coding reliable LLM-based integrated task and knowledge agents with GenieWorksheets
Harshit Joshi, Shicheng Liu, James Chen, Robert Weigle, Monica S. Lam
    Representation fine-tuning on vision tasks
Zheng Wang
    Sketch2Code: Evaluating vision-language models for interactive web design prototyping
Ryan Li, Yanzhe Zhang, Diyi Yang
    Improving access to untranscribed speech corpora using AI
Nay Myo San
    Language learning meets Generative AI: Utilizing large language models for metalinguistic explanations
Shabnam Behzad
    Can LLMs generate novel research ideas? A large-scale human study with 100+ NLP researchers
Chenglei Si, Diyi Yang, Tatsunori Hashimoto
    Machine unlearning in 2024
Ken Ziyu Liu
    gzip predicts data-dependent scaling laws
Rohan Pandey
    A Universal Dependencies treebank for Gujarati
Mayank Jobanputra, Maitrey Mehta, Çağrı Çöltekin
    Connecting language technologies with rich, diverse data sources covering thousands of languages
Daan van Esch, Sandy Ritchie, Sebastian Ruder, Julia Kreutzer, Clara Rivera, Ishank Saxena, Isaac Caswell
    Improved neural protoform reconstruction via reflex prediction
Liang Lu, Jingzhi Wang, David R. Mortensen
    Design2Code: How far are we from automating front-end engineering?
Chenglei Si, Yanzhe Zhang, Zhengyuan Yang, Ruibo Liu, Diyi Yang
    ColorSwap: A color and word order dataset for multimodal evaluation
Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush
    I am a Strange Dataset: Metalinguistic tests for language models
Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela
    2023
    Human raters cannot distinguish English translations from original English texts
Shira Wein
    Verifying annotation agreement without multiple experts: A case study with Gujarati SNACS
Maitrey Mehta, Vivek Srikumar
    Semantic composition in visually grounded language models
Rohan Pandey
    Syntax-guided neural module distillation to probe compositionality in sentence embeddings
Rohan Pandey
    2022
    Putting context in SNACS: A 5-Way classification of adpositional pragmatic markers
Yang Janet Liu, Jena D. Hwang, Nathan Schneider, Vivek Srikumar
    The Borrowings Kṣuta-/kṣut- (“Inimical”) and Vidumāla- (“Retrograde”) in Sanskrit Astrological Texts and the Representation of Semitic *ʿayn* in Similar Loans
Ola Wikander
    2021
    Mischievous nominal constructions in Universal Dependencies
Nathan Schneider, Amir Zeldes
    Las tecnologías de Reconocimiento Automático de Voz y su incorporación a los métodos de transcripción de lenguas indígenas
Hilaria Cruz
    Results of the Second SIGMORPHON Shared Task on Multilingual Grapheme-to-Phoneme Conversion
Lucas F.E. Ashby, ..., Winnie Yan
    Draw *mir* a Sheep: A supersense-based analysis of German case and adposition semantics
Jakob Prange, Nathan Schneider
    Inverse problems for a class of stochastic ordinary differential equations in a generalized fiducial framework
Samopriya Basu
    2014
    Sustainability strategies in supply chain management
Amit Arora