publications

  1. arXiv
    SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
    John Yang, Carlos E. Jimenez, Alex L. Zhang, Kilian Lieret, Joyce Yang, Xindi Wu, Ori Press, Niklas Muennighoff, Gabriel Synnaeve, Karthik R. Narasimhan, Diyi Yang, Sida I. Wang, Ofir Press
    In arXiv 2024
  2. NeurIPS
    SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
    In NeurIPS 2024
  3. ICLR (oral)
    SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
    In ICLR 2024
  4. EMNLP
    C-STS: Conditional Semantic Textual Similarity
    Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan
    In EMNLP 2023
  5. EMNLP Findings
    MUX-PLMs: Data Multiplexing for High-throughput Language Models
    Vishvak Murahari, Ameet Deshpande, Carlos E. Jimenez, Izhak Shafran, Mingqiu Wang, Yuan Cao, Karthik Narasimhan
    In EMNLP Findings 2023
  6. NeurIPS
    DataMUX: Data Multiplexing for Neural Networks
    Vishvak Murahari, Carlos E. Jimenez, Runzhe Yang, Karthik Narasimhan
    In NeurIPS 2022
  7. ACL
    CARETS: A Consistency And Robustness Evaluative Test Suite for VQA
    Carlos E. Jimenez, Olga Russakovsky, Karthik Narasimhan
    In ACL 2022