Common Flaws in NLP Evaluation Experiments

Ehud Reiter

However, I think journals such as Computational Linguistics and TACL could adjust reviewing procedures to check some of the above. Carlisle (2020) analysed papers submitted to a medical journal in order to identify worthless “zombie” papers. Computational Linguistics. DOI 10.1162/coli_a_00508

What are they thinking?

Allen AI

2020) and Macaw (Tafjord and Clark, 2021), our results show that mental models derived using these LMs’ predictions are significantly inconsistent, with 19–43% conditional violation. 2020) and CSQA (Talmor et al., Association for Computational Linguistics.

ClarifyDelphi

Allen AI

2020; Awad et al., 2020) and enriching it with questions obtained from GPT3. 2022; Levine et al., SocialChem, Rudinger et al. Smith, and Yejin Choi. In Findings of the Association for Computational Linguistics: EMNLP 2020, pp. Proceedings of the National Academy of Sciences 117, no. 42 (2020): 26158–26169.

SQuARE: Towards Multi-Domain and Few-Shot Collaborating Question Answering Agents

ODSC - Open Data Science

Iryna’s work has received numerous awards. Examples are the ACL fellow award 2020 and the first Hessian LOEWE Distinguished Chair award (2.5 million euros) in 2021. She is currently the president of the Association for Computational Linguistics.

AI2 at ACL 2023

Allen AI

Magnusson*, Noah A. Smith*, Jesse Dodge* Scientific progress in NLP rests on the reproducibility of researchers’ claims. The *CL conferences created the NLP Reproducibility Checklist in 2020 to be completed by authors at submission to remind them of key information to include.

Testing the Robustness of LSTM-Based Sentiment Analysis Models

John Snow Labs

References: Langtest GitHub Repository. Large Movie Review Dataset v1.0. Gopalakrishnan, K., & Salem, F. Sentiment Analysis Using Simplified Long Short-term Memory Recurrent Neural Networks. abs/2005.03993. Andrew L. Maas, Raymond E. Daly, Peter T. The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011).

Selective Classification Can Magnify Disparities Across Groups

The Stanford AI Lab Blog

In International Conference on Learning Representations (ICLR), 2020. Jeremy Irvin, Pranav Rajpurkar, Michael Ko, Yifan Yu, Silviana Ciurea-Ilcus, Chris Chute, Henrik Marklund, Behzad Haghgoo, Robyn Ball, Katie Shpanskaya, et al. In Association for Computational Linguistics (ACL), pp.