Curriculum NLI @ NAACL 2022: Where Models Fail

NLP
NLU
knowledge graphs
NAACL
conference
paper
Curriculum NLI at NAACL 2022 — a sobering look at lexical/logical/commonsense/comprehension fail-modes. Knowledge graphs should help.
Author

synesis

Published

July 13, 2022

Curriculum NLI fail modes. Image: LinkedIn.

A few months ago I posted my summary of this excellent paper, which was presented earlier today in NAACL2022 (my earlier post, Michael Witbrock):

Zeming Chen, and Qiyue Gao. 2022. “Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding.” arXiv [cs.CL]. arXiv. http://arxiv.org/abs/2204.06283.

Looking again at the current-gen NLI fail modes evident in the charts, it’s worth singling out those tasks where the learning curves fall much short of expectation:

It’s sobering that the current tech can’t even get lexical phenomena right. This is where KnowledgeGraph can potentially help.

Originally posted on LinkedIn.