Richard Bergmair's Publikationen

Bergmair, Richard. 2009. “A Proposal on Evaluation Measures for RTE.” In Proceedings of the 2009 Workshop on Applied Textual Inference (TextInfer), 10–17. Suntec, Singapore: Association for Computational Linguistics.

We outline problems with the interpretation of accuracy in the presence of bias, arguing that the issue is a particularly pressing concern for RTE evaluation. Furthermore, we argue that average precision scores are unsuitable for RTE, and should not be reported.

We advocate mutual information as a new evaluation measure that should be reported in addition to accuracy and confidence-weighted score.

==> wie für Konferenz [PDF]
==> Folien Konferenz [PDF]
==> vom Konferenz-Veranstalter
==> auf ResearchGate
==> auf archive.org