science
Monte Carlo Semantics: Robust Inference and Logical Pattern Processing with Natural Language Text.   thesis submitted to the University of Cambridge in partial fulfillment of the degree requirements for the Doctor of Philosophy, July 2010.
A Proposal on Evaluation Measures for RTE.   Proceedings of the 2009 Workshop on Applied Textual Inference (TextInfer). August 2009. Singapore.
McPIET at RTE-4: Robust Inference and Logical Pattern Processing Based on Integrated Deep and Shallow Semantics.   Proceedings of the Text Analysis Conference (TAC '08). November 2008. Gaithersburg, MD.
Some Notes on the Economics and Evaluation of Automatic Retrieval and Filtering of Communication Goods.   unpublished manuscript, November 2007. • talk presented at the Software Competence Center Hagenberg, Hagenberg, September 2007.
A Comprehensive Bibliography of Linguistic Steganography.   Edward J. Delp and Ping Wah Wong (eds.), Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, volume 6505, January 2007.
Closed Domain Question Answering Using Fuzzy Semantics.   thesis submitted to the University of Cambridge in partial fulfillment of the degree requirements for the Master of Philosophy, July 2006. • talk at the Cambridge NLIP Group Friday Seminar, Cambridge, November 2006. • talk at the 23rd Chaos Communication Congress, Berlin, December 2006.
In the present report we give a thorough exposition of our first steps towards a theory of fuzzy semantics and towards the development of a closed domain question answering system. The system takes the form of a natural language database interface that produces result sets whose records are ranked according to the degree to which they fulfill our intuitions about vague expressions in natural language, and vague adjectives in particular.
We outline our ordering-based approach to semantics and introduce some of the issues involved in modelling vagueness. We show how fuzzy sets can be used as intermediate semantic representations of vague expressions. From the family of possible fuzzy logics we pin down which one best fits our modelling needs in fuzzy semantics.
We then describe the overall design of a controlled experiment involving human subjects, the software infrastructure necessary for administering it, and the statistical analyses required to draw conclusions from the data about the adequacy of our model of fuzzy semantics. We also discuss the results from a small-scale preliminary instantiation of this experiment.
Finally, we show how we could put those theoretical insights to use in a working natural language interface to a database that produces rankings of "small cities" or "rainy cities near San Francisco".
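As a rough illustration of the ranking idea described above, the following minimal sketch scores records by fuzzy membership and sorts them by that score. It is not the system from the report: the piecewise-linear membership functions, their thresholds, the use of the minimum to combine predicates, and the `City` records with their values are all assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class City:
    """Hypothetical record; the fields and values are illustrative only."""
    name: str
    population: int
    rainy_days: int      # rainy days per year
    distance_km: float   # distance from a reference city


def falling(x, lo, hi):
    """Membership 1 at or below lo, 0 at or above hi, linear in between."""
    if x <= lo:
        return 1.0
    if x >= hi:
        return 0.0
    return (hi - x) / (hi - lo)


def rising(x, lo, hi):
    """Membership 0 at or below lo, 1 at or above hi, linear in between."""
    return 1.0 - falling(x, lo, hi)


# Assumed membership functions for three vague expressions.
def small(city):
    return falling(city.population, 50_000, 500_000)


def rainy(city):
    return rising(city.rainy_days, 50, 150)


def near(city):
    return falling(city.distance_km, 50, 300)


def rank(cities, *predicates):
    """Score each city by the minimum membership over all predicates
    (one common choice of fuzzy conjunction, assumed here) and return
    (score, name) pairs sorted from best to worst match."""
    scored = [(min(p(c) for p in predicates), c.name) for c in cities]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Made-up records, not real data.
CITIES = [
    City("Alpha", 40_000, 160, 30),
    City("Beta", 275_000, 100, 175),
    City("Gamma", 900_000, 20, 500),
]

if __name__ == "__main__":
    for score, name in rank(CITIES, rainy, near):
        print(f"{name}: {score:.2f}")
```

Calling `rank(CITIES, small)` corresponds to a query like "small cities", and `rank(CITIES, rainy, near)` to "rainy cities near" some reference city. A different fuzzy logic would replace `min` with another conjunction operator.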
official report [PDF]
director's cut [PDF]
slides used at the viva [PDF]
slides used at the seminar [PDF]
slides used at 23C3 [PDF]
video of the presentation at 23C3 [M4V]
audio of the presentation at 23C3 [MP3]
(with U. Bodenhofer) Syntax-Driven Analysis of Context-Free Languages with Respect to Fuzzy Relational Semantics.   Proceedings of the 15th IEEE International Conference on Fuzzy Systems, pages 9647-9654, Vancouver, July 2006. best session paper award. • Technical Report 0601, Software Competence Center Hagenberg, July 2006. • Technical Report UCAM-CL-TR-663, University of Cambridge, Computer Laboratory, March 2006.
(with S. Katzenbeisser) Content-Aware Steganography: About Lazy Prisoners and Narrow-Minded Wardens.   Proceedings of the 8th Information Hiding Conference, volume 4437 of the Lecture Notes in Computer Science, Springer Verlag, 2007. • Technical Report FKI-252-05, Technische Universität München, Institut für Informatik AI/Cognition Group, December 2005.
(with S. Katzenbeisser) Towards Human Interactive Proofs in the Text-Domain.   Kan Zhang and Yuliang Zheng (eds.), Proceedings of the 7th Information Security Conference, volume 3225 of the Lecture Notes in Computer Science, pages 257-267, Springer Verlag, September 2004.
Towards Linguistic Steganography: A Systematic Investigation of Approaches, Systems, and Issues.   final year thesis submitted to the University of Derby in partial fulfillment of the degree requirements for the Bachelor of Science, April 2004. • talk at the 21st Chaos Communication Congress, Berlin, December 2004.
Some Experimental Results on Feed-Forward Networks for Text Classification.   coursework submitted to the University of Derby in partial fulfillment of the degree requirements for the Bachelor of Science, May 2004.
Ethical Lessons Learned from Computer Science.   ACM Crossroads, 10(3):23-28, February 2004. • reprinted as ACM Crossroads, 14(4):17-21, June 2008.
A Summary of Traditional Approaches to Natural Language Processing.   Diplomarbeit submitted to the Höhere Technische Bundeslehranstalt Leonding in partial fulfillment of the degree requirements for the Diplom HTL Ingenieur, May 2003.