science
Monte Carlo Semantics: Robust Inference and Logical Pattern Processing with Natural Language Text.   thesis submitted to the University of Cambridge in partial fulfillment of the degree requirements for the Doctor of Philosophy, July 2010.
A Proposal on Evaluation Measures for RTE.   Proceedings of the 2009 Workshop on Applied Textual Inference (TextInfer). August 2009. Singapore.
McPIET at RTE-4: Robust Inference and Logical Pattern Processing Based on Integrated Deep and Shallow Semantics.   Proceedings of the Text Analysis Conference (TAC '08). November 2008. Gaithersburg, MD.
The Monte Carlo Pseudo Inference Engine for Text (McPIET) addresses the RTE problem within a new theoretic framework for robust inference and logical pattern processing based on integrated deep and shallow semantics.
In this report we outline, in some detail, this new theoretic framework, and we will use it to shed some light on the informativity and robustness characteristics for the extreme cases of deep and shallow processing. Unsurprisingly, it will turn out that there is a tradeoff between informativity and robustness.
We will be able to characterize an important new notion of a degree of validity, and provide some evidence to suggest that this concept plays a crucial role in the robustness of shallow inference. At the same time our framework still supports informationally rich semantic representations and background theories, which play the central role in the informativity of deep inference.
Within our new theory we can then pose, from a completely new perspective, the problem of deep/shallow integration, and also propose a solution to it, which we will call Monte Carlo Semantics.
TAC preproceedings paper [PDF]
slides used at TAC [PDF]
Some Notes on the Economics and Evaluation of Automatic Retrieval and Filtering of Communication Goods.   unpublished manuscript, November 2007. • talk presented at the Software Competence Center Hagenberg, Hagenberg, September 2007.
A Comprehensive Bibliography of Linguistic Steganography.   Edward J. Delp and Ping Wah Wong (eds.), Proceedings of the SPIE International Conference on Security, Steganography, and Watermarking of Multimedia Contents, volume 6505, January 2007.
Closed Domain Question Answering Using Fuzzy Semantics.   thesis submitted to the University of Cambridge in partial fulfillment of the degree requirements for the Master of Philosophy, July 2006. • talk at the Cambridge NLIP Group Friday Seminar, Cambridge, November 2006. • talk at the 23rd Chaos Communication Congress, Berlin, December 2006.
(with U. Bodenhofer) Syntax-Driven Analysis of Context-Free Languages with Respect to Fuzzy Relational Semantics.   Proceedings of the 15th IEEE International Conference on Fuzzy Systems, pages 9647-9654, Vancouver, July 2006. best session paper award. • Technical Report 0601, Software Competence Center Hagenberg, July 2006. • Technical Report UCAM-CL-TR-663, University of Cambridge, Computer Laboratory, March 2006.
(with S. Katzenbeisser) Content-Aware Steganography: About Lazy Prisoners and Narrow-Minded Wardens.   Proceedings of the 8th Information Hiding Conference, volume 4437 of the Lecture Notes in Computer Science, Springer Verlag, 2007. • Technical Report FKI-252-05, Technische Universität München, Institut für Informatik AI/Cognition Group, December 2005.
(with S. Katzenbeisser) Towards Human Interactive Proofs in the Text-Domain.   Kan Zhang and Yuliang Zheng (eds.), Proceedings of the 7th Information Security Conference, volume 3225 of the Lecture Notes in Computer Science, pages 257-267, Springer Verlag, September 2004.
Towards Linguistic Steganography: A Systematic Investigation of Approaches, Systems, and Issues.   final year thesis submitted to the University of Derby in partial fulfillment of the degree requirements for the Bachelor of Science, April 2004. • talk at the 21st Chaos Communication Congress, Berlin, December 2004.
Some Experimental Results on Feed-Forward Networks for Text Classification.   coursework submitted to the University of Derby in partial fulfillment of the degree requirements for the Bachelor of Science, May 2004.
Ethical Lessons Learned from Computer Science.   ACM Crossroads, 10(3):23-28, February 2004. • reprinted as ACM Crossroads, 14(4):17-21, June 2008.
A Summary of Traditional Approaches to Natural Language Processing.   Diplomarbeit submitted to the Höhere Technische Bundeslehranstalt Leonding in partial fulfillment of the degree requirements for the Diplom HTL Ingenieur, May 2003.