Stefan Müller » Publications » An Integrated Archictecture

An Integrated Archictecture for Shallow and Deep Processing

Authors: Berthold Crysmann, Anette Frank, Bernd Kiefer, Stefan Müller, Günter Neumann, Jakub Piskorski, Ulrich Schäfer, Melanie Siegel, Hans Uszkoreit and Feiyu Xu

Subject Areas:

Title: An Integrated Archictecture for Shallow and Deep Processing Systems

In 40th Annual Meeting of the Association for Computational Linguistics. Proceedings of the Conference.

We present a flexible architecture for the integration of shallow and deep NLP components which is aimed at flexible combination of different language technologies for a range of practical current and future applications. In particular, we describe the integration of a high-level HPSG parsing system with different high-performance shallow components, ranging from named entity recognition to chunk parsing and shallow clause recognition. The NLP components enrich a representation of natural language text with layers of new XML meta-information using a single shared data structure, called the text chart. We describe details of the integration methods, and show how information extraction and language checking applications for real-world German text benefit from a deep grammatical analysis.