Department of Computational Linguistics at the University of Erlangen
Natural Language Engineering
The process of acquisition, conversion, and processing of linguistic on-line
resources, such as corpora and electronic dictionaries using methods like
probabilistic tagging, parsing, and various kinds of heuristics with the help
of programs like Emacs, Unix text processing tools (lex, yacc, egrep, ...).
Such engineering work is inevitable to make linguistics a proper
science on a sound empirical basis.
Natural language engineering has to face questions about the actual representation of text (Unicode, SGML, databases) that are not of linguistic interest, but nevertheless matter very much in practice.
Important areas include tagset organization, encoding standards, robust parsing of real text, semi-automatic lexicon acquisition, machine-aided translation, multilingual corpora, and automatic extraction of special-domain terminology and collocations from corpora.
Links
Linguistic Research and Engineering 1/2
Linguistic Research and Engineering 2/2
Language Engineering 1
Language Engineering 2
Language Engineering 3
Jochen Leidner, 1998-04-29