Applications of parallel corpora to the (平行语料库的应用程序).pdf
文本预览下载声明
Applications of parallel corpora to the development of monolingual language
technologies
Kevin P. Scannell
Department of Mathematics and Computer Science
Saint Louis University
Saint Louis, Missouri, USA
Abstract In 2 we describe the development and contents of
an aligned parallel corpus of English and Irish texts,
We describe the development of an
and in 3 we discuss an application of (a subset of)
aligned parallel corpus of English and
the corpus; namely, a program to standardize Irish
Irish texts, along with a simple application
documents written in either prestandard or dialect
enabling the standardization of documents
forms of the language.
written in prestandard or dialect forms of
A number of other projects have involved minor-
Irish.
ity/global language parallel corpora in one way or
another, including (but certainly not limited to) the
1 Introduction
following:
Parallel corpora have many applications in natural
language processing for problems involving multi- The EMILLE project (McEnery et al., 2000)
ple l
显示全部