文档详情

语言学(语料库).ppt

发布:2017-05-25约1.18万字共45页下载文档
文本预览下载声明
Good afternoon Types of corpora General corpora: useful for language research as a whole. A general reference corpus is not a collection of material from different specialist areas – technical, dialectal, juvenile, etc. It is a collection of material which is broadly homogeneous, but which is gathered from a variety of sources, so that the individuality of a source is obscured, unless the researcher isolates a particular text. What uses can we make of corpora? ? Frequency information Why do we need frequency information? Corpora can tell us how frequently certain language items or structures are used. This kind of information is useful when we try to select what to teach, select what to focus on, and decide what senses to focus on in the language classroom. Context and co-text information Context: situational environments Co-text: linguistic environments Sometimes it is very difficult to tell the differences of two words or phrases which have similar meaning. However, if we look at the context and co-text in which they are used, the difference becomes clear. Collocation and phraseology 措辞information It is usually difficult for second and foreign language learners to learn which words are frequently used together. So, this kind of information helps a lot. e.g. make effort or take effort? A search in corpus will do the job. Pragmatics 语用学information Information from corpora can tell us how language is actually used in communication. Computational linguists - e.g. to see if their grammatical parsing programs will work on naturally occurring language Language learning researchers - e.g. to see how often learners with a particular L1 get something wrong Writers of teaching syllabuses - e.g. to see how often the passive really occurs in academic English Writers of teaching course materials - e.g. to incorporate authentic examples into their material
显示全部
相似文档