登入
選單
返回
Google圖書搜尋
Designing and Evaluating Language Corpora
Jesse Egbert
Douglas Biber
Bethany Gray
其他書名
A Practical Framework for Corpus Representativeness
出版
Cambridge University Press
, 2022-04-14
主題
Computers / Artificial Intelligence / Natural Language Processing
Language Arts & Disciplines / Linguistics / General
Language Arts & Disciplines / Linguistics / Historical & Comparative
Language Arts & Disciplines / Library & Information Science / General
Reference / Research
ISBN
1107151384
9781107151383
URL
http://books.google.com.hk/books?id=zBNkEAAAQBAJ&hl=&source=gbs_api
EBook
SAMPLE
註釋
Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' - highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.