登入
選單
返回
Google圖書搜尋
Corpus Linguistics and Linguistically Annotated Corpora
Sandra Kuebler
Heike Zinsmeister
出版
Bloomsbury Publishing
, 2014-12-18
主題
Language Arts & Disciplines / Linguistics / General
Computers / Artificial Intelligence / Natural Language Processing
ISBN
1441119809
9781441119803
URL
http://books.google.com.hk/books?id=B-pQBQAAQBAJ&hl=&source=gbs_api
EBook
SAMPLE
註釋
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field.
Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.