Course Information
Course Title
Corpus Linguistics (語料庫語言學)
Semester
99-2 
Intended For
College of Liberal Arts, Graduate Institute of Linguistics
Instructor
謝舒凱 
Course Number
LING7424 
Course Identifier
142 M1060 
Section
 
Credits
Full/Half Year
Half year
Required/Elective
Elective
Class Time
Tuesday, periods 3, 4, 5 (10:20~13:10)
Location
樂學館 304
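Remarks
This course is taught in Chinese and uses English-language textbooks. Areas: Applied Linguistics / Computational Linguistics.
Enrollment limit: 12 students
Ceiba Course Web Page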
http://ceiba.ntu.edu.tw/992corpus_ling 
Course Syllabus
Course Overview

This course offers an introduction to corpus linguistics for graduate students, including the necessary tools and techniques for doing corpus-based studies and annotation projects. Major existing corpora will be examined for a better understanding of their linguistic uses. Some specific goals of this course are to enable students to build, annotate and search corpora, and to perform a quantitative analysis of a linguistic phenomenon. Students will also gain hands-on experience in these areas by working on a specific topic of their own interest.

Course Objectives
The specific goals of this course are to enable students to build, annotate and search corpora, and to perform a quantitative analysis of a linguistic phenomenon. Students will also gain hands-on experience in these areas by working on a specific topic of their own interest.
Course Requirements
Week Topic Lab
1 Orientation
2 Corpus and corpus-based linguistics
3 Corpora and tools
4 Corpus annotation (I)
5 Corpus annotation (II)
6 Corpus-based analysis (I)
8 Corpus-based analysis (II)
9 Corpus-based analysis (III)
10 Applications of corpus-based analysis (I)
11 Applications of corpus-based analysis (II)
12 Applications of corpus-based analysis (III)
13 Methodological issues and term project proposal
14 Basic corpus statistics (I)
15 Basic corpus statistics (II) (Guest lecture)
16 New trends in corpus linguistics
17 Class project workshop (I)
18 Class project workshop (II)

• You are expected to complete weekly readings and assignments,
and to participate actively in both class and online activities. There
are no required printed textbooks for this course; instead, a Course
Reader compiled by me will be available soon. Additional lecture notes
and slides will be made available in class or via the "Resources"
section of the course web page.

• Each class will be divided into two sessions: (i) lecture, presentation
and discussion (two hours), and (ii) lab (one hour). For homework
assignments, we will be using the LOPE lab server. Instructions
on how to use the lab server will be made available before the
first assignment. In the lab session, you will learn to construct and
search text databases using Unix and other corpus tools, to write
simple programs to manipulate large natural language corpora, and to
perform quantitative analysis of linguistic data.
• You are expected to submit a term project involving original,
corpus-based research. It can be a local corpus construction project
(e.g., a sociolinguistic corpus) to be held at NTU, a program (with
documentation) that performs some substantial corpus processing task,
or a research paper on a corpus-based topic. You will be asked to turn
in a proposal early in the semester so that I can help you design and
execute your project. Proposed topics will also be discussed in class
later.
• There will be (in principle) no exams. Grading is based on: Homework
(20%); Discussion and Presentation (40%); Term paper (40%).
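
To give a rough sense of the "simple programs" mentioned in the lab description above, here is a minimal sketch in Python of two basic corpus operations: a word-frequency count and a keyword-in-context (KWIC) search. It is not taken from the course materials. The toy corpus, the crude tokenizer, and the function names `tokenize` and `kwic` are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def kwic(tokens, keyword, window=3):
    """Return keyword-in-context lines: up to `window` tokens on each side of every hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


# Toy corpus, invented for illustration only.
corpus = (
    "A corpus is a collection of texts. "
    "Linguists search a corpus to study how words are used. "
    "An annotated corpus adds linguistic information to the texts."
)

tokens = tokenize(corpus)
freq = Counter(tokens)  # word-frequency table

print(freq.most_common(3))
for line in kwic(tokens, "corpus"):
    print(line)
```

A real corpus would need a proper tokenizer and would be read from files rather than a string, but the two steps (count, then search in context) stay the same.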

 
Expected Weekly Study Hours Outside Class
 
Office Hours
 
References
- Baker, Paul (ed.). 2009. Contemporary Corpus Linguistics. Continuum.
- Garside, Roger et al. 1997. Corpus Annotation: Linguistic Information
from Computer Text Corpora. Addison Wesley.
- Gries, Stefan. 2008. Quantitative Corpus Linguistics with R: A Practical
Introduction. Routledge.
- Wynne, Martin (ed.). Developing Linguistic Corpora: A Guide to
Good Practice. Available at
http://www.ahds.ac.uk/creating/guides/linguistic-corpora/index.htm
 
Required Reading
 
Grading
(for reference only)
   
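Course Schedule
Week
Date
Topic
Week 2
3/01  Introduction
Week 10
4/26  Invited in-class lecture by Professor 蘇新春; students not enrolled in the course are welcome to attend
Week 11
5/03  CLSW2011 conference: class meets at 霖澤館 to attend James Pustejovsky's talk http://lope.linguistics.ntu.edu.tw/clsw2011/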