Course Syllabus

Course Information

Course title	Machine Learning and Econometrics
Semester	110-2
Designated for	COLLEGE OF SOCIAL SCIENCES GRADUATE INSTITUTE OF ECONOMICS
Instructor	JUI-CHUNG YANG
Curriculum Number	ECON7225
Curriculum Identity Number	323EM1790
Class
Credits	3.0
Full/Half Yr.	Half
Required/ Elective	Elective
Time	Thursday 2,3,4(9:10~12:10)
Remarks	Restriction: MA students and beyond OR Restriction: Ph. D students The upper limit of the number of students: 20.

Course introduction video
Table of Core Capabilities and Curriculum Planning	Table of Core Capabilities and Curriculum Planning
Course Syllabus
Please respect the intellectual property rights of others and do not copy any of the course information without permission
Course Description	This is a course about machine learning and econometrics for graduate students. Machine learning uses data to predict some variable as a function of other variables, and econometrics use statistical methods for prediction, inference, and causal modeling of economic relationships (Varian, 2014). In the past few decades, computer-based technology allows people to collect enormous data sets, orders of magnitude larger than those that classic statistical theory was designed to deal with. Huge data demands new methodology, and the demand is being met by a burst of innovative computer-based statistical algorithms in estimation and prediction. While algorithmic invention is a more free-wheeling and adventurous enterprise, inference is playing catch-up as it strives to assess the accuracy, good or bad, of some hot new algorithmic methodology. In other words, the inference / algorithm race is a tortoise-and-hare affair (Efron and Hastie, 2016).
Course Objective	In this course, we as a group of well-trained econometricians are going to learn some machine learning together. On the estimation / prediction side, we will talk about techniques such as Lasso, neural networks, and random forests. On the inference / causal modeling side, we will discuss some ideas about inference after using these machine learning techniques, and inference methods based on / inspired by the computer-based statistical algorithms, such as generalized random forests and double/debiased machine learning. More importantly, we will talked about the inference questions which haven't been answered, and you are more than welcomed to join us and work on these unanswered questions as your research projects.
Course Requirement	1. A midterm exam (40%) by April 14. 2. A final presentation (60%) by May 26. - The final project can be an individual work or a teamwork with <= 2 team members in total. - It can be a theoretical study, a Monde Carlo experiment or an empirical work. - You should talk to me about your work by April 21.
Student Workload (expected study time outside of class per week)
Office Hours
Designated reading	James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani (2021). An Introduction to Statistical Learning: With Applications in R, Second Edition. Springer.
References	Efron, Bradley and Trevor Hastie (2016). Computer Age Statistical Inference: Algorithms, Evidence and Data Science. Cambridge.
Grading

Progress

Week	Date	Topic
No data