Department of Computer Science and Software Engineering
计算机科学与软件工程系
MODULE CODE MODULE LEADER DEPARTMENT Phone
CSE315
2019/20 SEMESTER 1
BACHELOR DEGREE – YEAR 4 & MASTER DEGREE
MACHINE LEARNING
ASSIGNMENT 2
START DATE: 4 NOVEMBER
DUE DATE: 9 DECEMBER by 23:59
GENERAL INSTRUCTIONS
1. The assignment 2 consists of 3 tasks, which together are worth 100 marks. The
figure in [] denotes the number of marks available for that task or part of the
task.
2. The programs can be written in Matlab or Python in a professional style (e.g.,
appropriate comments, indents, meaningful variable names).
3. Please make sure that your programs are executable and your PDF report is
produced by computer.
4. No plagiarism, which will be strictly checked by Turnitin.
5. You are supposed to finish all the tasks.
6. Partial marks may be awarded depending on the degree of completeness and
clarity of your answers.
Department of Computer Science and Software Engineering
计算机科学与软件工程系
Task 1 Classification [35 Marks]
1. Using the MNIST database available at http://yann.lecun.com/exdb/mnist/, select two
classification algorithms and implement them to achieve a high accuracy (more than 90%).
[15 marks]
2. Describe the techniques, including data preparation, feature reduction, and training tricks in
your classification algorithms. [10 marks]
3. Analyse some other techniques that can be applied in your classification algorithms to
improve your model’s performance such as accuracy, efficiency, and storage. [10 marks]
Task 2 Support vector machine (SVM) and principal component analysis (PCA) [40
Marks]
1. Using the iris.data available on the ICE, select the training dataset and validation dataset, and
implement the SVM algorithm (based on public packages or libraries) to classify the types of
iris (achieving an accuracy of 90%). [10 marks]
2. Using the same iris.data, reduce the dimension of features applying the PCA and extract the
first, second, and third principal components. [10 marks]
3. Using the extracted first, second, and third principal component in Task 2.2, respectively, to
train a SVM model to classify the types of iris and compare their accuracies. [10 marks]
4. For each combination of the extracted first, second, and third principal components, train a
SVM model to classify the types of iris, and then compare their accuracies. [10 marks]
Task 3 Clustering [25 Marks]
1. Specify the number of clustering (e.g., 1, 2, 3, 4, 5, and 6) and implement the k-means
algorithm (based on public packages or libraries) to classify the iris.data. [10 marks]
2. Apply the PCA to reduce the dimension of features and combine the first, second, and third
principal components to implement the k-means algorithm (based on public packages or
libraries) to classify the iris.data. [15 marks]
END OF ASSIGNMENT 2
版权所有:编程辅导网 2021 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。