7CCSMPNN Assignment: Ensemble Methods

Department of Engineering/Informatics, King’s College London

Pattern Recognition, Neural Networks and Deep Learning

(7CCSMPNN)

Assignment: Ensemble Methods

This coursework is assessed. A type-written report needs to be submitted online

through KEATS by the deadline specified on the module’s KEATS webpage. This coursework considers your “own created” dataset to investigate the classification performance

using the techniques of Bagging and Boosting. Some simple “weak” classifiers will be

designed and combined to achieve an improved classification performance for a two-class

classification problem.

Q1. Create a non-linearly separable dataset consisting of at least 20 two-dimensional

dataset. Each data is characterised by two points x1 ∈ [[10, 10] and x2 ∈ [[10, 10]

and associated with a class y ∈ {−1, +1}. List the data in a table in a format as

shown in Table 1 where the first column is for the data points of class “\1” and the

second column is for the data points of class “+1”. (20 Marks)

Class 1: y = =1 Class 2: y = +1

(x1, x2) (x1, x2) ... ... (x1, x2) (x1, x2)

Table 1: Dataset of two classes.

Q2. Plot the dataset (x axis is x1 and y axis is x2) and show that the dataset is nonlinearly separable. Represent class “\1” and class “+1” using “×” and ‘◦”, respectively. Explain why your dataset is non-linearly separable. Hint: the Matlab built-in

function plot can be used. (20 Marks)

Q3. Design Bagging classifiers consisting of 3, 4 and 5 weak classifiers using the steps

shown in Appendix 1. A linear classifier should be used as the weak classifier. Explain and show the design of the hyperplanes of weak classifiers. List the parameters

of the design hyperplanes.

After designing the weak classifiers, apply the designed weak classifiers and bagging classifier to all the samples in Table 1. Present the classification results in

a table as shown in Table 2. The columns “Weak classifier 1” to ‘Weak classifier

n” list the output class ({−1, +1}) of the corresponding weak classifiers. The column “Overall classifier” list the output class ({−1, +1}) of the bagging classifier.

The last row lists the classification accuracy in percentage for all classifiers, i.e.,

Number of correct classifications

Total number of samples ×100%. Explain how to determine the class (for each weak

classifier and over all classifier) using one test sample. You will have 3 tables (for 3, 4

and 5 weak classifiers) for this question. Comment on the results (in terms of classi-

fication performance when different number of weak classifiers are used). (30 Marks)

Data Weak classifier 1 · · · Weak classifier n Overall classifier

(x1, x2), y {−1, +1} · · · {−1, +1} {−1, +1} ... ... ... ... (x1, x2), y {−1, +1} · · · {−1, +1} {−1, +1}

Accuracy (%) · · ·

Table 2: Classification results using Bagging technique combining n weak classifiers. The

first row “Data” are the samples (both classes 1 and 2) in Table 1.

Q4. Design a Boosting classifier consisting of 3 weak classifiers using the steps shown

in Appendix 2. A linear classifier should be used as a weak classifier. Explain and

show the design of the hyperplanes of weak classifiers. List the parameters of the

design hyperplanes. After designing the weak classifiers, apply the designed weak

classifiers and boosting classifier to all the samples in Table 1. Present the classifi-

cation results in a table as shown in Table 2. Explain how to determine the class

(for each weak classifier and boosting classifier) using one test sample. Comment

on the results of the overall classifier in terms of classification performance when

comparing with the 1st, 2nd and the 3rd weak classifiers, and with the bagging

classifier with 3-weak classifiers in Q.3.

(30 Marks)

Appendix 1: Bagging1

Q1. Start with dataset D.

Q2. Generate M dataset D1, D2, . . ., DM. • Each distribution is created by drawing n0 < n samples from D with replacement. • Some samples can appear more than once while others do not appear at all.

Q3. Learn weak classifier for each dataset.

• weak classifiers fi(x) for dataset Di, i = 1, 2, . . ., M.

Q4. Combine all weak classifiers using a majority voting scheme.

• ffinal(x) = sgnXMi=1

1M fi(x)

Appendix 2: Boosting 2 • Dataset D with n patterns

• Training procedure:

1Details can be found in Section “Bagging” in the Lecture notes

2Details can be found in Section “Boosting” in the Lecture notes

Step 1: Randomly select a set of n1 ≤ n patterns (without replacement) from D

to create dataset D1. Train a weak classifier C1 using D1 (C1 should have at

least 50% classification accuracy).

Step 2: Create an “informative” dataset D2 (n2 ≤ n) from D of which roughly

half of the patterns should be correctly classified by C1 and the rest is wrongly

classified. Train a weak classifier C2 using D2.

Step 3: Create an “informative” dataset D3 from D of which the patterns are not

well classified by C1 and C2 (C1 and C2 disagree). Train a weak classifier C3

using D3. • The final decision of classification is based on the votes of the weak classifiers.

– e.g., by the first two weak classifiers if they agree, and by the third weak

classifier if the first two disagree.

Marking: The learning outcomes of this assignment are that student understands the

fundamental principle and concepts of ensemble methods (Bagging and Boosting); is able

to design weak classifies; knows the way to form Bagging/Boosting classifier and knows

how to determine the classification of test samples with the designed Bagging/Boosting

classifiers. The assessment will look into the knowledge and understanding on the topic.

When answering the questions, show/explain/describe clearly the steps/design/concepts

with reference to the equations/theory/algorithms (stated in the lecture slides). When

making comments, provide statements with the support from the results obtained.

Purposes of Assignment: This assignment goes through the detailed steps of handling

classification problem using ensemble methods. You have full control of the datasets

which is not the case in real scenarios but allows you to achieve the design easier with a

small size of dataset. Through this assignment, it helps you to make clear the concept,

working principle, theory, classification of samples, design procedure and multiple-class

classification techniques using ensemble methods.

联系我们

QQ：99515681
邮箱：99515681@qq.com
工作时间：8:00-21:00
微信：codinghelp

热点文章

辅导 comm2000 creating socia... 2026-01-08
讲解 isen1000 – introductio... 2026-01-08
讲解 cme213 radix sort讲解 c... 2026-01-08
辅导 csc370 database讲解迭代 2026-01-08
讲解 ca2401 a list of colleg... 2026-01-08
讲解 nfe2140 midi scale play... 2026-01-08
讲解 ca2401 the universal li... 2026-01-08
辅导 engg7302 advanced compu... 2026-01-08
辅导 comp331/557 – class te... 2026-01-08
讲解 soft2412 comp9412 exam辅... 2026-01-08
讲解 scenario # 1 honesty讲解... 2026-01-08
讲解 002499 accounting infor... 2026-01-08
讲解 comp9313 2021t3 project... 2026-01-08
讲解 stat1201 analysis of sc... 2026-01-08
辅导 stat5611: statistical m... 2026-01-08
辅导 mth2010-mth2015 - multi... 2026-01-08
辅导 eeet2387 switched mode ... 2026-01-08
讲解 an online payment servi... 2026-01-08
讲解 textfilter辅导 r语言 2026-01-08
讲解 rutgers ece 434 linux o... 2026-01-08

热点标签

msinm014/msing014/msing014b

联系我们 - QQ: 99515681 微信：codinghelp

程序辅导网！