Description
You must submit two separate copies (one Word file and one PDF file) using the Assignment Template on
Blackboard via the allocated folder. These files must not be in compressed format.
• It is your responsibility to check and make sure that you have uploaded both the correct files.
• Zero mark will be given if you try to bypass the SafeAssign (e.g. misspell words, remove spaces between
words, hide characters, use different character sets, convert text into image or languages other than English
or any kind of manipulation).
• Email submission will not be accepted.
• You are advised to make your work clear and well-presented. This includes filling your information on the cover
page.
• You must use this template, failing which will result in zero mark.
• You MUST show all your work, and text must not be converted into an image, unless specified otherwise by
the question.
• Late submission will result in ZERO mark.
• The work should be your own, copying from students or other resources will result in ZERO mark.
• Use Times New Roman font for all your answers.
Project
Deadline: Monday 03/12/2025 @ 23:59
[Total Mark is 14]
Student Details:
CRN: 11797
Name:
Name:
Name:
ID:
ID:
ID:
Instructions:
• You must submit two separate copies (one Word file and one PDF file) using the Assignment Template on
Blackboard via the allocated folder. These files must not be in compressed format.
• It is your responsibility to check and make sure that you have uploaded both the correct files.
• Zero mark will be given if you try to bypass the SafeAssign (e.g. misspell words, remove spaces between
words, hide characters, use different character sets, convert text into image or languages other than English
or any kind of manipulation).
• Email submission will not be accepted.
• You are advised to make your work clear and well-presented. This includes filling your information on the cover
page.
• You must use this template, failing which will result in zero mark.
• You MUST show all your work, and text must not be converted into an image, unless specified otherwise by
the question.
• Late submission will result in ZERO mark.
• The work should be your own, copying from students or other resources will result in ZERO mark.
• Use Times New Roman font for all your answers.
Restricted – مقيد
Pg. 01
خطأ! استخدم عالمة التبويب “الصفحة الرئيسية” لتطبيق
Heading 1.على النص الذي ترغب في أن يظهر هنا
Project Overview and Instructions
Project Overview:
This project provides a comprehensive overview of data mining, covering both
fundamental and advanced techniques, while offering real-world insights into customer
behavior and business decision-making in e-commerce.
The primary objective of this project is to familiarize students with performing data
mining tasks on a dataset. The dataset will include various data, such as age, workclass,
education, sex, and occupation, among others. You will go through the different stages
of data mining to extract meaningful insights from the data.
Instructions:
•
In this project, each group of students will select a dataset and apply the steps
mentioned in each question provided below. Each group is required to utilize a
distinct data mining algorithm and technique to produce the results.
•
Students will work in groups of 2-3 students and compile their work into a single
report for submission along with other project materials. The project must be
submitted on Blackboard by one designated member of the group (the project
leader).
Dataset:
•
Restricted – مقيد
You must use the following dataset from the UCI Machine Learning
Repository. Adult – UCI Machine Learning Repository
Pg. 02
Learning
Outcome(s):1
خطأ! استخدم عالمة التبويب “الصفحة الرئيسية” لتطبيق
Heading 1.على النص الذي ترغب في أن يظهر هنا
Question One
3 Marks
Data Preparation and Loading
Explain different
data mining tasks,
problems and the
algorithms most
appropriate for
addressing them.
Task:
•
Download the dataset and convert it to .arff or use .csv directly.
•
Load the dataset into WEKA.
•
Describe the attributes and the target variable (income).
Expected Answer:
Restricted – مقيد
•
Screenshot of the dataset loaded in WEKA.
•
Description of attributes (e.g., age, education, occupation, etc.).
•
Explanation of the classification task: predicting whether income >50K or
50K).
•
Identification of outliers.
•
Confusion matrix for classification algorithms
o
Restricted – مقيد
Pg. 05
خطأ! استخدم عالمة التبويب “الصفحة الرئيسية” لتطبيق
Heading 1.على النص الذي ترغب في أن يظهر هنا
Question Four
3 Mark
Learning
Outcome(s): 4
Evaluation and Analysis
Evaluate the
Task:
performance of
•
Evaluate classification performance using accuracy, precision, recall, and F1score.
•
Discuss strengths and weaknesses of each algorithm used.
data mining
algorithms.
Expected Answer:
Restricted – مقيد
•
Performance metrics table.
•
Comparison of algorithms.
•
Justification of which algorithm performed best and why.
Purchase answer to see full
attachment