Project for Data Mining

Daniel Kevins

0 comments

I have attached a solution form
for the past years but it is divided into two phases.. I want my project to be
solved in the same way.

1-Select one of the datasets from UCI Machine Learning
Repositories

OR use your own dataset if available.

2-The dataset may follow the following requirements (Data
description)

Number of instances: between 300-500

Number of attributes: between 10 to 15

3-Prepare a CSV OR ARFF format data file of the data.

4-Load the dataset in Weka or if you prefer to use any python
tools such as Google Collaborate Lab https://research.google.com/colaboratory/

5-Do a basic preprocessing to the dataset such data cleaning /
Data reduction /Normalization (if exist or required) etc.

6-Based on dataset run Apriori algorithm with different support
and confidence values. Discuss the generated rules.

7-Based on your dataset selection, apply SVM data mining
algorithm.

Provide the result and accuracies of the algorithms and discuss
it with supporting screenshots.

8-Based on your selection dataset, Apply the Decision tree data
mining algorithm with different parameter setting and record the accuracies.

9-Apply the K-mean algorithm on the dataset (for k=4) and study
the clusters formed.

About the Author

Follow me