The project topics should be related to analyzing healthcare data in order to solve clinical or administrative problems. You will be required to submit this project in 3 portions:
- Report
- Presentation
- Source code, results, materials, findings
You need to find data that is suitable to solve your problem. You may want to use only newer
data (after 2004 or newer). You need to decide based on variable availability
and sample size. Most importantly the data need to be on the right level of aggregation.
The project report should include, but be not limited to:
- problem description
- data selection
- data pre-processing
- selection DM methods
- application of methods
- analysis of results
- review of available literature and related work
- conclusions and description of impact on healthcare
- As well as a brief description of what you learned in the project.
Direct application of existing software to publically available datasets is not sufficient. The projects must demonstrate significant efforts in data manipulation, processing, and mining. Projects must also illustrate understanding of applied techniques as well as the healthcare problem addressed.


0 comments