Wrapper feature selection
Feature Selection in Parkinson’s Disease (PPMI Data)
Use the 06_PPMI_ClassificationValidationData_Short dataset setting ResearchGroup
as class variable.
- Delete irrelevant columns (e.g.
X
, FID_IID
) and select only the PD and Control cases
- Properly convert the variables types
- Apply
Boruta
to train a model, try different parameters (e.g., try different pValue
, maxRuns
). What are the differences?
- Summarize and visualize the results
- Apply Random Feature Elimination (RFE) and tune the model size
- Evaluate the
Boruta
model performance by comparing with REF
- Report and compare the variables selected by both methods. How much overlap is there in the selected variables?
SOCR Resource Visitor
number