Team Name: Liad & Yuval
IDs:
ID #1: 300822954
ID #2: 311434047
-
Applied-Machine-Learning-Final-Project-300822954-31143407.ipynb - which is a notebook contains all relevant code. Allow to run the experiment and contains other sections of the project such as statistical hypothesis test, meta-learning-model, graphs, etc. Was tested on Colab.
-
flow.py - experiement flow controller + hyper-parameter search grid.
-
nested_cv.py - nested cross-validation infrastructure.
-
utils.py
-
rotboost.py - RotBoost implementation
-
rotation_forest.py - Rotation-Forest implementation
-
results/experiments_results.csv - our experiments results as depicted in the file 'Final-Project-Report-300822954-31143407.docx'
-
classification_datasets-20200531T065549Z-001.zip - holds all csv datasets zipped
-
/classification_datasets - holds all csv datasets
-
ClassificationAllMetaFeatures.csv - meta learning model's meta features
-
dataset-metadata.csv - holds datasets metadata such as binary/multiclass type, number of attributes etc.
-
results/feature_importance_weight.png - meta learning model feature importance type 'weight'
-
results/feature_importance_gain.png - meta learning model feature importance type 'gain'
-
results/feature_importance_cover.png - meta learning model feature importance type 'cover'
-
results/shap_summary_plot.png
-
results/shap_training_set_prediction.png
-
results/hyperparameters-search-space.png
-
Applied-Machine-Learning-Final-Project-300822954-31143407.docx
-
Notebook - Applied-Machine-Learning-Final-Project-300822954-31143407.ipynb:
Note: please use colab.
- Download notebook Applied-Machine-Learning-Final-Project-300822954-31143407.ipynb
- Download classification_datasets-20200531T065549Z-001.zip
- Download results/experiments_results.csv
- Download ClassificationAllMetaFeatures.csv
- Open Applied-Machine-Learning-Final-Project-300822954-31143407.ipynb notebook in colab
- The following files are needed to be uploaded to the notebook under '/content' dir (which is the default):
- classification_datasets-20200531T065549Z-001.zip
- results/experiments_results.csv (this file is not required if run_nested_cross_validation flag is set to 'True')
- ClassificationAllMetaFeatures.csv
- Run all cells
There are two options to run the notebook:
- Skip the nested-cross-validation section and run only the processing results section + meta learning section. Default option.
- Run the whole exercise (nested-cross-validation section + processing the results section + meta learning section). In order to that, please set in the settings cell (#2 cell) ‘run_nested_cross_validation = True’. Please note that this might take a while.
-
Directly: run flow.py file. Note: this option is less favorable and allow to run the experiment only, does not support other sections of the project, such as statistical hypothesis test, meta-learning- model, graphs, etc.