Reviewing PR #539 which tests dataset uploading, I found four things which we could improve to make dataset uploading nicer for the user:
- There should be no necessity to provide the format as this will always be
'arff'.
- The current example encodes the class label as a
REAL, although it should actually be a categorical.
- We could add an example for uploading a list of lists, which could then also contain strings and categories.
- The function
create_dataset should be imported into the openml.datasets namespace.
@ArlindKadra @glemaitre
Reviewing PR #539 which tests dataset uploading, I found four things which we could improve to make dataset uploading nicer for the user:
'arff'.REAL, although it should actually be a categorical.create_datasetshould be imported into theopenml.datasetsnamespace.@ArlindKadra @glemaitre