1
0
Fork 0
This commit is contained in:
Alejandro Moreo Fernandez 2021-08-26 15:52:35 +02:00
parent ab746eed8d
commit d6abc7ac85
3 changed files with 26 additions and 1 deletions

View File

@ -1,5 +1,22 @@
Classifiers
- Classifiers binary, single-label, OneVsRest or MultiOutput:
- LR
- LinearSVC (?)
- Classifiers natively multi-label:
- from scikit-multilearn (x11)
-
Protocols:
- NPP
- APP (for each class)
Things to test:
- MultiChain for classification, MultiChain for regression?
- MultiChain for classification, MultiChain for regression...
- Reimplement stacking with sklearn.ensemble.StackingClassifier? No parece facil.
- Independent classifiers + independent quantifiers
- Stacking + independent quantifiers
@ -12,3 +29,10 @@ Things to test:
- Model Selection for specific protocols?
TODO:
- decide methods
- decide classifiers binary
- decide classifiers multi-label
- decide quantifiers naive
- decide quantifiers multi-label
- decide datasets

View File

@ -64,6 +64,7 @@ class MultilabelledCollection:
return MultilabelledCollection(documents, labels)
def train_test_split(self, train_prop=0.6, random_state=None):
raise ValueError('use the scikit-multilearn implementation')
tr_docs, te_docs, tr_labels, te_labels = \
train_test_split(self.instances, self.labels, train_size=train_prop, random_state=random_state)
return MultilabelledCollection(tr_docs, tr_labels), MultilabelledCollection(te_docs, te_labels)