QuaPy/quapy/evaluation.py

import quapy as qp
from typing import Union, Callable, Iterable
from data import LabelledCollection
from method.base import BaseQuantifier
from util import temp_seed
import numpy as np
from joblib import Parallel, delayed
from tqdm import tqdm
import error


def artificial_sampling_prediction(
        model: BaseQuantifier,
        test: LabelledCollection,
        sample_size,
        n_prevpoints=210,
        n_repetitions=1,
        n_jobs=-1,
        random_seed=42,
        verbose=True
):
    """
    Performs the predictions for all samples generated according to the artificial sampling protocol.
    :param model: the model in charge of generating the class prevalence estimations
    :param test: the test set on which to perform arificial sampling
    :param sample_size: the size of the samples
    :param n_prevpoints: the number of different prevalences to sample
    :param n_repetitions: the number of repetitions for each prevalence
    :param n_jobs: number of jobs to be run in parallel
    :param random_seed: allows to replicate the samplings. The seed is local to the method and does not affect
    any other random process.
    :param verbose: if True, shows a progress bar
    :return: two ndarrays of shape (m,n) with m the number of samples (n_prevpoints*n_repetitions) and n the
     number of classes. The first one contains the true prevalences for the samples generated while the second one
     contains the the prevalence estimations
    """

    with temp_seed(random_seed):
        indexes = list(test.artificial_sampling_index_generator(sample_size, n_prevpoints, n_repetitions))

    if isinstance(model, qp.method.aggregative.AggregativeQuantifier):
        # print('\tinstance of aggregative-quantifier')
        quantification_func = model.aggregate
        if isinstance(model, qp.method.aggregative.AggregativeProbabilisticQuantifier):
            # print('\t\tinstance of probabilitstic-aggregative-quantifier')
            preclassified_instances = model.posterior_probabilities(test.instances)
        else:
            # print('\t\tinstance of hard-aggregative-quantifier')
            preclassified_instances = model.classify(test.instances)
        test = LabelledCollection(preclassified_instances, test.labels)
    else:
        # print('\t\tinstance of base-quantifier')
        quantification_func = model.quantify

    def _predict_prevalences(index):
        sample = test.sampling_from_index(index)
        true_prevalence = sample.prevalence()
        estim_prevalence = quantification_func(sample.instances)
        return true_prevalence, estim_prevalence

    pbar = tqdm(indexes, desc='[artificial sampling protocol] predicting') if verbose else indexes
    results = Parallel(n_jobs=n_jobs)(
        delayed(_predict_prevalences)(index) for index in pbar
    )

    true_prevalences, estim_prevalences = zip(*results)
    true_prevalences = np.asarray(true_prevalences)
    estim_prevalences = np.asarray(estim_prevalences)

    return true_prevalences, estim_prevalences


def evaluate(model: BaseQuantifier, test_samples:Iterable[LabelledCollection], err:Union[str, Callable], n_jobs:int=-1):
    if isinstance(err, str):
        err = getattr(error, err)
    assert err.__name__ in error.QUANTIFICATION_ERROR_NAMES, \
        f'error={err} does not seem to be a quantification error'
    scores = Parallel(n_jobs=n_jobs)(
        delayed(_delayed_eval)(model, Ti, err) for Ti in test_samples
    )
    return np.mean(scores)


def _delayed_eval(model:BaseQuantifier, test:LabelledCollection, error:Callable):
    prev_estim = model.quantify(test.instances)
    prev_true  = test.prevalence()
    return error(prev_true, prev_estim)
import bug fixed 2021-01-12 09:35:49 +01:00			`import quapy as qp`
added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gallego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00			`from typing import Union, Callable, Iterable`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`from data import LabelledCollection`
			`from method.base import BaseQuantifier`
uniform sampling added if *prevs is empty 2020-12-17 18:17:17 +01:00			`from util import temp_seed`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`import numpy as np`
			`from joblib import Parallel, delayed`
			`from tqdm import tqdm`
added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gallego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00			`import error`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00

			`def artificial_sampling_prediction(`
			`model: BaseQuantifier,`
			`test: LabelledCollection,`
			`sample_size,`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`n_prevpoints=210,`
			`n_repetitions=1,`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`n_jobs=-1,`
added model selection for quantification 2020-12-22 17:43:23 +01:00			`random_seed=42,`
			`verbose=True`
			`):`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`"""`
			`Performs the predictions for all samples generated according to the artificial sampling protocol.`
			`:param model: the model in charge of generating the class prevalence estimations`
			`:param test: the test set on which to perform arificial sampling`
			`:param sample_size: the size of the samples`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`:param n_prevpoints: the number of different prevalences to sample`
			`:param n_repetitions: the number of repetitions for each prevalence`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`:param n_jobs: number of jobs to be run in parallel`
			`:param random_seed: allows to replicate the samplings. The seed is local to the method and does not affect`
			`any other random process.`
added model selection for quantification 2020-12-22 17:43:23 +01:00			`:param verbose: if True, shows a progress bar`
plot functionality added 2021-01-07 17:58:48 +01:00			`:return: two ndarrays of shape (m,n) with m the number of samples (n_prevpoints*n_repetitions) and n the`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`number of classes. The first one contains the true prevalences for the samples generated while the second one`
plot functionality added 2021-01-07 17:58:48 +01:00			`contains the the prevalence estimations`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`"""`

			`with temp_seed(random_seed):`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`indexes = list(test.artificial_sampling_index_generator(sample_size, n_prevpoints, n_repetitions))`

import bug fixed 2021-01-12 09:35:49 +01:00			`if isinstance(model, qp.method.aggregative.AggregativeQuantifier):`
testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00			`# print('\tinstance of aggregative-quantifier')`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`quantification_func = model.aggregate`
import bug fixed 2021-01-12 09:35:49 +01:00			`if isinstance(model, qp.method.aggregative.AggregativeProbabilisticQuantifier):`
testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00			`# print('\t\tinstance of probabilitstic-aggregative-quantifier')`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`preclassified_instances = model.posterior_probabilities(test.instances)`
			`else:`
testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00			`# print('\t\tinstance of hard-aggregative-quantifier')`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`preclassified_instances = model.classify(test.instances)`
			`test = LabelledCollection(preclassified_instances, test.labels)`
			`else:`
testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00			`# print('\t\tinstance of base-quantifier')`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`quantification_func = model.quantify`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00
			`def _predict_prevalences(index):`
			`sample = test.sampling_from_index(index)`
			`true_prevalence = sample.prevalence()`
refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00			`estim_prevalence = quantification_func(sample.instances)`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`return true_prevalence, estim_prevalence`

added model selection for quantification 2020-12-22 17:43:23 +01:00			`pbar = tqdm(indexes, desc='[artificial sampling protocol] predicting') if verbose else indexes`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`results = Parallel(n_jobs=n_jobs)(`
added model selection for quantification 2020-12-22 17:43:23 +01:00			`delayed(_predict_prevalences)(index) for index in pbar`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00			`)`

			`true_prevalences, estim_prevalences = zip(*results)`
			`true_prevalences = np.asarray(true_prevalences)`
			`estim_prevalences = np.asarray(estim_prevalences)`

			`return true_prevalences, estim_prevalences`


added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gallego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00			`def evaluate(model: BaseQuantifier, test_samples:Iterable[LabelledCollection], err:Union[str, Callable], n_jobs:int=-1):`
			`if isinstance(err, str):`
			`err = getattr(error, err)`
			`assert err.__name__ in error.QUANTIFICATION_ERROR_NAMES, \`
			`f'error={err} does not seem to be a quantification error'`
			`scores = Parallel(n_jobs=n_jobs)(`
			`delayed(_delayed_eval)(model, Ti, err) for Ti in test_samples`
			`)`
			`return np.mean(scores)`

evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00
added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gallego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00			`def _delayed_eval(model:BaseQuantifier, test:LabelledCollection, error:Callable):`
			`prev_estim = model.quantify(test.instances)`
			`prev_true = test.prevalence()`
			`return error(prev_true, prev_estim)`
evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00