predict

A module for making predictions with PyTorch models using DataLoaders.

Classes:

  • PredictWrapper

    A wrapper to predict the output of a model on a dataset loaded into a torch DataLoader.

PredictWrapper

PredictWrapper(
    model: Module,
    dataloader: DataLoader,
    loss_dict: Optional[dict[str, Any]] = None,
    device: device | None = None,
)

A wrapper to predict the output of a model on a dataset loaded into a torch DataLoader.

It also provides functionality to measure the model's performance.

Parameters:

  • model (Module) –

    The PyTorch model to make predictions with

  • dataloader (DataLoader) –

    DataLoader containing the evaluation data

  • loss_dict (Optional[dict[str, Any]], default: None ) –

    Optional dictionary of loss functions

  • device (device | None, default: None ) –

    The device to run the model on

Methods:

  • compute_loss –

    Compute the loss.

  • compute_metric –

    Wrapper to compute the performance metric.

  • compute_metrics –

    Wrapper to compute the performance metrics.

  • compute_other_metric –

    Compute the performance metric.

  • handle_predictions –

    Convert the model outputs from the forward pass into a dictionary of tensors keyed like y.

  • predict –

    Get the model predictions.

Source code in src/stimulus/learner/predict.py
def __init__(
    self,
    model: nn.Module,
    dataloader: DataLoader,
    loss_dict: Optional[dict[str, Any]] = None,
    device: torch.device | None = None,
) -> None:
    """Initialize the PredictWrapper.

    Args:
        model: The PyTorch model to make predictions with
        dataloader: DataLoader containing the evaluation data
        loss_dict: Optional dictionary of loss functions
        device: The device to run the model on
    """
    if device is None:
        self.device = torch.device("cpu")
    else:
        self.device = device

    try:
        self.model = model.to(self.device)
    except RuntimeError as e:
        if self.device.type in ["cuda", "mps"]:
            logger.warning(f"Failed to move model to {self.device.type.upper()}: {e}")
            logger.warning("Falling back to CPU")
            self.device = torch.device("cpu")
            self.model = model.to(self.device)
        else:
            raise

    self.dataloader = dataloader
    self.loss_dict = loss_dict

    try:
        self.model.eval()
    except RuntimeError as e:
        logger.warning("Not able to run model.eval: %s", str(e))
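
A minimal usage sketch, assuming a model whose forward accepts the DataLoader's input keys as keyword arguments and a DataLoader yielding (x, y, meta) batches of dictionaries, as this wrapper expects. DictDataset, DemoModel, and the "features"/"target" keys are hypothetical stand-ins, not part of the library.

import torch
from torch import nn
from torch.utils.data import DataLoader, Dataset

from stimulus.learner.predict import PredictWrapper


class DictDataset(Dataset):
    """Toy dataset yielding (x, y, meta) with dict-of-tensor x and y."""

    def __init__(self, n: int = 32) -> None:
        self.inputs = torch.randn(n, 4)
        self.targets = torch.randn(n, 1)

    def __len__(self) -> int:
        return len(self.inputs)

    def __getitem__(self, idx: int):
        return {"features": self.inputs[idx]}, {"target": self.targets[idx]}, {}


class DemoModel(nn.Module):
    """Hypothetical model following the conventions PredictWrapper assumes."""

    def __init__(self) -> None:
        super().__init__()
        self.linear = nn.Linear(4, 1)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.linear(features)

    def batch(self, x: dict, y: dict, loss_fn: nn.Module) -> tuple:
        # Convention assumed by compute_loss: return a tuple whose first
        # element is the loss tensor for this batch. The loss_fn keyword is
        # an assumption; the real contract is whatever the model defines.
        return (loss_fn(self(**x), y["target"]),)


loader = DataLoader(DictDataset(), batch_size=8)
wrapper = PredictWrapper(DemoModel(), loader, loss_dict={"loss_fn": nn.MSELoss()})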

compute_loss

compute_loss() -> float

Compute the loss.

The current implementation computes the loss for each batch and then averages the results. TODO: we could summarize the loss across batches differently, or there may sometimes even be more than one loss.

Source code in src/stimulus/learner/predict.py
def compute_loss(self) -> float:
    """Compute the loss.

    The current implementation computes the loss for each batch and then averages the results.
    TODO: we could summarize the loss across batches differently.
    Or there may sometimes even be more than one loss.
    """
    if self.loss_dict is None:
        raise ValueError("Loss function is not provided.")
    loss = 0.0
    with torch.no_grad():
        for x, y, _ in self.dataloader:
            try:
                # Move input tensors to the same device as the model
                device_x = {key: value.to(self.device) for key, value in x.items()}
                device_y = {key: value.to(self.device) for key, value in y.items()}
                # loss_dict is unpacked with **; alternatively the model's `batch` signature could accept **kwargs. To be decided; this approach reads as cleaner and more understandable.
                current_loss = self.model.batch(x=device_x, y=device_y, **self.loss_dict)[0]
            except RuntimeError as e:
                if ("CUDA out of memory" in str(e) and self.device.type == "cuda") or (
                    "MPS backend out of memory" in str(e) and self.device.type == "mps"
                ):
                    logger.warning(f"{self.device.type.upper()} out of memory during loss computation: {e}")
                    logger.warning("Falling back to CPU for this batch")
                    temp_device = torch.device("cpu")
                    # Use CPU for this batch
                    x_cpu = {key: value.to(temp_device) for key, value in x.items()}
                    y_cpu = {key: value.to(temp_device) for key, value in y.items()}
                    # Move model to CPU temporarily
                    model_on_cpu = self.model.to(temp_device)
                    current_loss = model_on_cpu.batch(x=x_cpu, y=y_cpu, **self.loss_dict)[0]
                    # Move model back to original device for next batches
                    try:
                        self.model = self.model.to(self.device)
                    except RuntimeError:
                        logger.warning(f"Failed to move model back to {self.device.type}. Staying on CPU.")
                        self.device = temp_device
                else:
                    raise

            loss += current_loss.item()
    return loss / len(self.dataloader)
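
Continuing the construction sketch above: compute_loss delegates each batch to the model's own batch method, unpacking loss_dict as keyword arguments, and returns the plain average of the per-batch losses.

avg_loss = wrapper.compute_loss()  # float: mean of the per-batch losses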

compute_metric

compute_metric(metric: str = 'loss') -> float

Wrapper to compute the performance metric.

Source code in src/stimulus/learner/predict.py
def compute_metric(self, metric: str = "loss") -> float:
    """Wrapper to compute the performance metric."""
    if metric == "loss":
        return self.compute_loss()
    return self.compute_other_metric(metric)

compute_metrics

compute_metrics(metrics: list[str]) -> dict[str, float]

Wrapper to compute the performance metrics.

Source code in src/stimulus/learner/predict.py
def compute_metrics(self, metrics: list[str]) -> dict[str, float]:
    """Wrapper to compute the performance metrics."""
    return {m: self.compute_metric(m) for m in metrics}
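
Continuing the sketch above: "loss" is routed to compute_loss, while any other metric name goes through compute_other_metric and must be one the Performance helper understands.

scores = wrapper.compute_metrics(["loss"])
# {"loss": <average loss as a float>}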

compute_other_metric

compute_other_metric(metric: str) -> float

Compute the performance metric.

TODO: currently we compute the average performance metric across the target y keys, but in the future we may want something different.

Source code in src/stimulus/learner/predict.py
def compute_other_metric(self, metric: str) -> float:
    """Compute the performance metric.

    # TODO: currently we compute the average performance metric across target y, but in the future we may want something different
    """
    if not hasattr(self, "predictions") or not hasattr(self, "labels"):
        predictions, labels = self.predict(return_labels=True)
        self.predictions = predictions
        self.labels = labels

    # Explicitly type the labels and predictions as dictionaries with str keys
    labels_dict: dict[str, Tensor] = self.labels if isinstance(self.labels, dict) else {}
    predictions_dict: dict[str, Tensor] = self.predictions if isinstance(self.predictions, dict) else {}

    return sum(
        Performance(labels=labels_dict[k], predictions=predictions_dict[k], metric=metric).val for k in labels_dict
    ) / len(labels_dict)
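
Note that the first metric call runs predict(return_labels=True) once and caches the results on self.predictions and self.labels, so later calls reuse them. The metric name below is a hypothetical placeholder; the supported names are defined by the Performance helper.

score = wrapper.compute_other_metric("some_metric")  # name is an assumption; must be supported by Performance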

handle_predictions

handle_predictions(
    predictions: Any, y: dict[str, Tensor]
) -> dict[str, Tensor]

Convert the model outputs from the forward pass into a dictionary of tensors keyed like y.

Source code in src/stimulus/learner/predict.py
def handle_predictions(self, predictions: Any, y: dict[str, Tensor]) -> dict[str, Tensor]:
    """Handle the model outputs from forward pass, into a dictionary of tensors, just like y."""
    if len(y) == 1:
        return {next(iter(y.keys())): predictions}
    return dict(zip(y.keys(), predictions))
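
Continuing the sketch above, an illustration of the two branches: with a single-target y the raw output tensor is keyed by y's only key, while with multiple targets the model output is assumed to be a sequence with one tensor per target, zipped with y's keys in order.

y_single = {"target": torch.zeros(8, 1)}
wrapper.handle_predictions(torch.ones(8, 1), y_single)
# -> {"target": tensor of ones with shape (8, 1)}

y_multi = {"a": torch.zeros(8), "b": torch.zeros(8)}
wrapper.handle_predictions((torch.ones(8), torch.ones(8)), y_multi)
# -> {"a": tensor of ones, "b": tensor of ones}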

predict

predict(*, return_labels: bool = False) -> Union[
    dict[str, Tensor],
    tuple[dict[str, Tensor], dict[str, Tensor]],
]

Get the model predictions.

It runs a forward pass on the model for each batch, collects the predictions, and concatenates them across batches. Since the returned current_predictions are tensors computed for a single batch, the final predictions are obtained by concatenating them.

At the end it returns predictions as a dictionary of tensors with the same keys as y.

If return_labels is True, the labels will be returned as well, also as a dictionary of tensors.

Parameters:

  • return_labels (bool, default: False ) –

    Whether to also return the labels

Returns:

  • Union[dict[str, Tensor], tuple[dict[str, Tensor], dict[str, Tensor]]] –

    Dictionary of predictions, and optionally the labels

Source code in src/stimulus/learner/predict.py
def predict(
    self,
    *,
    return_labels: bool = False,
) -> Union[dict[str, Tensor], tuple[dict[str, Tensor], dict[str, Tensor]]]:
    """Get the model predictions.

    It runs a forward pass on the model for each batch,
    collects the predictions, and concatenates them across batches.
    Since the returned `current_predictions` are tensors computed for a single batch,
    the final `predictions` are obtained by concatenating them.

    At the end it returns `predictions` as a dictionary of tensors with the same keys as `y`.

    If return_labels is True, the `labels` will be returned as well, also as a dictionary of tensors.

    Args:
        return_labels: Whether to also return the labels

    Returns:
        Dictionary of predictions, and optionally labels
    """
    # create empty dictionaries with the column names
    first_batch = next(iter(self.dataloader))
    keys = first_batch[1].keys()
    predictions: dict[str, list[Tensor]] = {k: [] for k in keys}
    labels: dict[str, list[Tensor]] = {k: [] for k in keys}

    # get the predictions (and labels) for each batch
    with torch.no_grad():
        for x, y, _ in self.dataloader:
            try:
                x_device = {key: value.to(self.device) for key, value in x.items()}
                current_predictions = self.model(**x_device).detach().cpu()
                current_predictions = self.handle_predictions(current_predictions, y)
            except RuntimeError as e:
                if ("CUDA out of memory" in str(e) and self.device.type == "cuda") or (
                    "MPS backend out of memory" in str(e) and self.device.type == "mps"
                ):
                    logger.warning(f"{self.device.type.upper()} out of memory during prediction: {e}")
                    logger.warning("Falling back to CPU for this batch")
                    temp_device = torch.device("cpu")
                    # Use CPU for this batch
                    x_cpu = {key: value.to(temp_device) for key, value in x.items()}
                    # Move model to CPU temporarily
                    model_on_cpu = self.model.to(temp_device)
                    current_predictions = model_on_cpu(**x_cpu).detach().cpu()
                    current_predictions = self.handle_predictions(current_predictions, y)
                    # Move model back to original device for next batches
                    try:
                        self.model = self.model.to(self.device)
                    except RuntimeError:
                        logger.warning(f"Failed to move model back to {self.device.type}. Staying on CPU.")
                        self.device = temp_device
                else:
                    raise

            for k in keys:
                # A batch may contain a single element, in which case torch.cat would fail on 0-d tensors; ensure_at_least_1d guards against this.
                predictions[k].append(ensure_at_least_1d(current_predictions[k]))
                if return_labels:
                    labels[k].append(ensure_at_least_1d(y[k]))

    # return the predictions (and labels) as a dictionary of tensors for the entire dataset.
    if not return_labels:
        return {k: torch.cat(v) for k, v in predictions.items()}
    return {k: torch.cat(v) for k, v in predictions.items()}, {k: torch.cat(v) for k, v in labels.items()}
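
Continuing the sketch above:

preds = wrapper.predict()
# {"target": Tensor of shape (32, 1)}, all batches concatenated

preds, labels = wrapper.predict(return_labels=True)
assert preds["target"].shape == labels["target"].shape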