Skip to content

📍 Interpreto Roadmap

Welcome to the roadmap of Interpreto 🪄. This document outlines the planned features and improvements for upcoming releases.


🧭 Upcoming Features

1. Evaluation Metrics for Attribution Methods

We plan to integrate a set of standardized evaluation metrics to assess the quality and reliability of attribution methods. We'll start by adding the insertion/deletion metrics then AOPC Comprehensiveness and AOPC Sufficiency (DeYoung et al., 2020).

2. Integration of ConSim metric

We will implement ConSim, a robust evaluation metric measuring how well concept‑based explanations enable automated simulators (LLMs) to mimic the predictions of the original model. ConSim goes beyond evaluating just the concept space or importance alignment—it captures end-to-end interpretability effectiveness by testing whether the conveyed concepts actually allow a “simulator” to reproduce model outputs.


🙌 Contribute

Want to help shape the future of Interpreto? Check out our contributing guide and feel free to open an issue or pull request!