Roadmap

Contributions

As an open-source project, we welcome community contributions to seismometer. Ultimately, we want this project to be a community-led effort to codify guidelines for ensuring the equitable and informed use of machine learning and AI tools in the healthcare space. Contributions to this project can be as simple as fixing typos or small bugs, or more complex contributions that, with the support and scrutiny of our development team, guide the overall direction of the project.

See also

Contributing to Seismometer for our Contributor’s Guide.

Use Cases

Templates

As of v0.1.0, seismometer supports evaluating model performance using standardized evaluation criteria binary classifier models. We plan to add support in the near future for other types of machine learning models, such as multiple classifier models. Similarly, we plan to add support for validating generative AI models. These enhancements will include changes to the underlying seismometer tooling, as well as adding new templates for validating generative models.

Workflows and Pre-Live Evaluation

As of v0.1.0, seismometer has limited support for evaluating model performance pre-live. We are planning to add support for workflow simulation (e.g., estimating the number of alerts that would be shown to end-users for a clinical model that predicts an adverse event, or the amount of time saved per clinician for a generative model that drafts messages to patients) based on particular thresholds. We will also add tools to identify thresholds for models based on pre-live data and operational goals. These tools are intended to help identify when a machine learning or artificial intelligence solutions will improve current workflows and also improve efficiency when integrating models into a workflow.

Comparing to Baselines

We plan to add support for comparing model performance to baseline statistics (e.g., statistics from a model train or from model performance at a separate site). These are intended to verify that the model feature or target drift are not adversely affecting the model’s performance after it goes live.

Functional changes

Visualizations

As seismometer grows, we will add support for new types of visualizations. Our initial focus is to improve visualizations for interventions and outcomes stratified by sensitive groups, but we plan to extend our model performance visualizations as well.

Data Layer

As of v0.1.0, seismometer supports reading data from parquet files, which contain data type information and performance improvements that standard CSV data does not have. We plan to add support for more file formats (alongside metadata files that will describe the data types) as well as support for reading data directly from a database (e.g., through an ODBC connection).

Code Structure

As we gear up for seismometer’s version 1.0 release, we will be working on finalizing the internals of the tool. Prior to the version 1.0 release, we expect there will be breaking changes to APIs, after which the goal will be to minimize those breaking changes and only release breaking changes alongside a major version bump.

See also

Changelog for our Release Notes and any breaking changes.

Technical Plans

Some changes are known to have potential impact on using the package, such as changing supported python versions. As these arise, issues will be created using the label ‘compatibility’ for ease of discovery. This is distinct from large functional enhancements using a request for change process.