seismometer.configuration.model.DataUsage¶

pydantic model seismometer.configuration.model.DataUsage¶

The definitions of data to use in a notebook run.

This structure defines what data to load and how to use it. The entity_id and context_id are the possible keys for joining events and predictions, and are also used to summarize predictions to a single entity. Primary output and target are the score and target used in default performance analysis.

The features and scores list, when defined, limit the loading of data from the predictions file to only those inputs and outputs (plus primary_score and cohort attributes). The events similarly limits the event types that are merged into the working dataframe and available to analyses.

Show Entity Relationship Diagram

$digraph "Entity Relationship Diagram created by erdantic" { graph [fontcolor=gray66, fontname="Times New Roman,Times,Liberation Serif,serif", fontsize=9, nodesep=0.5, rankdir=LR, ranksep=1.5 ]; node [fontname="Times New Roman,Times,Liberation Serif,serif", fontsize=14, label="\N", shape=plain ]; edge [dir=both]; "seismometer.configuration.model.Cohort" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>Cohort</b></td></tr><tr><td>source</td><td port="source">str</td></tr><tr><td>display_name</td><td port="display_name">str</td></tr><tr><td>splits</td><td port="splits">Optional[list[Any]]</td></tr></table>>, tooltip="seismometer.configuration.model.Cohort

The definition of an expected cohort attribute.

This structure defines \ a cohort attribute that should be available for selection in a notebook.
For a categorical data, the splits should all be existing \ values and the list limits the selections available.
For numerical data, the splits should be the inner boundaries of bucketing; \ with a high and low being added
outside theses values.
"]; "seismometer.configuration.model.DataUsage" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>DataUsage</b></td></tr><tr><td>entity_id</td><td port="entity_id">str</td></tr><tr><td>context_id</td><td port="context_id">Optional[str]</td></tr><tr><td>primary_output</td><td port="primary_output">str</td></tr><tr><td>primary_target</td><td port="primary_target">str</td></tr><tr><td>predict_time</td><td port="predict_time">str</td></tr><tr><td>comparison_time</td><td port="comparison_time">str</td></tr><tr><td>event_table</td><td port="event_table">EventTableMap</td></tr><tr><td>outputs</td><td port="outputs">list[str]</td></tr><tr><td>cohorts</td><td port="cohorts">list[Cohort]</td></tr><tr><td>features</td><td port="features">list[str]</td></tr><tr><td>events</td><td port="events">list[Event]</td></tr><tr><td>metrics</td><td port="metrics">list[Metric]</td></tr><tr><td>censor_min_count</td><td port="censor_min_count">int</td></tr></table>>, tooltip="seismometer.configuration.model.DataUsage

The definitions of data to use in a notebook run.

This structure defines \ what data to load and how to use it.
The entity_id and context_id are the possible keys for joining events and predictions, \ and are also used to
summarize predictions to a single entity.
Primary output and target are the score and target used in \ default performance analysis.

The features and scores list, when defined, limit the loading of data from the predictions \ file to only those
inputs and outputs (plus primary_score and cohort attributes).
The events similarly limits the event \ types that are merged into the working dataframe and available to analyses.
"]; "seismometer.configuration.model.DataUsage":cohorts:e -> "seismometer.configuration.model.Cohort":_root:w [arrowhead=crownone, arrowtail=nonenone]; "seismometer.configuration.model.Event" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>Event</b></td></tr><tr><td>type</td><td port="type">str</td></tr><tr><td>group_keys</td><td port="group_keys">Optional[Union[str, list[str]]]</td></tr><tr><td>source</td><td port="source">list[str]</td></tr><tr><td>display_name</td><td port="display_name">str</td></tr><tr><td>window_hr</td><td port="window_hr">Optional[float]</td></tr><tr><td>offset_hr</td><td port="offset_hr">float</td></tr><tr><td>impute_val</td><td port="impute_val">Optional[Any]</td></tr><tr><td>usage</td><td port="usage">Optional[str]</td></tr><tr><td>aggregation_method</td><td port="aggregation_method">Optional[Literal['min', 'max', 'first', 'last']]</td></tr><tr><td>merge_strategy</td><td port="merge_strategy">Optional[Literal['first', 'last', 'nearest', 'forward', 'count']]</td></tr></table>>, tooltip="seismometer.configuration.model.Event

The definition of an event.

This structure defines an event and which predictions \ are relevant to it.
If a window is specified:

- the offset_hr defines the upper bound of the window relative to the \ event time,
 has default value of 0 (event time),
- the window_hr defines the size of the window looking backwards from \ the offset_hr.

If an event is present but the prediction is not in the window, the predictions are ignored for the event \ type.
If multiple events are present then the closest one is used.

The impute_val is used as the value for the event \ if no event is present.

Usage is used for context when selecting events, such as analyzing performance of the model with \ respect to a
target or when comparing an expected intervention to a monitored outcome.
"]; "seismometer.configuration.model.DataUsage":events:e -> "seismometer.configuration.model.Event":_root:w [arrowhead=crownone, arrowtail=nonenone]; "seismometer.configuration.model.EventTableMap" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>EventTableMap</b></td></tr><tr><td>type</td><td port="type">str</td></tr><tr><td>time</td><td port="time">str</td></tr><tr><td>value</td><td port="value">str</td></tr></table>>, tooltip="seismometer.configuration.model.EventTableMap

Override mapping of event table columns.
"]; "seismometer.configuration.model.DataUsage":event_table:e -> "seismometer.configuration.model.EventTableMap":_root:w [arrowhead=noneteetee, arrowtail=nonenone]; "seismometer.configuration.model.Metric" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>Metric</b></td></tr><tr><td>type</td><td port="type">str</td></tr><tr><td>group_keys</td><td port="group_keys">Optional[Union[str, list[str]]]</td></tr><tr><td>source</td><td port="source">str</td></tr><tr><td>display_name</td><td port="display_name">str</td></tr><tr><td>metric_details</td><td port="metric_details">MetricDetails</td></tr></table>>, tooltip="seismometer.configuration.model.Metric

A class to store information associated with a metric.
"]; "seismometer.configuration.model.DataUsage":metrics:e -> "seismometer.configuration.model.Metric":_root:w [arrowhead=crownone, arrowtail=nonenone]; "seismometer.configuration.model.MetricDetails" [label=<<table border="0" cellborder="1" cellspacing="0"><tr><td port="_root" colspan="2"><b>MetricDetails</b></td></tr><tr><td>min</td><td port="min">Optional[Union[float, int]]</td></tr><tr><td>max</td><td port="max">Optional[Union[float, int]]</td></tr><tr><td>handle_na</td><td port="handle_na">Optional[str]</td></tr><tr><td>values</td><td port="values">Optional[list[Union[float, int, str]]]</td></tr></table>>, tooltip="seismometer.configuration.model.MetricDetails

Contains details about a metric.
"]; "seismometer.configuration.model.Metric":metric_details:e -> "seismometer.configuration.model.MetricDetails":_root:w [arrowhead=noneteetee, arrowtail=nonenone]; }$