A Marimo notebook that uses S3 inventory data to display statistics about content in MIT Libraries' preservation storage. The analysis logic depends on Archivematica package and MIT Libraries file naming conventions. After launching the notebook, see the "About this notebook" section for more information.
As this dashboard relies on S3 Inventory data, authenticate with Dev1 credentials before editing.
The recommended approach for developing a Marimo notebook is to use the Marimo GUI editor:
make edit-notebookThis Confluence page describes the update workflow for this notebook. All updates are done in coordination with and fully reviewed by the DataEng team.
To run tests:
make testTo run linting:
make lintS3_INVENTORY_LOCATIONS=# A comma-delimited list of S3 URIs containing S3 Inventory symlink.txt files.# add optional env vars here...Often, notebooks are served as an "app". This is the default mode for marimo-launcher.
uv run marimo run --sandbox --headless --no-token notebook.pyAccess to an AWS-hosted notebook is managed through secrets stored in LastPass. For details on accessing this hosted notebook, visit this Confluence page.