TokTagger
TokTagger is a web-based platform for curating labeled datasets from tokamak diagnostics. It lets users browse shots, inspect signals and images, apply consistent labels, and manage annotations in one place. The Python API and React UI support local or team workflows, making it straightforward to create datasets for downstream analysis and machine-learning models.
It currently supports the following features:
- Data Browsing: Explore tokamak shots, signals, and images through an intuitive interface.
- Annotation Tools: Apply consistent labels to signals and images using a customizable tagging system.
- ML Models: Train and infer from ML models within the UI.
- Dataset Management: Organize and manage annotations in a central repository.
- Extensible API: A Python API for integrating with existing workflows and tools.
Installation
To run the application locally:
Install via pip
To install the package via pip (or similarly via Poetry or uv package managers):
python -m venv .venv
source .venv/bin/activate
GIT_LFS_SKIP_SMUDGE=1 pip install git+ssh://git@github.com/ukaea/toktagger.git
Install as a uv tool
Alternatively, it can be installed as a tool using uv:
GIT_LFS_SKIP_SMUDGE=1 uv tool install --lfs --python 3.12.6 git+ssh://git@github.com/ukaea/toktagger.git`
Quick Start
To get started, run:
toktagger
This will start a local instance of the application running at http://localhost:8002.