Readme

The library is organized around three core modules:

qdisc.dataset: Handles the loading and management of quantum datasets, including their associated types and tuning parameters used throughout the pipeline.
qdisc.vae: Implements the Variational Auto-Encoder (VAE) model and a Trainer wrapper that learns a low-dimensional representation of the quantum phase space.
qdisc.sr: Provides Symbolic Regression (SR) methods to derive compact, interpretable analytical descriptors for the clusters identified in the learned representation.

The library also includes two supporting submodules:

qdisc.nn: Contains neural network architectures used as building blocks within the pipeline.
qdisc.clustering: Implements a clustering algorithm that can be applied to the learned representation to identify and label distinct phases.

The interplay between these modules across the full pipeline is illustrated below:

The qdisc.vae module combines data from qdisc.dataset with neural network architectures from qdisc.nn to learn a low-dimensional representation of the phase space. The qdisc.sr module then operates on this representation — and, for some approaches, on the VAE itself — to produce compact, interpretable analytical expressions.

Other Formats