Stochastic Tools System Design Description

This template follows INL template TEM-140, "IT System Design Description."

commentnote

This document serves as an addendum to Framework System Design Description and captures information for Software Design Description (SDD) specific to the Stochastic Tools module.

Introduction

The MOOSE Stochastic Tools module is based on the MOOSE framework and thus inherits the unique features and base characteristics of the framework, as outlined in the Framework System Design Description. Specific details unique to the module are outlined in this document.

System Purpose

The Software Design Description provided here is description of each object in the system. The pluggable architecture of the underlying framework of the Stochastic Tools module makes MOOSE and MOOSE-based applications straightforward to develop as each piece of end-user (developer) code that goes into the system follows a well-defined interface for the underlying systems that those object plug into. These descriptions are provided through developer-supplied "markdown" files that are required for all new objects that are developed as part of the Stochastic Tools module. More information about the design documentation for MOOSE-based applications and like the Stochastic Tools module can be found in Documenting MOOSE.

System Scope

The MOOSE Stochastic Tools module builds on the existing framework MultiApps system and provides several additional systems to address its purpose. To address the efficient evaluation of multiphysics models, the MultiApps system is extended to include a "batch" mode of execution. This mode provides a memory efficient way of building MooseApp instances and reusing them for different perturbations of input parameters. This is useful for stochastic simulations with many, many samples to be run without holding them in memory or re-initializing potentially costly data allocation. The additional systems that the Stochastic Tools module provides include: Distributions, Samplers, Trainers, and Surrogates. Distributions are functions defining the uncertainty of input parameters and provides an interface for computing probability density, cumulative probability, and quantiles. Samplers define the sampling scheme of the stochastic analysis, whether it be random or deterministic. Trainers are objects that build meta-models meant to be used as surrogates or reduced-order models of the multiphysics model. The resulting reduced model is able to be saved in a meta-data file and reloaded for future use. Surrogates take the data created from a Trainer and provide functionality to evaluate the model. The module also provides capabilities for computing quantities related to basic uncertainty quantification and sensitivity analysis using the framework's Reporters system.

Dependencies and Limitations

The Stochastic Tools module inherits the software dependencies of the MOOSE framework, with no additional dependencies. The design of this module is motivated by the needs of its client applications.

Definitions and Acronyms

This section defines, or provides the definition of, all terms and acronyms required to properly understand this specification.

Definitions

  • Pull (Merge) Request: A proposed change to the software (e.g. usually a code change, but may also include documentation, requirements, design, and/or testing).

  • Baseline: A specification or product (e.g., project plan, maintenance and operations (M&O) plan, requirements, or design) that has been formally reviewed and agreed upon, that thereafter serves as the basis for use and further development, and that can be changed only by using an approved change control process (NQA-1, 2009).

  • Validation: Confirmation, through the provision of objective evidence (e.g., acceptance test), that the requirements for a specific intended use or application have been fulfilled (24765:2010(E), 2010).

  • Verification: (1) The process of: evaluating a system or component to determine whether the products of a given development phase satisfy the conditions imposed at the start of that phase. (2) Formal proof of program correctness (e.g., requirements, design, implementation reviews, system tests) (24765:2010(E), 2010).

Acronyms

AcronymDescription
APIApplication Programming Interface
DOE-NEDepartment of Energy, Nuclear Energy
FEfinite element
HITHierarchical Input Text
HPCHigh Performance Computing
I/OInput/Output
INLIdaho National Laboratory
MOOSEMultiphysics Object Oriented Simulation Environment
MPIMessage Passing Interface
SDDSoftware Design Description

Design Stakeholders and Concerns

Design Stakeholders

Stakeholders for MOOSE include several of the funding sources including Department of Energy, Nuclear Energy (DOE-NE) and the INL. However, Since MOOSE is an open-source project, several universities, companies, and foreign governments have an interest in the development and maintenance of the MOOSE project.

Stakeholder Design Concerns

Concerns from many of the stakeholders are similar. These concerns include correctness, stability, and performance. The mitigation plan for each of these can be addressed. For correctness, Stochastic Tools module development requires either regression or unit testing for all new code added to the repository. The project contains several comparisons against analytical solutions where possible and also other verification methods such as MMS. For stability, the Stochastic Tools module (located within the MOOSE repository) maintains multiple branches to incorporate several layers of testing both internally and for dependent applications. Finally, performance tests are also performed as part of the the normal testing suite to monitor code change impacts to performance.

System Design

The Stochastic Tools module inherits the wide range of pluggable systems from MOOSE. More information regarding MOOSE system design can be found in the framework System Design section. Most of these inherited systems are utilized to build physics models for which stochastic analysis is being performed. Other than these, the inherited core capabilities define the order and frequency of object execution. Furthermore, the MultiApps, Transfers, Controls, and Reporters systems in the framework are utilized to partition the stochastic simulations and manipulate distributed data. The Stochastic Tools implements several other pluggable systems, including: Distributions, Samplers, Trainers, and Surrogates. Distributions and Samplers define the parameter space of the stochastic analysis and Trainers and Surrogates build and utilize the data from the analysis to build reduced-order models. Dependent applications can inherit from these systems to define new methods. Documentation for each object, data structure, and process specific to the module are kept up-to-date alongside the MOOSE documentation. Expected failure modes and error conditions are accounted for via regression testing, and error conditions are noted in object documentation where applicable.

System Structure

The architecture of the Stochastic Tools module consists of a core and several pluggable systems. The core of MOOSE consists of a number of key objects responsible for setting up and managing the user-defined objects of a finite element simulation. This core set of objects has limited extendability and exists for every simulation configuration that the module is capable of running.

AuxKernels

AuxScalarKernels

AuxVariables

Controls

Covariance

Distributions

Functions

Likelihood

MultiApps

Outputs

ParameterStudy

Postprocessors

Reporters

Samplers

StochasticTools

Surrogates

Trainers

Transfers

UserObjects

VariableMappings

VectorPostprocessors

The MooseApp is the top-level object used to hold all of the other objects in a simulation. In a normal simulation, a single MooseApp object is created and "run()". This object uses its Factory objects to build user-defined objects which are stored in a series of Warehouse objects and executed. The Finite Element data is stored in the Systems and Assembly object while the domain information (the Mesh) is stored in the Mesh object. A series of threaded loops are used to run parallel calculations on the objects created and stored within the warehouses.

MOOSE's pluggable systems are documented on the MOOSE website, and those for the Stochastic Tools module are on this webpage, accessible through the high-level system links above. Each of these systems has a set of defined polymorphic interfaces and are designed to accomplish a specific task within the simulation. The design of these systems is solid and is managed through agile methods and ticket request system on the MOOSE GitHub repository.

Data Design and Control

At a high level, the system is designed to process Hierarchical Input Text (HIT) input files to construct several objects that will constitute an finite element (FE) simulation. Some of the objects in the simulation may in turn load other file-based resources to complete the simulation. Examples include meshes or data files. The system will then assemble systems of equations and solve them using the libraries of the Code Platform. The system can then output the solution in one or more supported output formats commonly used for visualization.

Human-Machine Interface Design

The Stochastic Tools module is a command-line driven program. All interaction with the Stochastic Tools module is ultimately done through the command line. This is typical for HPC applications that use the MPI interface for running on super computing clusters. Optional GUIs may be used to assist in creating input files and launching executables on the command line.

System Design Interface

All external system interaction is performed either through file Input/Output (I/O) or through local API calls. Neither the Stochastic Tools module, nor the MOOSE framework, nor the other MOOSE modules are designed to interact with any external system directly through remote procedure calls. Any code to code coupling performed using the framework are done directly through API calls either in a static binary or after loading shared libraries.

Security Structure

The Stochastic Tools module does not require any elevated privileges to operate and does not run any stateful services, daemons or other network programs. Distributed runs rely on the MPI library.

Requirements Cross-Reference

  • stochastic_tools: LibtorchDRLControl
  • 10.3.1The system shall be able to read a neural network from a parameter file and use it to control a transient process.

    Specification(s): read-parameters

    Design: LibtorchDRLControl

    Issue(s): #19571

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.3.2The system shall be able to run with a DRL controller without having a neural net initialized in it.

    Specification(s): without-nn

    Design: LibtorchDRLControl

    Issue(s): #19571

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: RandomIC
  • 10.6.1The system shall generate parallel agnostic random initial conditions using a distribution function.

    Specification(s): generate

    Design: RandomIC

    Issue(s): #5567#11901#9710

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.6.2The system shall generate an error the random initial condition is used with both a distribution and min or max value defined.

    Specification(s): test_err_distribution_and_min_max

    Design: RandomIC

    Issue(s): #5567#11901#9710

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • stochastic_tools: SamplerParameterTransfer
  • 10.8.19The system shall be able to transfer parameter(s) to nested sub-applications in
    1. normal multiapp mode;
    2. batch-reset multiapp mode;
    3. batch-restore multiapp mode;

    Specification(s): parameter_transfer/normal, parameter_transfer/batch_reset, parameter_transfer/batch_restore

    Design: SamplerParameterTransfer

    Issue(s): #21233

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.1The system shall include the ability to modify parameters for sub-applications using values from a distribution
    1. on a single processor,
    2. on multiple processors,
    3. and on more processors than samples.

    Specification(s): normal/n1, normal/n2, normal/n3

    Design: SamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.2The system shall include the ability to modify parameters for sub-applications executed in batches using values from a distribution
    1. on a single processor,
    2. on multiple processors, and
    3. on multiple processors using in-memory sub-application restore.

    Specification(s): batch/n1, batch/n2, batch/n2_restore

    Design: SamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.3The system shall include the ability to transfer stochastic results for two sub apps.

    Specification(s): batch_two_subapps

    Design: SamplerParameterTransfer

    Issue(s): #17079

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.4The 'StochasticToolsTransfer object shall error if the 'execute_on' parameter is defined when the corresponding MultiApp object is running in batch mode.

    Specification(s): StochasticToolsTransfer_execute_on_error

    Design: SamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.5The 'StochasticToolsTransfer' object shall error if the 'execute_on' parameter does not match the corresponding MultiApp object is running in normal mode.

    Specification(s): StochasticToolsTransfer_execute_on_check

    Design: SamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.6The system shall report a reasonable error if parameters for a trasnfer between multiapps are provided to stochastics transfer, which do not support this currently

    Specification(s): direction_error

    Design: SamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.8The system shall support the creation of a sub-application for each row of the stochastic data.

    Specification(s): monte_carlo

    Design: MonteCarloSamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.15The system shall produce an error if neither a 'SamplerTransientMultiApp' nor SamplerFullSolveMultiApp is provided in SamplerParameterTransfer.

    Specification(s): multiapp_type

    Design: SamplerParameterTransfer

    Issue(s): #11363

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.17The system shall produce an error if supplied vector of real values is not sized correctly within the SamplerParameterTransfer object.

    Specification(s): num_parameters_wrong

    Design: SamplerParameterTransfer

    Issue(s): #11363

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.18The system shall produce an error if a vector of values is supplied to a scalar parameter.

    Specification(s): not_vector

    Design: SamplerParameterTransfer

    Issue(s): #11363

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.19The system shall produce an error if sampling method differs between the sub-application and the associated sub-application data transfer.

    Specification(s): sampler_mismatch

    Design: SamplerParameterTransfer

    Issue(s): #11363

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.20The system shall be capable of transferring scalar data to sub-applications for each row of the stochastic data
    1. using a Monte Carlo and
    2. Sobol sampling scheme.

    Specification(s): transfer/monte_carlo, transfer/sobol

    Design: SamplerParameterTransfer

    Issue(s): #8065

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.21The system shall be capable of transferring vector data to sub-applications for each row of the stochastic data.

    Specification(s): monte_carlo

    Design: SamplerParameterTransfer

    Issue(s): #8065

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.12.22The system shall report an error if the transferred vector to a sub-application
    1. if the vector parameter does not exist;
    2. if the sub-application does not consume all of the supplied data;

    Specification(s): errors/invalid_name, errors/extra_data

    Design: SamplerParameterTransfer

    Issue(s): #8065

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.25The system shall support the creation of a sub-application for each row sampled data generated from a Sobol scheme.

    Specification(s): sobol

    Design: SobolSamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: MappingReporter
  • 10.9.12The system should be able to map the solutions in a snapshot container into a latent space.

    Specification(s): from_parallelstorage

    Design: MappingReporter

    Issue(s): #23619

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • 10.9.13The system should be able to extract and map variables in nonlinear system into a latent (low-dimensional) space.

    Specification(s): from_solutions

    Design: MappingReporter

    Issue(s): #23619

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • stochastic_tools: SobolReporter
  • 10.9.17The system shall support the ability to compute first, second, and total-effect Sobol sensitivity indices with a reporter.

    Specification(s): sobol

    Design: SobolReporter

    Issue(s): #15558

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • 10.9.18The system shall support the ability to compute Sobol sensitivity indices for vector-type data.

    Specification(s): sobol_vec

    Design: SobolReporter

    Issue(s): #15558

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • stochastic_tools: Cartesian1D
  • 10.10.5The system shall include the ability to create a 1D Cartesian sampling scheme.

    Specification(s): test

    Design: Cartesian1D

    Issue(s): #24566

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.10.6The system shall throw an error when performing 1D Cartesian sampling if
    1. the specified grid does not form a triplet,
    2. the number of nominal values specified does not match the number of sample columns,
    3. the number of grid points is not an integer, or
    4. the number of grid points is negative.

    Specification(s): errors/no_triplet, errors/wrong_num_values, errors/non_int, errors/negative

    Design: Cartesian1D

    Issue(s): #24566

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • stochastic_tools: Sampler
  • 10.10.14The system shall support the creation of data sampled from distribution during the initial setup of a simulation.

    Specification(s): initial

    Design: Sampler

    Issue(s): #8065

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: LatinHypercube
  • 10.10.16The system shall support the ability to sample data using the Latin Hypercube method that can operate
    1. using global matrix,
    2. a local matrix,
    3. or row-by-row.

    Specification(s): modes/global, modes/local, modes/row

    Design: LatinHypercube

    Issue(s): #14830

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.10.17The system shall support the ability to sample data using the Latin Hypercube method with more processors than rows that can operate
    1. using global matrix,
    2. a local matrix,
    3. or row-by-row.

    Specification(s): more_procs/global, more_procs/local, more_procs/row

    Design: LatinHypercube

    Issue(s): #14830

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.10.18The system shall include a utility that visually displays results of plotting Latin Hypercube test.

    Specification(s): visualize

    Design: LatinHypercube

    Issue(s): #14830

    Collection(s): FUNCTIONAL

    Type(s): CheckFiles

  • stochastic_tools: Morris
  • 10.10.22The system shall include a Morris trajectory method for sampling distribution data:
    1. with replicated output;
    2. with parallel output;
    3. with more processors than trajectories;

    Specification(s): morris/serial, morris/parallel, morris/parallel_more

    Design: Morris

    Issue(s): #21191

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: NestedMonteCarlo
  • 10.10.23The system shall include a nested Monte Carlo sampling scheme where sets of distributions are sampled as nested loops of rows
    1. in serial;
    2. in parallel;

    Specification(s): nested_monte_carlo/serial, nested_monte_carlo/parallel

    Design: NestedMonteCarlo

    Issue(s): #20130

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.10.24The system shall report an error when the number of nested Monte Carlo loops does not match the number of sets of distributions.

    Specification(s): error

    Design: NestedMonteCarlo

    Issue(s): #20130

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • stochastic_tools: Sobol
  • 10.10.25The system shall include a SOBOL method for sampling distribution data:
    1. with the re-sampling matrix and
    2. without the re-sampling matrix.

    Specification(s): sobol/resample, sobol/no_resample

    Design: Sobol

    Issue(s): #8065

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.10.26The system shall report an error if the SOBOL sampling method is setup with input sampling matrices
    1. with differing number of rows;
    2. with differing number of columns; and
    3. if the matrices are the same.

    Specification(s): errors/row_mismatch, errors/col_mismatch, errors/same_matrix

    Design: Sobol

    Issue(s): #8065

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.25The system shall support the creation of a sub-application for each row sampled data generated from a Sobol scheme.

    Specification(s): sobol

    Design: SobolSamplerParameterTransfer

    Issue(s): #8863

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: StochasticResults
  • 10.12.9The system shall produce an error if neither a 'SamplerTransientMultiApp' nor SamplerFullSolveMultiApp is provided in SamplerPostprocessorTransfer.

    Specification(s): wrong_multi_app

    Design: StochasticResults

    Issue(s): #9419

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.12.10The system shall produce an error if the 'result' object in 'SamplerPostprocessorTransfer' is not a 'StochasticResults object'.

    Specification(s): require_stochastic_results

    Design: StochasticResults

    Issue(s): #9419

    Collection(s): FUNCTIONALFAILURE_ANALYSIS

    Type(s): RunException

  • 10.15.9The system shall support the collection of stochastic data from multiple sub-applications.

    Specification(s): multiple

    Design: StochasticResults

    Issue(s): #14414

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.15.17The system shall support the collection of stochastic data that is
    1. replicated on all processors and
    2. distributed across many.

    Specification(s): parallel_type/replicated, parallel_type/distributed

    Design: StochasticResults

    Issue(s): #14410

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.15.18The system shall support the labeling of collection of stochastic data
    1. with custom prefix and
    2. without a prefix.

    Specification(s): prefix/custom, prefix/none

    Design: StochasticResults

    Issue(s): #14410

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.15.19The system shall support the collection of stochastic data that
    1. can be appended into a single data set or
    2. or contain a single file per timestep.

    Specification(s): data/complete, data/time

    Design: StochasticResults

    Issue(s): #14412

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • stochastic_tools: PODMapping
  • 10.14.1The system should be able to build a variable mapping by computing the SVD of a snapshot matrix in serial.

    Specification(s): svd_1_proc_per_app

    Design: PODMapping

    Issue(s): #23619

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • 10.14.2The system should be able to build a variable mapping by computing the SVD of a snapshot matrix in parallel.

    Specification(s): svd_2_proc_per_app

    Design: PODMapping

    Issue(s): #23619

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • 10.14.3The system should be able to build a variable mapping by computing the SVD of a snapshot matrix in parallel with having data on only root processors.

    Specification(s): svd_2_proc_per_app_root

    Design: PODMapping

    Issue(s): #23619

    Collection(s): FUNCTIONAL

    Type(s): JSONDiff

  • stochastic_tools: SobolStatistics
  • 10.15.13The system shall support the ability to compute first, second, and total-effect Sobol sensitivity indices.

    Specification(s): sobol

    Design: SobolStatistics

    Issue(s): #14784

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

  • 10.15.14The system shall support the ability to compute confidence intervals on Sobol sensitivity indices.

    Specification(s): sobol_bootstrap

    Design: SobolStatistics

    Issue(s): #14784

    Collection(s): FUNCTIONAL

    Type(s): CSVDiff

References

  1. ISO/IEC/IEEE 24765:2010(E). Systems and software engineering—Vocabulary. first edition, December 15 2010.[BibTeX]
  2. ASME NQA-1. ASME NQA-1-2008 with the NQA-1a-2009 addenda: Quality Assurance Requirements for Nuclear Facility Applications. first edition, August 31 2009.[BibTeX]