7.1. Introduction

KINSOL is part of a software family called SUNDIALS: SUite of Nonlinear and DIfferential/ALgebraic equation Solvers [81]. This suite consists of CVODE, ARKODE, KINSOL, and IDA, and variants of these with sensitivity analysis capabilities.

KINSOL is a general-purpose nonlinear system solver based on Newton-Krylov solver technology. A fixed point iteration is also included with the release of KINSOL v.2.8.0 and higher.

7.1.1. Historical Background

The first nonlinear solver packages based on Newton-Krylov methods were written in Fortran. In particular, the NKSOL package, written at LLNL, was the first Newton-Krylov solver package written for solution of systems arising in the solution of partial differential equations [28]. This Fortran code made use of Newton’s method to solve the discrete nonlinear systems and applied a preconditioned Krylov linear solver for solution of the Jacobian system at each nonlinear iteration. The key to the Newton-Krylov method was that the matrix-vector multiplies required by the Krylov method could effectively be approximated by a finite difference of the nonlinear system-defining function, avoiding a requirement for the formation of the actual Jacobian matrix. Significantly less memory was required for the solver as a result.

In the late 1990s, there was a push at LLNL to rewrite the nonlinear solver in C and port it to distributed memory parallel machines. Both Newton and Krylov methods are easily implemented in parallel, and this effort gave rise to the KINSOL package. KINSOL is similar to NKSOL in functionality, except that it provides for more options in the choice of linear system methods and tolerances, and has a more modular design to provide flexibility for future enhancements.

At present, KINSOL may utilize a variety of Krylov methods provided in SUNDIALS. These methods include the GMRES (Generalized Minimal RESidual) [122], FGMRES (Flexible Generalized Minimum RESidual) [121], Bi-CGStab (Bi-Conjugate Gradient Stabilized) [157], TFQMR (Transpose-Free Quasi-Minimal Residual) [64], and PCG (Preconditioned Conjugate Gradient) [76] linear iterative methods. As Krylov methods, these require little matrix storage for solving the Newton equations as compared to direct methods. However, the algorithms allow for a user-supplied preconditioner, and, for most problems, preconditioning is essential for an efficient solution. For very large nonlinear algebraic systems, the Krylov methods are preferable over direct linear solver methods, and are often the only feasible choice. Among the Krylov methods in SUNDIALS, we recommend GMRES as the best overall choice. However, users are encouraged to compare all options, especially if encountering convergence failures with GMRES. Bi-CGStab and TFQMR have an advantage in storage requirements, in that the number of workspace vectors they require is fixed, while that number for GMRES depends on the desired Krylov subspace size. FGMRES has an advantage in that it is designed to support preconditioners that vary between iterations (e.g., iterative methods). PCG exhibits rapid convergence and minimal workspace vectors, but only works for symmetric linear systems.

For the sake of completeness in functionality, direct linear system solvers are included in KINSOL. These include methods for both dense and banded linear systems, with Jacobians that are either user-supplied or generated internally by difference quotients. KINSOL also includes interfaces to sparse direct solvers, including KLU [4, 44] and the threaded sparse direct solver, SuperLU_MT [9, 46, 102], among others (see Chapter §10 for further details).

In the process of translating NKSOL into C, the overall KINSOL organization has been changed considerably. One key feature of the KINSOL organization is that a separate module devoted to vector operations was created. This module facilitated extension to multiprosessor environments with minimal impact on the rest of the solver. The vector module design is shared across the SUNDIALS suite. This N_Vector module is written in terms of abstract vector operations with the actual routines attached by a particular implementation (such as serial or parallel) of N_Vector. This abstraction allows writing the SUNDIALS solvers in a manner independent of the actual N_Vector implementation (which can be user-supplied), as well as allowing more than one N_Vector module linked into an executable file. SUNDIALS (and thus KINSOL) is supplied with serial, MPI-parallel, OpenMP and Pthreads thread-parallel N_Vector implementations, as well as multiple N_Vector implementations designed to leverage GPU architectures (see Chapter §8 for further details).

There are several motivations for choosing the C language for KINSOL. First, a general movement away from Fortran and toward C in scientific computing was apparent. Second, the pointer, structure, and dynamic memory allocation features in C are extremely useful in software of this complexity, with the great variety of method options offered. Finally, we prefer C over C++ for KINSOL because of the wider availability of C compilers, the potentially greater efficiency of C, and the greater ease of interfacing the solver to applications written in Fortran.

7.1.2. Changes to SUNDIALS in release 7.7.0

New Features and Enhancements

The default number of stages for the SSP Runge-Kutta methods ARKODE_LSRK_SSP_S_2 and ARKODE_LSRK_SSP_S_3 in LSRKStep were changed from 10 and 9, respectively, to their minimum allowable values of 2 and 4. Users may revert to the previous values by calling LSRKStepSetNumSSPStages().

Added the optional function ARKodeInit() to ARKODE to enable data allocation before the first call to ARKodeEvolve() (but after all other optional input routines have been called), to support users who measure memory usage before beginning a simulation.

Added the function ARKodeGetStageIndex() that returns the index of the stage currently being processed, and the total number of stages in the method, for users who wish to compute auxiliary quantities in their IVP right-hand side functions during some stages and not others (e.g., in all but the first or last stage).

Added the functions ARKodeGetLastTime() and ARKodeGetLastState() to return the last successful time and state achieved by ARKODE, respectively.

ARKODE now allows users to supply functions that will be called before each internal time step attempt (ARKodeSetPreStepFn()), after each successful time step (ARKodeSetPostStepFn()), before right-hand side routines are called on an updated state (ARKodeSetPreRhsFn()), and/or once each internal step/stage is computed (ARKodeSetPostprocessStepFn()/ ARKodeSetPostprocessStageFn()). These are considered advanced functions, as they should treat the state vector as read-only, otherwise all theoretical guarantees of solution accuracy and stability will be lost. As a result of these new functions, the values of multiple ARKODE return codes (e.g., ARK_INTERP_FAIL) have been updated; users who key off of the named constants will not be affected, but users who rely on the values themselves should update their codes accordingly.

Note to users utilizing the previously undocumented ARKodeSetPostprocessStepFn() function, the supplied function is now called on the newly computed state vector for all step attempts not just successful steps. To obtain the previous behavior of only calling a function on successful steps, switch to using ARKodeSetPostStepFn().

Added SUNLogger_Set{Error,Warning,Info,Debug}File functions to allow setting logger output streams with a FILE*.

Updated the Kokkos N_Vector to support Kokkos 5.x versions.

Bug Fixes

Fixed a CMake bug where the SuperLU_MT interface would not be built and installed without setting the SUPERLUMT_WORKS option to TRUE.

Fixed the embedded coefficients for the ARKODE_TSITOURAS_7_4_5 Butcher table.

Fixed a bug in LSRKStep where an incorrect state vector could be passed to a user-supplied dominant eigenvalue function on the first step unless the output vector passed to ARKodeEvolve() contained the initial condition and when an eigenvalue estimate is requested on the first step in a subsequent call to ARKodeEvolve() unless the output vector passed contained the most recently returned solution.

Fixed a potential bug in LSRKStep’s ARKODE_LSRK_SSP_S_3 method, where a real number was used instead of an integer, potentially resulting in a rounding error.

Fixed a bug in MRIStep for estimating the first “slow” time step in an adaptive multirate calculation.

Fixed a bug in MRIStep when using a custom inner integrator that relies on the input state being the initial condition for the fast integration rather than retaining the result from the last inner integration or most recent reset call and the output vector passed to ARKodeEvolve() does not contain the initial condition on the first call or the last returned solution on subsequent calls.

Added a missing call to SUNNonlinSolSetup() in MRIStep when using an IMEX-MRI-SR method.

Fixed a bug in the ARKODE discrete adjoint checkpointing where an incorrect state would be stored on the first step if the output vector passed to ARKodeEvolve() did not contain the initial condition on the first call.

Removed extraneous copy of output vector when using ARKODE in ARK_ONE_STEP mode.

Removed an extraneous copy of the output vector in each step with SplittingStep.

Fixed a bug in logging output from ARKODE, where for some time stepping modules, the current “time” output in the logger was incorrect.

Fixed a bug where passing an empty string to SUNLogger_Set{Error,Warning,Info,Debug}Filename did not disable the corresponding logging stream Issue #844.

Deprecation Notices

The CVodeSetMonitorFn and CVodeSetMonitorFrequency functions have been deprecated and will be removed in the next major release.

Several CMake options have been deprecated in favor of namespaced versions prefixed with SUNDIALS_ to avoid naming collisions in applications that include SUNDIALS directly within their CMake builds. Additionally, a consistent naming convention (SUNDIALS_ENABLE) is now used for all boolean options. The table below lists the old CMake option names and the new replacements.

Old Option	New Option
`BUILD_ARKODE`	`SUNDIALS_ENABLE_ARKODE`
`BUILD_CVODE`	`SUNDIALS_ENABLE_CVODE`
`BUILD_CVODES`	`SUNDIALS_ENABLE_CVODES`
`BUILD_IDA`	`SUNDIALS_ENABLE_IDA`
`BUILD_IDAS`	`SUNDIALS_ENABLE_IDAS`
`BUILD_KINSOL`	`SUNDIALS_ENABLE_KINSOL`
`ENABLE_MPI`	`SUNDIALS_ENABLE_MPI`
`ENABLE_OPENMP`	`SUNDIALS_ENABLE_OPENMP`
`ENABLE_OPENMP_DEVICE`	`SUNDIALS_ENABLE_OPENMP_DEVICE`
`OPENMP_DEVICE_WORKS`	`SUNDIALS_ENABLE_OPENMP_DEVICE_CHECKS`
`ENABLE_PTHREAD`	`SUNDIALS_ENABLE_PTHREAD`
`ENABLE_CUDA`	`SUNDIALS_ENABLE_CUDA`
`ENABLE_HIP`	`SUNDIALS_ENABLE_HIP`
`ENABLE_SYCL`	`SUNDIALS_ENABLE_SYCL`
`ENABLE_LAPACK`	`SUNDIALS_ENABLE_LAPACK`
`LAPACK_WORKS`	`SUNDIALS_ENABLE_LAPACK_CHECKS`
`ENABLE_GINKGO`	`SUNDIALS_ENABLE_GINKGO`
`GINKGO_WORKS`	`SUNDIALS_ENABLE_GINKGO_CHECKS`
`ENABLE_MAGMA`	`SUNDIALS_ENABLE_MAGMA`
`MAGMA_WORKS`	`SUNDIALS_ENABLE_MAGMA_CHECKS`
`ENABLE_SUPERLUDIST`	`SUNDIALS_ENABLE_SUPERLUDIST`
`SUPERLUDIST_WORKS`	`SUNDIALS_ENABLE_SUPERLUDIST_CHECKS`
`ENABLE_SUPERLUMT`	`SUNDIALS_ENABLE_SUPERLUMT`
`SUPERLUMT_WORKS`	`SUNDIALS_ENABLE_SUPERLUMT_CHECKS`
`ENABLE_KLU`	`SUNDIALS_ENABLE_KLU`
`KLU_WORKS`	`SUNDIALS_ENABLE_KLU_CHECKS`
`ENABLE_HYPRE`	`SUNDIALS_ENABLE_HYPRE`
`HYPRE_WORKS`	`SUNDIALS_ENABLE_HYPRE_CHECKS`
`ENABLE_PETSC`	`SUNDIALS_ENABLE_PETSC`
`PETSC_WORKS`	`SUNDIALS_ENABLE_PETSC_CHECKS`
`ENABLE_TRILINOS`	`SUNDIALS_ENABLE_TRILINOS`
`ENABLE_RAJA`	`SUNDIALS_ENABLE_RAJA`
`ENABLE_XBRAID`	`SUNDIALS_ENABLE_XBRAID`
`XBRAID_WORKS`	`SUNDIALS_ENABLE_XBRAID_CHECKS`
`ENABLE_ONEMKL`	`SUNDIALS_ENABLE_ONEMKL`
`ONEMKL_WORKS`	`SUNDIALS_ENABLE_ONEMKL_CHECKS`
`ENABLE_CALIPER`	`SUNDIALS_ENABLE_CALIPER`
`ENABLE_ADIAK`	`SUNDIALS_ENABLE_ADIAK`
`ENABLE_KOKKOS`	`SUNDIALS_ENABLE_KOKKOS`
`KOKKOS_WORKS`	`SUNDIALS_ENABLE_KOKKOS_CHECKS`
`ENABLE_KOKKOS_KERNELS`	`SUNDIALS_ENABLE_KOKKOS_KERNELS`
`KOKKOS_KERNELS_WORKS`	`SUNDIALS_ENABLE_KOKKOS_KERNELS_CHECKS`
`BUILD_FORTRAN_MODULE_INTERFACE`	`SUNDIALS_ENABLE_FORTRAN`
`SUNDIALS_BUILD_WITH_PROFILING`	`SUNDIALS_ENABLE_PROFILING`
`SUNDIALS_BUILD_WITH_MONITORING`	`SUNDIALS_ENABLE_MONITORING`
`SUNDIALS_BUILD_PACKAGE_FUSED_KERNELS`	`SUNDIALS_ENABLE_PACKAGE_FUSED_KERNELS`
`EXAMPLES_ENABLE_C`	`SUNDIALS_ENABLE_C_EXAMPLES`
`EXAMPLES_ENABLE_CXX`	`SUNDIALS_ENABLE_CXX_EXAMPLES`
`EXAMPLES_ENABLE_F2003`	`SUNDIALS_ENABLE_FORTRAN_EXAMPLES`
`EXAMPLES_ENABLE_CUDA`	`SUNDIALS_ENABLE_CUDA_EXAMPLES`
`EXAMPLES_INSTALL`	`SUNDIALS_ENABLE_EXAMPLES_INSTALL`
`EXAMPLES_INSTALL_PATH`	`SUNDIALS_EXAMPLES_INSTALL_PATH`
`BUILD_BENCHMARKS`	`SUNDIALS_ENABLE_BENCHMARKS`
`BENCHMARKS_INSTALL_PATH`	`SUNDIALS_BENCHMARKS_INSTALL_PATH`
`SUNDIALS_BENCHMARK_OUTPUT_DIR`	`SUNDIALS_BENCHMARKS_OUTPUT_DIR`
`SUNDIALS_BENCHMARK_CALIPER_OUTPUT_DIR`	`SUNDIALS_BENCHMARKS_CALIPER_OUTPUT_DIR`
`SUNDIALS_BENCHMARK_NUM_CPUS`	`SUNDIALS_BENCHMARKS_NUM_CPUS`
`SUNDIALS_BENCHMARK_NUM_GPUS`	`SUNDIALS_BENCHMARKS_NUM_GPUS`
`ENABLE_ALL_WARNINGS`	`SUNDIALS_ENABLE_ALL_WARNINGS`
`ENABLE_WARNINGS_AS_ERRORS`	`CMAKE_COMPILE_WARNING_AS_ERROR`
`ENABLE_ADDRESS_SANITIZER`	`SUNDIALS_ENABLE_ADDRESS_SANITIZER`
`ENABLE_MEMORY_SANITIZER`	`SUNDIALS_ENABLE_MEMORY_SANITIZER`
`ENABLE_LEAK_SANITIZER`	`SUNDIALS_ENABLE_LEAK_SANITIZER`

Following the updated CMake options, the macros listed below have been deprecated and replaced with versions that align with the new CMake options.

Old Macro	New Macro
`SUNDIALS_BUILD_WITH_PROFILING`	`SUNDIALS_ENABLE_PROFILING`
`SUNDIALS_BUILD_WITH_MONITORING`	`SUNDIALS_ENABLE_MONITORING`
`SUNDIALS_BUILD_PACKAGE_FUSED_KERNELS`	`SUNDIALS_ENABLE_PACKAGE_FUSED_KERNELS`

For changes in prior versions of SUNDIALS see §18.

7.1.3. Reading this User Guide

This user guide is a combination of general usage instructions and specific examples. We expect that some readers will want to concentrate on the general instructions, while others will refer mostly to the examples, and the organization is intended to accommodate both styles.

There are different possible levels of usage of KINSOL. The most casual user, with a small nonlinear system, can get by with reading all of Chapter §7.2, then Chapter :numref:KINSOL.Usage.CC through §7.4 only, and looking at examples in [42]. In a different direction, a more expert user with a nonlinear system may want to (a) use a package preconditioner (§7.4.5), (b) supply his/her own Jacobian or preconditioner routines (§7.4.4), (c) supply a new N_Vector module (Chapter §8), or even (d) supply a different linear solver module (§7.4.3.2 and Chapter §10).

The structure of this document is as follows:

In Chapter §7.2, we provide short descriptions of the numerical methods implemented by KINSOL for the solution of nonlinear systems.
The following chapter describes the software organization of the KINSOL solver (§7.3).
Chapter :numref:KINSOL.Usage.CC is the main usage document for KINSOL for C applications. It includes a complete description of the user interface for the solution of nonlinear algebraic systems.
Chapter §8 gives a brief overview of the generic N_Vector module shared among the various components of SUNDIALS, and details on the four N_Vector implementations provided with SUNDIALS.
Chapter §9 gives a brief overview of the generic SUNMatrix module shared among the various components of SUNDIALS, and details on the SUNMatrix implementations provided with SUNDIALS.
Chapter §10 gives a brief overview of the generic SUNLinearSolver module shared among the various components of SUNDIALS. This chapter contains details on the SUNLinearSolver implementations provided with SUNDIALS. The chapter also contains details on the SUNLinearSolver implementations provided with SUNDIALS that interface with external linear solver libraries.
Finally, in the appendices, we provide detailed instructions for the installation of KINSOL, within the structure of SUNDIALS (Appendix §1.1), as well as a list of all the constants used for input to and output from KINSOL functions (Appendix §7.5).

Finally, the reader should be aware of the following notational conventions in this user guide: program listings and identifiers (such as KINInit) within textual explanations appear in typewriter type style; fields in C structures (such as content) appear in italics; and packages or modules are written in all capitals. Usage and

7.1.4. SUNDIALS License and Notices

All SUNDIALS packages are released open source, under the BSD 3-Clause license for more details see the LICENSE and NOTICE files provided with all SUNDIALS packages.

7.1.5. Acknowledgments

We wish to acknowledge the contributions to previous versions of the KINSOL code and user guide by Allan G. Taylor.