I4DVAR ANA SENSITIVITY: Difference between revisions

From WikiROMS
Jump to navigationJump to search
m Robertson moved page I4DVAR ANA SENSITIVTY to I4DVAR ANA SENSITIVITY without leaving a redirect: Name title typo   (change visibility)
No edit summary   (change visibility)
Line 1: Line 1:
{{#lst:Options|I4DVAR_ANA_SENSITIVTY}}
{{#lst:Options|I4DVAR_ANA_SENSITIVITY}}


__TOC__
__TOC__

Revision as of 19:10, 20 July 2020

I4DVAR_ANA_SENSITIVITY (Formerly IS4DVAR_SENSITIVITY)
Option to activate the observation impact and sensitivity driver which quantifies the impact that each observation has on the 4DVAR data assimilation system. By measuring the sensitivity of the data assimilation system to each observation, we can determine the degree to which each observation contributes to the uncertainty in the circulation estimate. This analysis can help us to determine the type of measurements that need to be made, where to observe, and when.
required = ADJOINT, NONLINEAR, TANGENT
related = I4DVAR, LANCZOS
routine = obs_sen_i4dvar_analysis.h

Mathematical Formulation

The mathematical development presented here closely parallels that of Zhu and Gelaro (2008). The sensitivity of any functional, J, of the analysis Ψa or forecast Ψf can be efficiently computed using the adjoint model which yields information about the gradients of J(Ψa) and J(Ψf). We can extent the concept of the adjoint sensitivity to compute the sensitivity of the IS4DVAR cost function, J, in (1) and any other function J of the forecast Ψf to the observations, y.

In IS4DVAR, we define a quadratic cost function:

J(ψ)=12ψTB1ψ+12(Gψd)TO1(Gψd) (1)

where ψ is the ocean state vector (ζ, u, v, T, S,) with ψψ(t0), O is the observation error and error of representativeness covariance matrix, B represents the background error covariance matrix, d is the innovation vector that represents the difference between the nonlinear background solution (Ψb) and the observations, di=yiHi(Ψb(ti)). Here, Hi is an operator that samples the nonlinear model at the observation location, G=HiR(t0,ti), Hi is the linearization of Hi, and the operator R is referred as the tangent linear model propagator.

At the minimum of (1), the cost function gradient J/ψ vanishes and:

ψψa=(B1+GTO1G)1GTO1d, (2)

where ψa is referred to as the analysis increment, the desired solution of the incremental data assimilation procedure.

The IS4DVAR analysis can be written in the more traditional form as Ψa=Ψb+Kd ( Daley, 1991), where ψa=Kd, the Kalman gain matrix K=H1GTO1, and H=2J/ψ2=B1GTO1G is the Hessian matrix. The entire IS4DVAR procedure is therefore neatly embodied in K. At the cost function minimum, (1) can be written as J=12dT(IKTGT)O1d which yields the sensitivity of the IS4DVAR cost function to the observations.

In the current IS4DVAR/LANCZOS data assimilation algorithm, the cost function minimum of (1) is identified using the Lanczos method (Golub and Van Loan, 1989), in which case:

Ψa=ΨbQkTk1QkTGTO1d (3)

where Qk=(qi) is the matrix of k orthogonal Lanczos vectors, and Tk is a known tridiagonal matrix. Each of the kiterations of IS4DVAR employed in finding the minimum of J yields one column qi of Qk. From (3) we can identify the Kalman gain matrix as K=QkTk1QkTGTO1 in which case KT=O1GQkTk1QkT represents the adjoint of the entire IS4DVAR system. The action of KT on the vector GTO1d in (3) can be readily computed since the Lanczos vectors qi and the matrix Tk are available at the end of each IS4DVAR assimilation cycle.

Therefore, the basic observation sensitivity algorithm is as follows:

  1. Force the adjoint model with O1d to yield ψ=GTO1dqi.
  2. Operate on ψ(t0) with QkTk1QkT which is equivalent to a rankk approximation of the Hessian matrix.
  3. Integrate the results of step (ii) forward in time using the tangent linear model and save the solution Hψ(ti). That is, the solution at observation locations.
  4. Multiply Hψ(ti) by O1 to yield J/y.

Consider now a forecast Ψf(t) for the interval t=[t0+τ,t0+τ+tf] initialized from Ψa(t0+τ) obtained from an assimilation cycle over the interval t=[t0,t0+τ], where tf is the forecast lead time. In addition, consider J(Ψf(t)) that depends on the forecast Ψf(t), and which characterizes some future aspect of the forecast circulation. According to the chain rule, the sensitivity J/y of J to the observations y collected during the assimilation cycle t=[t0,t0+τ] is given by:

Jy=ΨayΨfΨaJΨf=O1HRQkTk1QkTRTJΨf (4)

which again can be readily evaluated using the adjoint model denoted by RT(t,t0+τ) and the adjoint of IS4DVAR, KT.

Technical Description

This ROMS driver can be used to evaluate the impact of each observation on the IS4DVAR data assimilation algorithm by measuring their sensitivity over a specified circulation functional index, J. This algorithm is an extension of the adjoint sensitivity driver, AD_SENSITIVITY. As mentioned above, this algorithm is equivalent to taking the adjoint of the IS4DVAR algorithm.

The following steps are carried out in obs_sen_is4dvar.h:

  1. We begin by running the incremental strong constraint 4DVar data assimilation algorithm (IS4DVAR) with the Lanczos conjugate gradient minimization algorithm (LANCZOS) using k inner-loops (Ninner=k) and a single outer-loop (Nouter=1) for the period t=[t0,t1]. We will denote by Ψb(t0) the background initial condition, and the observations vector by y. The resulting Lanczos vectors that we save in the adjoint NetCDF file will be denoted by qi, where i=1,2,,k.
  2. Next we run the nonlinear model for the combined assimilation plus forecast period t=[t0,t2], where t2t1. This represents the final sweep of the nonlinear model for the period t=[t0,t1] after exiting the inner-loop in the IS4DVAR plus the forecast period t=[t1,t2]. The initial condition for the nonlinear mode at t=t0 is Ψb(t0) and not the new estimated initial conditions, Ψ(t0)=Ψb(t0)+ψ(t0). We save the basic state trajectory, Ψb(t), of this nonlinear model run for use in the adjoint sensitivity calculation next, and for use in the tangent linear model run later in step (vii). Depending on time for which the sensitivity functional J(t) is defined, this will dictate t2. For example, if J(t) is a functional defined during the forecast interval t1<t<t2, then t2>t1 for this run of the nonlinear model. However, if J(t) is defined during the assimilation interval t0<t<t1, then t2=t1. That is, the definition of t2 should be flexible depending on the choice of J.
  3. The next step involves an adjoint sensitivity calculation for the combined assimilation plus forecast period t=[t2,t0]. The basic state trajectory for this calculation will be that from the nonlinear model run in step (ii).
  4. After running the regular adjoint sensitivity calculation in (iii), we will have a full 3D-adjoint state vector at time t=t0. Let's call this vector ψ(t0). The next thing we want to do is to compute the dot-product of ψ(t0) with each of the Lanczos vectors from the previous IS4DVAR run. So if we ran IS4DVAR with k inner-loops, we will have kLanczos vectors which we denote as qi where i=1,2,,k. So we will compute ai=ψTqi where ψT is the transpose of the vector ψ(t0), and ai for i=1,2,,k are scalars, so there will be k of them.
  5. The next step is to invert the tridiagonal matrix associated with the Lanczos vectors. Let's denote this matrix as T. So what we want to solve Tb=a, where a is the k×1 vector of scalars ai from step (iv), and b is the k×1 vector that we want to find. So we solve for b by using a tridiagonal solver.
  6. The next step is to compute a weighted sum of the Lanczos vectors. Let's call this z, where z=i(biqi) for i=1,2,,k. The bi are obtained from solving the tridiagonal equation in (v), and the qi are the Lanczos vectors. The vector z is a full-state vector and be used as an initial condition for the tangent linear model in step (vii).
  7. Finally, we run the tangent linear model from t=[t0,t1] using z from (vi) as the initial conditions. During this run of the tangent linear model, we need to read and process the observations y that we used in the IS4DVAR of step 1 and write the solution at the observation points and times to the MODname NetCDF file. The values that we write into this MODname are actually the values multiplied by error covariance O1 assigned to each observation during the IS4DVAR in step (i).