Difference between revisions of "I4DVAR ANA SENSITIVITY"

From WikiROMS
Jump to navigationJump to search
Line 5: Line 5:
<wikitex>
<wikitex>


The mathematical development presented here closely parallels that of [[Bibliography#ZhuY_2008a | Zhu and Gelaro (2008)]]. The sensitivity of any functional, $\cal J$, of the analysis $\bf{\Psi}_a$ or forecast <math>\,\!\mathbf{\Psi}_f</math> can be efficiently computed using the adjoint model which yields information about the gradients of <math>\,\!\jmath(\mathbf{\Psi}_a)</math> and <math>\,\!\jmath(\mathbf{\Psi}_f)</math>. We can extent the concept of the adjoint sensitivity to compute the sensitivity of the [[IS4DVAR]] cost function, <math>\,\!J</math>,
The mathematical development presented here closely parallels that of [[Bibliography#ZhuY_2008a | Zhu and Gelaro (2008)]]. The sensitivity of any functional, $\cal J$, of the analysis $\bf\Psi_a$ or forecast $\bf\Psi_f$ can be efficiently computed using the adjoint model which yields information about the gradients of ${\cal J}({\bf\Psi_a})$ and ${\cal J}({\bf\Psi_f})$. We can extent the concept of the adjoint sensitivity to compute the sensitivity of the [[IS4DVAR]] cost function, $J$,


$$J(\psi) = \frac{1}{2}\,\psi^T \bf{B}^{-1} \psi + \frac{1}{2}\,(\bf{G}\psi - \bf{d})^T \bf{O}^{-1} (\bf{G}\psi - \bf{d}) \eqno{(1)} $$
$$J(\psi) = \frac{1}{2}\,\psi^T {\bf B}^{-1} \psi + \frac{1}{2}\,({\bf G}\psi - {\bf d})^T {\bf O}^{-1} ({\bf G}\psi - {\bf d}) \eqno{(1)} $$


and any other function <math>\,\!\jmath</math> of the forecast <math>\,\!\mathbf{\Psi}_f</math> to the observations, <math>\,\!\mathbf{y}</math>. Here, <math>\,\!\psi</math> is the ocean state vector, <math>\,\!\mathbf{O}</math> is the observation error and error of representativeness matrix, <math>\,\!\mathbf{B}</math> represents the background error covariance matrix, <math>\,\!\mathbf{d}</math> is the innovation vector that represents the difference between the nonlinear background solution and the observations, <math>\,\!\mathbf{d}_i = \mathbf{y}_i - \mathbf{H}_i(\mathbf{\Psi}_b(t))</math>, <math>\,\!\mathbf{H}_i</math> is an operator that samples the nonlinear model at the observation location, and <math>\,\!\mathbf{G} = \mathbf{H}_{i}^{'}\mathbf{R}(t_0,t_i)</math>.
and any other function $\cal J$ of the forecast $\bf\Psi_f$ to the observations, $\bf y$. Here, $\psi$ is the ocean state vector, $\bf O$ is the observation error and error of representativeness matrix, $\bf B$ represents the background error covariance matrix, $\bf d$ is the innovation vector that represents the difference between the nonlinear background solution and the observations, ${\bf d}_i = {\bf y}_i - {\bf H}_i ({\bf\Psi_b}(t))$, ${\bf H}_i$ is an operator that samples the nonlinear model at the observation location, and ${\bf G} = {\bf H}_{i}^{\prime} {\bf R}(t_0,t_i)$.


In the current [[IS4DVAR]]/[[LANCZOS]] data assimilation algorithm, the above cost function is identified using the Lanczos method  [[Bibliography#Golub_1989a | (Golub and Van Loan, 1989)]], in which case:
In the current [[IS4DVAR]]/[[LANCZOS]] data assimilation algorithm, the cost function (1) is identified using the Lanczos method  [[Bibliography#Golub_1989a | (Golub and Van Loan, 1989)]], in which case:


:<math>\mathbf{\Psi}_a = \mathbf{\Psi}_b - \mathbf{Q}_k \mathbf{T}_{k}^{-1} \mathbf{Q}_{k}^{T} \mathbf{G}^T \mathbf{O}^{-1} \mathbf{d}</math>
$${\bf\Psi_a} = {\bf\Psi_b} - {\bf Q}_{k} {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf G}^T {\bf O}^{-1} {\bf d} \eqno{(2)}$$


where <math>\,\!\mathbf{Q}_k =(\mathbf{q}_i)</math> is the matrix of ''k'' orthogonal Lanczos vectors, and <math>\,\!\mathbf{T}_k</math> is a known tridiagonal matrix. Each of the ''k''-iterations of [[IS4DVAR]] employed in finding the minimum of <math>\,\!J</math> yields one column <math>\,\!\mathbf{q}_i</math> of <math>\,\!\mathbf{Q}_k</math>. We can identify the Kalman gain matrix as <math>\mathbf{K} = - \mathbf{Q}_k \mathbf{T}_{k}^{-1} \mathbf{Q}_{k}^{T} \mathbf{G}^T \mathbf{O}^{-1}</math> in which case <math>\mathbf{K}^T = - \mathbf{O}^{-1} \mathbf{G} \mathbf{Q}_k \mathbf{T}_{k}^{-1} \mathbf{Q}_{k}^{T}</math> represents the adjoint of the entire [[IS4DVAR]] system. The <math>\,\!\mathbf{K}^T</math> on the vector <math>\,\!\mathbf{G}^T \mathbf{O}^{-1} d</math> above can be readily computed since the Lanczos vectors <math>\,\!\mathbf{q}_i</math> and the matrix <math>\,\!\mathbf{T}_k</math> are available at the end of each [[IS4DVAR]] assimilation cycle.
where ${\bf Q}_k = ({\bf q}_i)$ is the matrix of $k$ orthogonal Lanczos vectors, and ${\bf T}_k$ is a known tridiagonal matrix. Each of the $k\hyphen$iterations of [[IS4DVAR]] employed in finding the minimum of $J$ yields one column ${\bf q}_i$ of ${\bf Q}_k$. We can identify the Kalman gain matrix as ${\bf K} = - {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf G}^T {\bf O}^{-1}$ in which case ${\bf K}^T = - {\bf O}^{-1} {\bf G} {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T}$ represents the adjoint of the entire [[IS4DVAR]] system. The ${\bf K}^T$ on the vector ${\bf G}^T {\bf O}^{-1} {\bf d}$ above can be readily computed since the Lanczos vectors ${\bf q}_i$ and the matrix ${\bf T}_k$ are available at the end of each [[IS4DVAR]] assimilation cycle.


Therefore, the basic observation sensitivity algorithm is as follows:
Therefore, the basic observation sensitivity algorithm is as follows:


# Force the adjoint model with <math>\,\!\mathbf{O}^{-1} \mathbf{d}</math> to yield <math>\,\!\psi^\dagger = \mathbf{G}^\mathbf{T} \mathbf{O}^{-1} \mathbf{d} \propto \mathbf{q}_i</math>.
# Force the adjoint model with ${\bf O}^{-1} {\bf d}$ to yield $\psi^\dagger = {\bf G}^{T} {\bf O}^{-1} {\bf d} \propto {\bf q}_i$.
# Operate <math>\,\!\psi^\dagger(t_0)</math> with <math>\,\!\mathbf{Q}_k \mathbf{T}_{k}^{-1} \mathbf{Q}_{k}^{T}</math> which is equivalent to a rank-''k'' approximation of the Hessian matrix.
# Operate $\psi^\dagger(t_0)$ with ${\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T}$ which is equivalent to a rank$\hyphen k$ approximation of the Hessian matrix.
# Integrate the results of (2) forward in time using the tangent linear model and save the solution <math>\,\!\mathbf{H}^{'}\psi(t)</math>, that is, solution at observation locations.
# Integrate the results of step 2 forward in time using the tangent linear model and save the solution ${\bf H}^{\prime} \psi(t)$, that is, solution at observation locations.
# Multiply <math>\,\!\mathbf{H}^{'}\psi(t)</math> by <math>\,\!-\mathbf{O}^{-1}</math> to yield <math>\,\!{\partial J}/{\partial\mathbf{y}}</math>.
# Multiply ${\bf H}^{\prime}\psi(t)$ by $-{\bf O}^{-1}$ to yield ${\partial J}/{\partial{\bf y}}$.


Consider now a forecast <math>\,\!\mathbf{\Psi}_f(t)</math> fort the interval <math>\,\!t = [t_0+\tau, \; t_0+\tau+t_f]</math> initialized from <math>\,\!\mathbf{\Psi}_a(t_0+\tau)</math> obtained from an assimilation cycle over the interval <math>\,\!t = [t_0+\tau, \; t_0+\tau]</math>, where <math>\,\!t_f</math> is the forecast lead time. In addition, consider <math>\,\!\jmath(\mathbf{\Psi}_f(t))</math> that depends on the forecast <math>\,\!\mathbf{\Psi}_f(t)</math>, and which characterizes some future aspect of the forecast circulation.  According to the chain rule, the sensitivity <math>\,\!{\partial\jmath}/{\partial y}</math> of <math>\,\!\jmath</math> to the observations <math>\,\!\mathbf{y}</math> collected during the assimilation cycle <math>\,\!t = [t_0+\tau, \; t_0+\tau]</math> is given by:
Consider now a forecast ${\bf\Psi_f}(t)$ fort the interval $t = [t_0+\tau, \; t_0+\tau+t_f]$ initialized from ${\bf\Psi_a}(t_0+\tau)$ obtained from an assimilation cycle over the interval $t = [t_0+\tau, \; t_0+\tau]$, where $t_f$ is the forecast lead time. In addition, consider ${\cal J}({\bf\Psi_f}(t))$ that depends on the forecast ${\bf\Psi_f}(t)$, and which characterizes some future aspect of the forecast circulation.  According to the chain rule, the sensitivity ${\partial\cal{J}}/{\partial{\bf y}}$ of $\cal{J}$ to the observations $\bf y$ collected during the assimilation cycle $t = [t_0+\tau, t_0+\tau]$ is given by:


:<math>\frac{\partial\jmath}{\partial\mathbf{y}} = \frac{\partial\mathbf{\Psi}_a}{\partial\mathbf{y}} \frac{\partial\mathbf{\Psi}_f}{\partial\mathbf{\Psi}_a} \frac{\partial\jmath}{\partial\mathbf{\Psi}_f} = - \mathbf{O}^{-1} \mathbf{H} \mathbf{R} \mathbf{Q}_k \mathbf{T}_{k}^{-1} \mathbf{Q}_{k}^{T} \mathbf{R}^{T} \frac{\partial\jmath}{\partial\mathbf{\Psi}_f}</math>
$$ {\pder{\cal J}{\bf y}} = {\pder{\bf\Psi_a}{\bf y}} {\pder{\bf\Psi_f}{\bf\Psi_a}} {\pder{\cal J}{\bf\Psi_f}} = - {\bf O}^{-1} {\bf H} {\bf R} {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf R}^{T} {\pder{\cal J}{\bf\Psi_f}} \eqno{(3)}$$


which again can be readily evaluated using the adjoint model denoted by <math>\,\!\mathbf{R}^{T}(t,\;t_0+\tau)</math> and the adjoint of [[IS4DVAR]], <math>\,\!\mathbf{K}^{T}</math>.
which again can be readily evaluated using the adjoint model denoted by ${\bf R}^{T}(t,t_0+\tau)$ and the adjoint of [[IS4DVAR]], ${\bf K}^{T}$.


<wikitex>
<wikitex>

Revision as of 22:27, 9 July 2008


Technical Description

<wikitex>

The mathematical development presented here closely parallels that of Zhu and Gelaro (2008). The sensitivity of any functional, $\cal J$, of the analysis $\bf\Psi_a$ or forecast $\bf\Psi_f$ can be efficiently computed using the adjoint model which yields information about the gradients of ${\cal J}({\bf\Psi_a})$ and ${\cal J}({\bf\Psi_f})$. We can extent the concept of the adjoint sensitivity to compute the sensitivity of the IS4DVAR cost function, $J$,

$$J(\psi) = \frac{1}{2}\,\psi^T {\bf B}^{-1} \psi + \frac{1}{2}\,({\bf G}\psi - {\bf d})^T {\bf O}^{-1} ({\bf G}\psi - {\bf d}) \eqno{(1)} $$

and any other function $\cal J$ of the forecast $\bf\Psi_f$ to the observations, $\bf y$. Here, $\psi$ is the ocean state vector, $\bf O$ is the observation error and error of representativeness matrix, $\bf B$ represents the background error covariance matrix, $\bf d$ is the innovation vector that represents the difference between the nonlinear background solution and the observations, ${\bf d}_i = {\bf y}_i - {\bf H}_i ({\bf\Psi_b}(t))$, ${\bf H}_i$ is an operator that samples the nonlinear model at the observation location, and ${\bf G} = {\bf H}_{i}^{\prime} {\bf R}(t_0,t_i)$.

In the current IS4DVAR/LANCZOS data assimilation algorithm, the cost function (1) is identified using the Lanczos method (Golub and Van Loan, 1989), in which case:

$${\bf\Psi_a} = {\bf\Psi_b} - {\bf Q}_{k} {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf G}^T {\bf O}^{-1} {\bf d} \eqno{(2)}$$

where ${\bf Q}_k = ({\bf q}_i)$ is the matrix of $k$ orthogonal Lanczos vectors, and ${\bf T}_k$ is a known tridiagonal matrix. Each of the $k\hyphen$iterations of IS4DVAR employed in finding the minimum of $J$ yields one column ${\bf q}_i$ of ${\bf Q}_k$. We can identify the Kalman gain matrix as ${\bf K} = - {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf G}^T {\bf O}^{-1}$ in which case ${\bf K}^T = - {\bf O}^{-1} {\bf G} {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T}$ represents the adjoint of the entire IS4DVAR system. The ${\bf K}^T$ on the vector ${\bf G}^T {\bf O}^{-1} {\bf d}$ above can be readily computed since the Lanczos vectors ${\bf q}_i$ and the matrix ${\bf T}_k$ are available at the end of each IS4DVAR assimilation cycle.

Therefore, the basic observation sensitivity algorithm is as follows:

  1. Force the adjoint model with ${\bf O}^{-1} {\bf d}$ to yield $\psi^\dagger = {\bf G}^{T} {\bf O}^{-1} {\bf d} \propto {\bf q}_i$.
  2. Operate $\psi^\dagger(t_0)$ with ${\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T}$ which is equivalent to a rank$\hyphen k$ approximation of the Hessian matrix.
  3. Integrate the results of step 2 forward in time using the tangent linear model and save the solution ${\bf H}^{\prime} \psi(t)$, that is, solution at observation locations.
  4. Multiply ${\bf H}^{\prime}\psi(t)$ by $-{\bf O}^{-1}$ to yield ${\partial J}/{\partial{\bf y}}$.

Consider now a forecast ${\bf\Psi_f}(t)$ fort the interval $t = [t_0+\tau, \; t_0+\tau+t_f]$ initialized from ${\bf\Psi_a}(t_0+\tau)$ obtained from an assimilation cycle over the interval $t = [t_0+\tau, \; t_0+\tau]$, where $t_f$ is the forecast lead time. In addition, consider ${\cal J}({\bf\Psi_f}(t))$ that depends on the forecast ${\bf\Psi_f}(t)$, and which characterizes some future aspect of the forecast circulation. According to the chain rule, the sensitivity ${\partial\cal{J}}/{\partial{\bf y}}$ of $\cal{J}$ to the observations $\bf y$ collected during the assimilation cycle $t = [t_0+\tau, t_0+\tau]$ is given by:

$$ {\pder{\cal J}{\bf y}} = {\pder{\bf\Psi_a}{\bf y}} {\pder{\bf\Psi_f}{\bf\Psi_a}} {\pder{\cal J}{\bf\Psi_f}} = - {\bf O}^{-1} {\bf H} {\bf R} {\bf Q}_k {\bf T}_{k}^{-1} {\bf Q}_{k}^{T} {\bf R}^{T} {\pder{\cal J}{\bf\Psi_f}} \eqno{(3)}$$

which again can be readily evaluated using the adjoint model denoted by ${\bf R}^{T}(t,t_0+\tau)$ and the adjoint of IS4DVAR, ${\bf K}^{T}$.

<wikitex>