Minimum-norm solution: derivation and a geometric picture

Updated 2023/05/28

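This is the problem of finding, among all solutions of a system of linear equations, the one closest to the origin. The problem has a clean closed-form solution.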

Theorem 1 (minimum-norm solution)

Among the $\overrightarrow{x}$ satisfying $A\overrightarrow{x}=\overrightarrow{b}$, there exists exactly one solution $\overrightarrow{x_*}$ that minimizes $\|\overrightarrow{x}\|_2$, and it is given by $\overrightarrow{x_*}=A^{\top}(AA^{\top})^{-1}\overrightarrow{b}$. Here $A$ is an $m\times n$ matrix whose row vectors are linearly independent, $\overrightarrow{x}$ is an $n$-dimensional vector, and $\overrightarrow{b}$ is an $m$-dimensional vector.

This article covers the problem setup, the proof of the theorem, and an interpretation via projection.

Problem setup and notation

$\|\overrightarrow{x}\|_2$ is the length (2-norm) of $\overrightarrow{x}$. That is, if the components of $\overrightarrow{x}$ are $x_1,...,x_n$, then $\|\overrightarrow{x}\|_2=\sqrt{x_1^2+x_2^2+\cdots +x_n^2}$.
Since the row vectors of $A$ are linearly independent, $m\leqq n$. Picture a wide matrix (a system with more unknowns than equations).
$\mathrm{rank}\:A=m$. Hence, for any $\overrightarrow{b}$ there exists a solution $\overrightarrow{x}$ of $A\overrightarrow{x}=\overrightarrow{b}$. The problem is to find, among those solutions, the one that minimizes $\|\overrightarrow{x}\|_2$ (the minimum-norm solution).
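To make the setup concrete, here is a minimal sketch that evaluates the formula of Theorem 1 with exact rational arithmetic. The helper names `solve_linear_system` and `min_norm_solution`, and the example matrix, are illustrative choices and not part of the original article. In practice one would more likely use a numerical linear algebra library.

```python
from fractions import Fraction


def solve_linear_system(M, rhs):
    """Solve M y = rhs exactly by Gauss-Jordan elimination (M square, invertible)."""
    n = len(M)
    # Augmented matrix [M | rhs] with exact rational entries.
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(M, rhs)]
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero pivot.
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so that the pivot becomes 1.
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [a - f * c for a, c in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def min_norm_solution(A, b):
    """Return x_* = A^T (A A^T)^{-1} b for A with linearly independent rows."""
    m, n = len(A), len(A[0])
    # Gram matrix A A^T (m x m).
    gram = [[sum(Fraction(A[i][k]) * A[j][k] for k in range(n)) for j in range(m)]
            for i in range(m)]
    y = solve_linear_system(gram, b)  # y = (A A^T)^{-1} b
    # x_* = A^T y
    return [sum(A[i][k] * y[i] for i in range(m)) for k in range(n)]


# Example (assumed for illustration): 2 equations, 3 unknowns.
A = [[1, 0, 1],
     [0, 1, 1]]
b = [3, 3]
x_star = min_norm_solution(A, b)
print([str(v) for v in x_star])  # ['1', '1', '2']
```

Solving $(AA^{\top})\overrightarrow{y}=\overrightarrow{b}$ and then forming $A^{\top}\overrightarrow{y}$ avoids computing the inverse explicitly, while giving the same $\overrightarrow{x_*}$.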

Proof of the minimum-norm solution

We prove the theorem stated at the top. The claim is split into three parts, each proved separately:

1. $AA^{\top}$ has an inverse $(AA^{\top})^{-1}$ in the first place.
2. $\overrightarrow{x_{*}}=A^{\top}(AA^{\top})^{-1}\overrightarrow{b}$ is a solution of $A\overrightarrow{x}=\overrightarrow{b}$.
3. For every $\overrightarrow{x}$ with $A\overrightarrow{x}=\overrightarrow{b}$, we have $\|\overrightarrow{x}\|_2\geqq \|\overrightarrow{x_*}\|_2$.

Proof of 1

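We prove that $AA^{\top}$ is invertible. Concretely, we prove condition 4 from the related article on the meaning of matrix invertibility and its five equivalent conditions: "if $AA^{\top}\overrightarrow{x}=\overrightarrow{0}$ then $\overrightarrow{x}=\overrightarrow{0}$" (in this part, $\overrightarrow{x}$ denotes an $m$-dimensional vector).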

If $AA^{\top}\overrightarrow{x}=\overrightarrow{0}$, then $\overrightarrow{x}^{\top}AA^{\top}\overrightarrow{x}=0$, that is, $(A^{\top}\overrightarrow{x})^{\top}(A^{\top}\overrightarrow{x})=0$, which means

$\|A^{\top}\overrightarrow{x}\|_2^2=0$

The only vector of length $0$ is $\overrightarrow{0}$, so

$A^{\top}\overrightarrow{x}=\overrightarrow{0}$

By the assumption of the theorem, the row vectors of $A$ are linearly independent, so the column vectors of $A^{\top}$ are linearly independent. Hence $\overrightarrow{x}=\overrightarrow{0}$.
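The role of the row-independence assumption can be checked on small examples. The sketch below (helper names and matrices are illustrative assumptions) computes $\det(AA^{\top})$ exactly, once for a matrix with independent rows and once for a matrix whose second row is a multiple of the first.

```python
from fractions import Fraction


def gram_matrix(A):
    """Return A A^T, the m x m matrix of inner products between rows of A."""
    return [[sum(Fraction(a) * c for a, c in zip(row_i, row_j)) for row_j in A]
            for row_i in A]


def determinant(M):
    """Exact determinant by Laplace expansion along the first row (small M only)."""
    if len(M) == 1:
        return M[0][0]
    total = Fraction(0)
    for j, entry in enumerate(M[0]):
        # Minor: delete the first row and column j.
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * determinant(minor)
    return total


independent_rows = [[1, 0, 1],
                    [0, 1, 1]]
dependent_rows = [[1, 2, 3],
                  [2, 4, 6]]  # second row = 2 * first row

det_independent = determinant(gram_matrix(independent_rows))
det_dependent = determinant(gram_matrix(dependent_rows))
print(det_independent)  # 3
print(det_dependent)    # 0
```

With dependent rows the determinant vanishes, so $(AA^{\top})^{-1}$ does not exist and the formula of Theorem 1 cannot be applied as written.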

Proof of 2

$A\overrightarrow{x_*}=(AA^{\top})(AA^{\top})^{-1}\overrightarrow{b}=\overrightarrow{b}$

Proof of 3

$\|\overrightarrow{x}\|^2_2\\ =\|\overrightarrow{x}-\overrightarrow{x_*}+\overrightarrow{x_*}\|_2^2\\ =\|\overrightarrow{x}-\overrightarrow{x_*}\|_2^2+\|\overrightarrow{x_*}\|_2^2+2(\overrightarrow{x}-\overrightarrow{x_*})^{\top}\overrightarrow{x_*}$

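Since $\|\overrightarrow{x}-\overrightarrow{x_*}\|_2^2\geqq 0$, if "the third term is $0$ whenever $A\overrightarrow{x}=\overrightarrow{b}$", then $\|\overrightarrow{x}\|_2\geqq \|\overrightarrow{x_*}\|_2$ follows, with equality only when $\overrightarrow{x}=\overrightarrow{x_*}$ (which also gives uniqueness of the minimizer). Below we show that "the third term is $0$ whenever $A\overrightarrow{x}=\overrightarrow{b}$":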

$(\overrightarrow{x}-\overrightarrow{x_*})^{\top} \overrightarrow{x_*}\\ =(\overrightarrow{x}-\overrightarrow{x_*})^{\top} A^{\top}(AA^{\top})^{-1}\overrightarrow{b}\\ =(A\overrightarrow{x}-A\overrightarrow{x_*})^{\top}(AA^{\top})^{-1}\overrightarrow{b}\\ =(\overrightarrow{b}-\overrightarrow{b})^{\top}(AA^{\top})^{-1}\overrightarrow{b}\\ =0$
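As a numerical check of this decomposition (an example chosen here for illustration, not taken from the reference), let $A=\begin{pmatrix}1&0&1\\0&1&1\end{pmatrix}$ and $\overrightarrow{b}=(3,3)^{\top}$. Then $AA^{\top}=\begin{pmatrix}2&1\\1&2\end{pmatrix}$, $(AA^{\top})^{-1}\overrightarrow{b}=(1,1)^{\top}$, and so $\overrightarrow{x_*}=A^{\top}(1,1)^{\top}=(1,1,2)^{\top}$. Another solution is $\overrightarrow{x}=(3,3,0)^{\top}$. The cross term vanishes:

$(\overrightarrow{x}-\overrightarrow{x_*})^{\top}\overrightarrow{x_*}=(2,2,-2)(1,1,2)^{\top}=2+2-4=0$

and the squared norms split as in the proof:

$\|\overrightarrow{x}\|_2^2=18=12+6=\|\overrightarrow{x}-\overrightarrow{x_*}\|_2^2+\|\overrightarrow{x_*}\|_2^2$

so $\|\overrightarrow{x}\|_2=\sqrt{18}>\sqrt{6}=\|\overrightarrow{x_*}\|_2$.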

Reference: Least-norm solutions of undetermined equations

Understanding via projection

Theorem 2

For any $\overrightarrow{x_0}$ with $A\overrightarrow{x_0}=\overrightarrow{b}$, the orthogonal projection of $\overrightarrow{x_0}$ onto $(\mathrm{Ker}\:A)^{\perp}$ is $\overrightarrow{x_*}$.

Here the notation and assumptions in Theorem 2 are the same as in Theorem 1.

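$\mathrm{Ker}\:A$ is the set of all $\overrightarrow{x}$ satisfying $A\overrightarrow{x}=\overrightarrow{0}$ (see the related article on the properties of the kernel of a matrix and how to compute it), and $\perp$ denotes the orthogonal complement.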

Proof

$\mathrm{Im}(A^{\top})=(\mathrm{Ker}\:A)^{\perp}$ holds (a well-known identity).

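So we consider the projection onto $\mathrm{Im}(A^{\top})$. By the formula from the related article on projection matrices (the orthogonal projection onto the column space of a matrix $B$ with linearly independent columns is $B(B^{\top}B)^{-1}B^{\top}$, applied here with $B=A^{\top}$), this corresponds to multiplying by $P=A^{\top}(AA^{\top})^{-1}A$.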

That is, it suffices to show $P\overrightarrow{x_0}=\overrightarrow{x_*}$.

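This follows from $\overrightarrow{x_*}=A^{\top}(AA^{\top})^{-1}\overrightarrow{b}$ and $A\overrightarrow{x_0}=\overrightarrow{b}$: indeed, $P\overrightarrow{x_0}=A^{\top}(AA^{\top})^{-1}A\overrightarrow{x_0}=A^{\top}(AA^{\top})^{-1}\overrightarrow{b}=\overrightarrow{x_*}$.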
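Theorem 2 can also be checked directly on a small example. The following sketch (matrix, solutions, and helper names are assumptions made for illustration) builds $P=A^{\top}(AA^{\top})^{-1}A$ exactly and applies it to several different solutions of the same system $A\overrightarrow{x}=(3,3)^{\top}$. The 2 x 2 inverse helper is only sufficient because this example has two equations.

```python
from fractions import Fraction


def transpose(X):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*X)]


def matmul(X, Y):
    """Return the matrix product X Y."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def inverse_2x2(M):
    """Exact inverse of a 2 x 2 matrix (enough for this two-equation example)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


A = [[1, 0, 1],
     [0, 1, 1]]
At = transpose(A)
# P = A^T (A A^T)^{-1} A, the orthogonal projection onto Im(A^T).
P = matmul(matmul(At, inverse_2x2(matmul(A, At))), A)

# Three different solutions of A x = (3, 3)^T.
solutions = [[3, 3, 0], [0, 0, 3], [4, 4, -1]]
projections = [[sum(p * x for p, x in zip(row, x0)) for row in P] for x0 in solutions]
for x0, proj in zip(solutions, projections):
    # Every solution is projected to the same vector ['1', '1', '2'].
    print(x0, "->", [str(v) for v in proj])
```

All three solutions are mapped to $(1,1,2)^{\top}$, which is the value of $A^{\top}(AA^{\top})^{-1}\overrightarrow{b}$ for this system.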

Intuition for Theorem 2

The theorem can be pictured as in the figure (understanding via projection). Explanation of the figure:

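$\overrightarrow{x_0}$ lies on the solution space of $A\overrightarrow{x}=\overrightarrow{b}$.
That solution space is parallel to $\mathrm{Ker}\:A$, because adding an element of $\mathrm{Ker}\:A$ to a solution gives another solution.
Consider the orthogonal complement $(\mathrm{Ker}\:A)^{\perp}$. It is also orthogonal to the solution space.
The projection $\overrightarrow{x_*}$ of $\overrightarrow{x_0}$ onto $(\mathrm{Ker}\:A)^{\perp}$ is the point of the solution space closest to the origin, so it is the minimum-norm solution.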
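As a small two-dimensional illustration of this picture (an example chosen here, not the one drawn in the figure), let $A=\begin{pmatrix}1&1\end{pmatrix}$ and $\overrightarrow{b}=2$, so that the solution space is the line $x_1+x_2=2$. Then $\mathrm{Ker}\:A$ is spanned by $(1,-1)^{\top}$ and $(\mathrm{Ker}\:A)^{\perp}$ is spanned by $(1,1)^{\top}$. Every solution can be written as

$\overrightarrow{x_0}=(1,1)^{\top}+t(1,-1)^{\top}$

for some real $t$, and its squared length is

$\|\overrightarrow{x_0}\|_2^2=(1+t)^2+(1-t)^2=2+2t^2$

which is smallest at $t=0$, that is, at $(1,1)^{\top}\in(\mathrm{Ker}\:A)^{\perp}$. This agrees with the formula of Theorem 1: $A^{\top}(AA^{\top})^{-1}\overrightarrow{b}=(1,1)^{\top}\cdot\frac{1}{2}\cdot 2=(1,1)^{\top}$.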

The proof of Theorem 1 turned out to be simpler than expected.

Supervisor of this article

マスオ

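Completed graduate studies at the Graduate School of Information Science and Technology, The University of Tokyo / Launched the website 『高校数学の美しい物語』 in 2014 / Books have sold over 50,000 copies in total / Aims to be both "easy to understand" and "never glossing over details".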