私は最近PC(Nvidia GeForce GTX 1080 Ti GPU搭載)でUbuntu 16.04を18.04にアップグレードしました。それ以来、ファイルシステムがこれらの巨大なログファイルから構築されています。コンピュータをシャットダウンするたびに、PCがシャットダウンするまで、無限の数のpcieportメッセージが表示されます。
昨日、マシンを一晩置いたままにしましたが、戻ってきたときに、ディスク領域がすべて使用されているという通知がありました。つまり、_/dev/sda1
_は100%使用されていました。私はdu
- commandを介してこれを見つけることができ、_/var/log/
_フォルダー内のログファイルに問題を特定しました。これには、_350GB
_を超えるログファイルが含まれています。
現在、ログファイルは再構築されており、現在は約_150GB
_の領域を占めています。問題の原因となっているログファイルは、_syslog.1, syslog
_および_kern.log
_です。
私の質問は:この問題の原因と修正方法は?
システムとログファイルの数行に関する情報を以下に示します。私はそれらを再び削除しますが、それらを際限なく削除することは最良の長期的な解決策ではないようです。
_Distributor ID: Ubuntu
Description: Ubuntu 18.04.3 LTS
Release: 18.04
Codename: bionic
nvidia-smi
Thu Aug 15 09:25:53 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 430.40 Driver Version: 430.40 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 108... Off | 00000000:01:00.0 On | N/A |
| 24% 58C P0 67W / 250W | 1373MiB / 11177MiB | 1% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 1298 G /usr/lib/xorg/Xorg 89MiB |
| 0 1337 G /usr/bin/gnome-Shell 50MiB |
| 0 2258 G /usr/lib/xorg/Xorg 726MiB |
| 0 2465 G /usr/bin/gnome-Shell 189MiB |
| 0 14914 G ...e --type=gpu-process --field-trial-hand 154MiB |
| 0 18206 C /usr/lib/libreoffice/program/soffice.bin 137MiB |
+-----------------------------------------------------------------------------+
lspci -vt
-[0000:00]-+-00.0 Intel Corporation 8th Gen Core Processor Host Bridge/DRAM Registers
+-01.0-[01]--+-00.0 NVIDIA Corporation GP102 [GeForce GTX 1080 Ti]
| \-00.1 NVIDIA Corporation GP102 HDMI Audio Controller
+-02.0 Intel Corporation Device 3e92
+-14.0 Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
+-16.0 Intel Corporation 200 Series PCH CSME HECI #1
+-17.0 Intel Corporation 200 Series PCH SATA controller [AHCI mode]
+-1b.0-[02]--
+-1c.0-[03]--
+-1c.4-[04]----00.0 ASMedia Technology Inc. Device 2142
+-1c.7-[05]----00.0 Realtek Semiconductor Co., Ltd. RTL8812AE 802.11ac PCIe Wireless Network Adapter
+-1d.0-[06]--
+-1f.0 Intel Corporation Z370 Chipset LPC/eSPI Controller
+-1f.2 Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
+-1f.3 Intel Corporation 200 Series PCH HD Audio
+-1f.4 Intel Corporation 200 Series/Z370 Chipset Family SMBus Controller
\-1f.6 Intel Corporation Ethernet Connection (2) I219-V
_
syslog.1
_Aug 14 10:14:03 user kernel: [ 10.680132] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680135] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680135] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680136] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680187] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680190] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680190] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680191] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680281] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680284] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680284] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680285] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680374] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680378] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680379] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680380] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680586] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680590] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680591] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680591] pcieport 0000:00:1c.7: [ 0] RxErr
_
syslog
_Aug 15 09:04:23 user kernel: [ 307.590656] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.590836] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.590841] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.590843] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.590844] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591125] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591134] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591135] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591136] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591414] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591419] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591420] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591422] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591607] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591614] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591616] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591617] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591896] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591901] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
_
kern.log
_Aug 14 10:14:03 user kernel: [ 11.219257] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219259] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219260] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219260] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219443] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219448] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219448] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219449] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219714] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219717] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219718] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219718] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219916] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219922] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219923] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219924] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.220101] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.220104] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.220105] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.220105] pcieport 0000:00:1c.7: [ 0] RxErr
_
私はここの手順を使用してすでに問題を解決していると思います:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/152117
対応策:カーネルコマンドラインにpci = noaerを追加します。
(1)/ etc/default/grubを編集して、GRUB_CMDLINE_LINUX_DEFAULTで始まる行にpci = noaerを追加します。次のようになります。
GRUB_CMDLINE_LINUX_DEFAULT = "静かなスプラッシュpci = noaer"
(2)「Sudo update-grub」を実行します
(3)再起動
これらの手順を実装した後、システムをシャットダウンした後、pcieport
- messagesが表示されなくなり、ログファイルのサイズが大幅に増大しなくなりました。
ただし、これによりエラーメッセージの根本的な原因が修正されたかどうかはわかりません...