Ubuntu 16.04 (и 17.04) замораживающийся случайным образом

Мы заказали предварительно созданную машину с ниже спецификаций и установили Ubuntu 17.04 на SSD. Ubuntu случайным образом заморозилась в четырех различных точках (включая при установке анаконды для Python). Мы решили переместиться в 16,04 (так как мы были более знакомы с этой ОС во всяком случае от наших дневных заданий) на том же твердотельном диске и сохраненной проблеме.

Это - вывод системных журналов, коррелируемых со временем одного из замораживаний.

Jun  5 02:22:08 PsertainTech org.gnome.evolution.dataserver.Sources5[1648]: ** (evolution-source-registry:1869): WARNING **: secret_service_search_sync: must specify at least one attribute to match
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736597] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736609]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=7188 
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736613]     (detected by 3, t=15002 jiffies, g=241027, c=241026, q=3366)
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736620] Task dump for CPU 7:
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736622] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736628]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736632]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736636]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736640] Call Trace:
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736650]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736655]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736658]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:22:18 PsertainTech kernel: [ 3648.736662]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752786] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752796]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=27789 
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752799]     (detected by 4, t=60007 jiffies, g=241027, c=241026, q=11266)
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752803] Task dump for CPU 7:
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752805] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752810]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752815]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752819]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752823] Call Trace:
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752833]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752839]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752843]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:25:18 PsertainTech kernel: [ 3828.752848]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:25:28 PsertainTech org.gnome.evolution.dataserver.Sources5[2720]: ** (evolution-source-registry:2988): WARNING **: secret_service_search_sync: must specify at least one attribute to match
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773198] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773210]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=48323 
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773214]     (detected by 0, t=105012 jiffies, g=241027, c=241026, q=19082)
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773221] Task dump for CPU 7:
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773223] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773228]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773232]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773236]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773240] Call Trace:
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773251]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773256]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773259]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:28:18 PsertainTech kernel: [ 4008.773264]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789674] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789686]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=68966 
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789690]     (detected by 0, t=150017 jiffies, g=241027, c=241026, q=26836)
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789696] Task dump for CPU 7:
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789698] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789703]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789707]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789710]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789715] Call Trace:
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789725]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789730]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789733]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:31:18 PsertainTech kernel: [ 4188.789738]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808848] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808861]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=89546 
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808865]     (detected by 3, t=195022 jiffies, g=241027, c=241026, q=34852)
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808871] Task dump for CPU 7:
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808873] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808879]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808883]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808887]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808891] Call Trace:
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808901]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808906]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808909]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:34:18 PsertainTech kernel: [ 4368.808914]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828850] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828862]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=110118 
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828865]     (detected by 9, t=240027 jiffies, g=241027, c=241026, q=42598)
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828871] Task dump for CPU 7:
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828873] swapper/7       R  running task        0     0      1 0x00000008
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828878]  0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828882]  7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828886]  ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828889] Call Trace:
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828900]  [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828904]  [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828907]  [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun  5 02:37:18 PsertainTech kernel: [ 4548.828912]  [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun  5 02:40:18 PsertainTech kernel: [ 4728.843394] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun  5 02:40:18 PsertainTech kernel: [ 4728.843406]     7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=130672 
Jun  5 02:40:18 PsertainTech kernel: [ 4728.843410]     (detected by 3, t=285032 jiffies, g=241027, c=241026, q=50334)

Мы неудачно попробовали c-состояние, фиксируют (GRUB_CMDLINE_LINUX_DEFAULT="quiet splash intel_idle.max_cstate=1").

  • Материнская плата: Asus Intel X99-A II USB 3.1 Series DDR4/Quad CrossFireX&Quad SLI/SATA3&USB3.1
  • Память DDR3/4: 64 ГБ DDR4-2133/2400 PC4-17000/19200 4X16 ГБ
  • Процессор: Intel Core i7-6850K Broadwell-E 6xCore 3.6GHz 2011 v3
  • SSD: твердотельный накопитель серии 500GB Samsung 850 EVO
  • Жесткий диск: Western Digital Black 7200RPM SATA 3 на 2 ТБ 6Gb/s кэш 64 МБ
  • Видеокарта: NVIDIA GeForce GT 730 2GB Видеокарта PCI Express DDR

Как мы можем зафиксировать его?

Обновление:

свободные-h производят:

              total        used        free      shared  buff/cache   available
Mem:            62G        2.1G         59G         55M        1.4G         60G
Swap:           63G          0B         63G

swapon производят: НАЗОВИТЕ ИСПОЛЬЗУЕМЫЙ раздел PRIO/dev/sdb5 РАЗМЕРА ШРИФТА 63.9G 0B-1

sudo blkid вывод:

/dev/sda1: UUID="a8fb3b82-1a85-4377-99e3-20d22e63a451" TYPE="swap" PARTUUID="7f08bb24-9185-47bf-a949-a316a0b63f5b"
/dev/sdb1: UUID="1c22f4a6-4c23-46c1-bdcc-7644ed39e193" TYPE="ext4" PARTUUID="43ee50b9-01"
/dev/sdb5: UUID="c7a7c6b7-e3a2-4942-b4a1-7d66eccb0915" TYPE="swap" PARTUUID="43ee50b9-05"
/dev/sdb6: UUID="8668e4db-4dd9-4b76-8d06-b6241d521800" TYPE="ext4" PARTUUID="43ee50b9-06"

кошка/etc/fstab вывод:

# /etc/fstab: static file system information.
#
# Use 'blkid' to print the universally unique identifier for a
# device; this may be used with UUID= as a more robust way to name devices
# that works even if disks are added and removed. See fstab(5).
#
# <file system> <mount point>   <type>  <options>       <dump>  <pass>
# / was on /dev/sdb1 during installation
UUID=1c22f4a6-4c23-46c1-bdcc-7644ed39e193 /               ext4    errors=remount-ro 0       1
# /home was on /dev/sdb6 during installation
UUID=8668e4db-4dd9-4b76-8d06-b6241d521800 /home           ext4    defaults        0       2
# swap was on /dev/sdb5 during installation
UUID=c7a7c6b7-e3a2-4942-b4a1-7d66eccb0915 none            swap    sw              0       0

кошка/etc/crypt* вывод: кошка: '/etc/crypt* ': Никакой такой файл или каталог

ls-alh / своп-файл производят: ls: не может получить доступ к '/swapfile': Никакой такой файл или каталог

ОБНОВЛЕНИЕ 2

Jun  8 08:29:22 psertain kernel: [37593.535883] nouveau 0000:01:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Jun  8 08:29:22 psertain kernel: [37593.535898] nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Jun  8 08:29:22 psertain kernel: [37593.535911] nouveau 0000:01:00.0: fifo: channel 3: killed
Jun  8 08:29:22 psertain kernel: [37593.535918] nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Jun  8 08:29:22 psertain kernel: [37593.535998] nouveau 0000:01:00.0: Xorg[1105]: channel 3 killed!
Jun  8 08:29:33 psertain kernel: [37604.030229] [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:37:head-0] hw_done timed out
Jun  8 08:29:43 psertain kernel: [37614.269426] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:37:head-0] hw_done timed out
Jun  8 08:29:53 psertain kernel: [37624.508634] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:37:head-0] flip_done timed out
Jun  8 08:30:59 psertain rsyslogd: [origin software="rsyslogd" swVersion="8.16.0" x-pid="909" x-info="http://www.rsyslog.com"] start
3
задан 6 January 2019 в 03:00

1 ответ

Таким образом, это складывается, я должен был переустановить Linux и включить 340 версий драйвера Nvidia, а не, чем более новая протестированная версия и до сих пор никакие замораживания в течение 24 часов. Выход из резервного устройства также работает. Надо надеяться, это продолжается. – @sousuffer

1
ответ дан 1 December 2019 в 17:30

Другие вопросы по тегам:

Похожие вопросы: