Centos 7 Random Panic / Oops

1

Finalmente, depois de muitas alterações na configuração, recebi um despejo completo de falhas que pude inspecionar:

  KERNEL: /usr/lib/debug/lib/modules/3.10.0-327.28.3.el7.x86_64/vmlinux
DUMPFILE: vmcore  [PARTIAL DUMP]
    CPUS: 64
    DATE: Wed Aug 24 20:11:12 2016
  UPTIME: 02:16:27
LOAD AVERAGE: 1.00, 8.29, 7.26
   TASKS: 1175
NODENAME: dev1.soft.com
 RELEASE: 3.10.0-327.28.3.el7.x86_64
 VERSION: #1 SMP Thu Aug 18 19:05:49 UTC 2016
 MACHINE: x86_64  (2260 Mhz)
  MEMORY: 256 GB
   PANIC: "BUG: unable to handle kernel paging request at 00007f3b31c9a798"

Em Crash eu corri bt

crash> bt
PID: 11768  TASK: ffff8840173d0000  CPU: 38  COMMAND: "cp"
 #0 [ffff884017277660] machine_kexec at ffffffff81051e9b
 #1 [ffff8840172776c0] crash_kexec at ffffffff810f27c2
 #2 [ffff884017277790] oops_end at ffffffff8163f448
 #3 [ffff8840172777b8] no_context at ffffffff8162f588
 #4 [ffff884017277808] __bad_area_nosemaphore at ffffffff8162f61e
 #5 [ffff884017277850] bad_area at ffffffff8162f942
 #6 [ffff884017277878] __do_page_fault at ffffffff81642225
 #7 [ffff8840172778d8] do_page_fault at ffffffff81642353
 #8 [ffff884017277900] page_fault at ffffffff8163e648
    [exception RIP: radix_tree_next_chunk+323]
    RIP: ffffffff812f8e83  RSP: ffff8840172779b8  RFLAGS: 00010246
    RAX: 0000000000000000  RBX: 0000000000000000  RCX: 0000000000000006
    RDX: 00007f3b31c9a770  RSI: ffff884017277a00  RDI: 0000000000000006
    RBP: ffff8840172779e0   R8: ffff884017277a00   R9: 0000000000000000
    R10: 0000000000000000  R11: 0000000000000220  R12: 0000000000000040
    R13: 0000000000000000  R14: 0000000000000012  R15: ffff880fddf9cfd8
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
 #9 [ffff8840172779e8] __find_get_pages at ffffffff8116c065
#10 [ffff884017277a50] __pagevec_lookup at ffffffff8117880e
#11 [ffff884017277a68] truncate_inode_pages_range at ffffffff81179240
#12 [ffff884017277bb8] truncate_pagecache at ffffffff81179927
#13 [ffff884017277be0] ext4_setattr at ffffffffa0226889 [ext4]
#14 [ffff884017277c40] notify_change at ffffffff811fbd09
#15 [ffff884017277c88] do_truncate at ffffffff811dce03
#16 [ffff884017277d00] do_last at ffffffff811ec3a2
#17 [ffff884017277db0] path_openat at ffffffff811eed02
#18 [ffff884017277e48] do_filp_open at ffffffff811f04cb
#19 [ffff884017277f18] do_sys_open at ffffffff811dde73
#20 [ffff884017277f70] sys_open at ffffffff811ddf8e
#21 [ffff884017277f80] system_call_fastpath at ffffffff81646b49
    RIP: 00007fdad8f142b0  RSP: 00007ffdda059160  RFLAGS: 00010246
    RAX: 0000000000000002  RBX: ffffffff81646b49  RCX: 00007fdad8f13e64
    RDX: 0000000000000000  RSI: 0000000000000201  RDI: 00007ffdda05a385
    RBP: 00007ffdda059550   R8: 00007ffdda059730   R9: 00000000000001b4
    R10: 00007ffdda058ee0  R11: 0000000000000246  R12: ffffffff811ddf8e
    R13: ffff884017277f78  R14: 00007ffdda05a34c  R15: 00007ffdda059730
    ORIG_RAX: 0000000000000002  CS: 0033  SS: 002b

crash> files
PID: 11768  TASK: ffff8840173d0000  CPU: 38  COMMAND: "cp"
ROOT: /    CWD: /home/dev/OPT-10.7.1/data
 FD       FILE            DENTRY           INODE       TYPE PATH
  0 ffff8830264e1e00 ffff882028c00240 ffff882029270850 CHR  /dev/null
  1 ffff8830264e1c00 ffff882028c00240 ffff882029270850 CHR  /dev/null
  2 ffff8830264e1c00 ffff882028c00240 ffff882029270850 CHR  /dev/null
  3 ffff8830287ed800 ffff8820185026c0 ffff88101f699940 REG  /home/dev/OPT-10.7.1/files/Ratings.txt

A dificuldade é que eu não sei porque não foi possível cp o arquivo. Ele faz isso diariamente, às vezes várias vezes ao dia, e o arquivo tem cerca de 7,5Gb.

O que posso fazer para obter mais informações e recuperar a caixa?

ATUALIZADO: executei o fsck no caminho / home e parece um bom código de saída 0:

ATUALIZADO2:Eucorrinovamentecomocódigoderetornodosinalizador-f0:

    
por jjhayter 31.08.2016 / 02:50

0 respostas