Eu possuo um watchdog?

11

Muitas vezes, quando faço uma reinicialização, recebo a seguinte mensagem de erro:

kernel: watchdog watchdog0: watchdog did not stop!

Eu tentei descobrir mais sobre o watchdog fazendo man watchdog , mas ele diz que não há entrada manual. Eu tentei yum list watchdog e descobri que não foi instalado. No entanto, quando olho para o diretório /dev , na verdade encontrei dois watchdogs:

watchdog e watchdog0

Estou curioso. Eu realmente possuo algum watchdogs? Por que o kernel reclama que não parou quando eu reiniciei?

    
por Question Overflow 24.08.2014 / 19:30

1 resposta

7

A maioria dos hardwares de PC modernos inclui recursos de timer watchdog. Você pode ler mais sobre eles aqui na wikipedia: Watchdog Timers . Também da documentação do kernel do Linux:

excerto - link

A Watchdog Timer (WDT) is a hardware circuit that can reset the computer system in case of a software fault. You probably knew that already.

Usually a userspace daemon will notify the kernel watchdog driver via the /dev/watchdog special device file that userspace is still alive, at regular intervals. When such a notification occurs, the driver will usually tell the hardware watchdog that everything is in order, and that the watchdog should wait for yet another little while to reset the system. If userspace fails (RAM error, kernel bug, whatever), the notifications cease to occur, and the hardware watchdog will reset the system (causing a reboot) after the timeout occurs.

The Linux watchdog API is a rather ad-hoc construction and different drivers implement different, and sometimes incompatible, parts of it. This file is an attempt to document the existing usage and allow future driver writers to use it as a reference.

Este SO Q & A intitulado, Quem está atualizando o watchdog de hardware no Linux? , abrange a ligação entre o kernel do Linux e o temporizador do watchdog de hardware.

O que acontece com o pacote watchdog?

A descrição no RPM deixa isso bem claro, IMO. O daemon watchdog pode atuar como um watchdog de software ou pode interagir com a implementação de hardware.

trecho da descrição do RPM

The watchdog program can be used as a powerful software watchdog daemon or may be alternately used with a hardware watchdog device such as the IPMI hardware watchdog driver interface to a resident Baseboard Management Controller (BMC). watchdog periodically writes to /dev/watchdog; the interval between writes to /dev/watchdog is configurable through settings in the watchdog sysconfig file.

This configuration file is also used to set the watchdog to be used as a hardware watchdog instead of its default software watchdog operation. In either case, if the device is open but not written to within the configured time period, the watchdog timer expiration will trigger a machine reboot. When operating as a software watchdog, the ability to reboot will depend on the state of the machine and interrupts.

When operating as a hardware watchdog, the machine will experience a hard reset (or whatever action was configured to be taken upon watchdog timer expiration) initiated by the BMC.

    
por 24.08.2014 / 19:37