I'm converting my standalone MongoDB instance into a replica set. I have added one more member (I want to add two more members later and then shut down the primary server).
My primary MongoDB is running 2.2.3, and the new replica member is running the latest version of MongoDB, 2.6.4.
Both databases are running on Ubuntu Server 14.04 in Microsoft Azure, in the same affinity group. (The VM size is A2.)
I raised the ulimit values for "nofile" and "nproc" to 65535 after seeing the advice from MMS monitoring, but only on the secondaries, to avoid the downtime of restarting the machines. Is this necessary?
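To check which limits a running mongod process actually has, its /proc/<pid>/limits file can be read on Linux. Below is a minimal sketch in Python. The helper names parse_limits and read_limits and the SAMPLE text are illustrative assumptions, not output captured from my servers.

```python
import re

def parse_limits(text):
    """Parse the contents of /proc/<pid>/limits into {name: (soft, hard)}."""
    limits = {}
    for line in text.splitlines()[1:]:  # skip the header row
        # Columns are separated by runs of two or more spaces.
        fields = re.split(r"\s{2,}", line.strip())
        if len(fields) >= 3:
            limits[fields[0]] = (fields[1], fields[2])
    return limits

def read_limits(pid):
    """Read the effective limits of a running process (Linux only)."""
    with open(f"/proc/{pid}/limits") as fh:
        return parse_limits(fh.read())

# Hypothetical sample in the /proc/<pid>/limits layout, for illustration only.
SAMPLE = """Limit                     Soft Limit           Hard Limit           Units
Max cpu time              unlimited            unlimited            seconds
Max processes             65535                65535                processes
Max open files            65535                65535                files
"""

limits = parse_limits(SAMPLE)
print(limits["Max open files"], limits["Max processes"])
```

Running read_limits with the mongod PID on each member would show whether the 65535 values are really in effect for that process, since limits set in a shell do not necessarily apply to a service that is already running.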
I have somewhere above 80 million documents in the primary database, and it is running in live production. Could that be the cause?
After a few hours of syncing data, the log showed the following errors and the sync started over again. It keeps looping.
[rsSync] done building bottom layer, going to commit
[rsSync] old journal file will be removed: /datadrive/data/journal/j._9
[rsSync] build index done. scanned 55381316 total records. 1348.97 secs
[conn221] serverStatus was very slow: { after basic: 0, after asserts: 0, after backgroundFlushing: 0, after connections: 0, after cursors: 0, after dur: 0, after extra_info: 0, after globalLock: 0, after indexCounters: 0, after locks: 0, after network: 0, after opcounters: 0, after opcountersRepl: 0, after recordStats: 744214, after repl: 744214, at end: 744214 }
[conn221] command admin.$cmd command: serverStatus { serverStatus: 1 } keyUpdates:0 numYields:0 locks(micros) r:31 reslen:3920 1243515ms
[conn228] serverStatus was very slow: { after basic: 0, after asserts: 0, after backgroundFlushing: 0, after connections: 0, after cursors: 0, after dur: 0, after extra_info: 0, after globalLock: 0, after indexCounters: 0, after locks: 0, after network: 0, after opcounters: 0, after opcountersRepl: 0, after recordStats: 634932, after repl: 634932, at end: 634932 }
[conn228] command admin.$cmd command: serverStatus { serverStatus: 1 } keyUpdates:0 numYields:0 locks(micros) r:33 reslen:3920 1073310ms
[conn235] serverStatus was very slow: { after basic: 0, after asserts: 0, after backgroundFlushing: 0, after connections: 0, after cursors: 0, after dur: 0, after extra_info: 0, after globalLock: 0, after indexCounters: 0, after locks: 0, after network: 0, after opcounters: 0, after opcountersRepl: 0, after recordStats: 578551, after repl: 578551, at end: 578551 }
[conn235] command admin.$cmd command: serverStatus { serverStatus: 1 } keyUpdates:0 numYields:0 locks(micros) r:28 reslen:3920 963376ms
[conn194] SocketException handling request, closing client connection: 9001 socket exception [SEND_ERROR] server [ServerIp:1250]
[conn252] SocketException handling request, closing client connection: 9001 socket exception [SEND_ERROR] server [ServerIp:1248]
[rsSync] Socket say send() errno:110 Connection timed out ServerIp:27017
[rsSync] replSet initial sync exception: 9001 socket exception [SEND_ERROR] server [Serverip:27017] 8 attempts remaining
[rsSync] replSet initial sync pending
[rsSync] replSet syncing to: [ServerAddress]:27017
[rsSync] replSet initial sync drop all databases
[rsSync] dropAllDatabasesExceptLocal 2
[rsSync] removeJournalFiles
[rsSync] replSet initial sync clone all databases
[rsSync] replSet initial sync cloning db: PkgsKeyValues
[FileAllocator] allocating new datafile /datadrive/data/PkgsKeyValues.ns, filling with zeroes...
[FileAllocator] allocating new datafile /datadrive/data/PkgsKeyValues.3, filling with zeroes...
[FileAllocator] done allocating datafile /datadrive/data/PkgsKeyValues.3, size: 512MB, took 0.124 secs
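To make the "serverStatus was very slow" entries above easier to read, here is a small Python sketch that finds the section where the time was spent. It assumes the numbers are cumulative milliseconds, which matches the "ms" suffix on the command lines. The names slow_sections and first_stall are hypothetical helpers, not part of any MongoDB tool.

```python
import re

def slow_sections(line):
    """Return [(section, cumulative_ms)] from a 'serverStatus was very slow' line."""
    pairs = re.findall(r"(?:after|at) (\w+): (\d+)", line)
    return [(name, int(ms)) for name, ms in pairs]

def first_stall(line, threshold_ms=1000):
    """Return (section, delta_ms) for the first section whose own cost
    (cumulative time minus the previous cumulative time) reaches threshold_ms."""
    previous = 0
    for name, elapsed in slow_sections(line):
        if elapsed - previous >= threshold_ms:
            return name, elapsed - previous
        previous = elapsed
    return None

# The [conn221] entry from the log above, joined onto one line.
SAMPLE = (
    "[conn221] serverStatus was very slow: { after basic: 0, after asserts: 0, "
    "after backgroundFlushing: 0, after connections: 0, after cursors: 0, "
    "after dur: 0, after extra_info: 0, after globalLock: 0, "
    "after indexCounters: 0, after locks: 0, after network: 0, "
    "after opcounters: 0, after opcountersRepl: 0, after recordStats: 744214, "
    "after repl: 744214, at end: 744214 }"
)

section, delta_ms = first_stall(SAMPLE)
print(section, round(delta_ms / 60000, 1), "minutes")
```

For all three entries in the log, the whole delay appears at the recordStats section and nothing is added after it, so under the assumption above the serverStatus calls were stuck there for roughly 10 to 12 minutes each before the socket timeouts appeared.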