Channel: Symantec Connect - Backup and Recovery - Discussions

NDMP backups are failing with media write errors

I need a solution

Hi

We have NDMP backups configured for both an EMC Celerra and a Sun 7410. Some of the shares on the Sun filer back up fine, but others start normally and then fail with media write errors (status 84). The Celerra backups are not working at all.

NetBackup version: 7.5.0.3

OS version: Red Hat 6.2

Any help would be appreciated. I have opened a case with Symantec but have not found a solution yet.

I can connect to both filers from the NBU servers. For example, against the Celerra:

[root@slprdsan6019 bptm]# tpautoconf -verify sl-emc-c1-dm2
Connecting to host "sl-emc-c1-dm2" as user "ndmp"...
Waiting for connect notification message...
Opening session--attempting with NDMP protocol version 4...
Opening session--successful with NDMP protocol version 4
  host supports TEXT authentication
  host supports MD5 authentication
Getting MD5 challenge from host...
Logging in using MD5 method...
Host info is:
  host name "server_2"
  os type "DartOS"
  os version "EMC Celerra File Server.T.6.0.51.6"
  host id "abc1997"
Login was successful
Host supports LOCAL backup/restore
Host supports 3-way backup/restore
 

Sun share errors:

11/08/2012 15:02:11 - Info bptm (pid=31869) using 262144 data buffer size
11/08/2012 15:02:11 - Info bptm (pid=31869) start backup
11/08/2012 15:02:11 - Info bptm (pid=31869) Waiting for mount of media id 100174 (copy 1) on server slprdsan6019.lhr.stor.s.nokia.com.
11/08/2012 15:02:11 - granted resource  slprdsan6019.lhr.stor.s.nokia.com.NBU_CLIENT.MAXJOBS.sl-sun7410-1a
11/08/2012 15:02:11 - granted resource  slprdsan6019.lhr.stor.s.nokia.com.NBU_POLICY.MAXJOBS.sun-ndmp-vshards-prod-pr3
11/08/2012 15:02:11 - granted resource  100174
11/08/2012 15:02:11 - granted resource  HP.ULTRIUM4-SCSI.000
11/08/2012 15:02:11 - granted resource  slprdsan1001-hcart-robot-tld-0-sun7410a
11/08/2012 15:02:11 - estimated 0 kbytes needed
11/08/2012 15:02:11 - Info nbjm (pid=15579) started backup (backupid=sl-sun7410-1a_1352386931) job for client sl-sun7410-1a, policy sun-ndmp-vshards-prod-pr3, schedule Daily_Clnc on storage unit slprdsan1001-hcart-robot-tld-0-sun7410a
11/08/2012 15:02:11 - started process bpbrm (pid=31866)
11/08/2012 15:02:11 - connecting
11/08/2012 15:02:11 - connected; connect time: 0:00:00
11/08/2012 15:02:11 - mounting 100174
11/08/2012 15:03:16 - Info bptm (pid=31869) media id 100174 mounted on drive index 0, drivepath /dev/rmt/3n, drivename HP.ULTRIUM4-SCSI.000, copy 1
11/08/2012 15:03:16 - mounted 100174; mount time: 0:01:05
11/08/2012 15:03:21 - positioning 100174 to file 38
11/08/2012 15:04:25 - Info ndmpagent (pid=31868) sl-sun7410-1a: Direct Access Restore information is supported
11/08/2012 15:04:25 - positioned 100174; position time: 0:01:04
11/08/2012 15:04:25 - begin writing
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: Backing up "/export/vshards-pr-21".
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: Tape record size: 262144.
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: File history: Y.
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: Date of the last level '0': Sat Sep  8 09:00:20 2012.
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: Date of this level '1': Thu Nov  8 15:04:25 2012.
11/08/2012 15:04:26 - Info ndmpagent (pid=31868) sl-sun7410-1a: Update: TRUE.
11/08/2012 15:09:43 - Error ndmpagent (pid=31868) MOVER_HALTED media write error - reason = 5 (NDMP_MOVER_HALT_MEDIA_ERROR)
11/08/2012 15:09:43 - Info ndmpagent (pid=31868) sl-sun7410-1a: Runtime [/export/vshards-pr-21] 872876032 bytes (872876032): 312 seconds
11/08/2012 15:09:43 - Error ndmpagent (pid=31868) NDMP backup failed, path = /export/vshards-pr-21
11/08/2012 15:09:43 - Error ndmpagent (pid=31868) sl-sun7410-1a: Filesystem traverse error.
11/08/2012 15:09:45 - Error bptm (pid=31869) cannot write image to media id 100174, drive index 0
11/08/2012 15:09:45 - Error bptm (pid=31869) io_ioctl_ndmp (MTBSF) failed on media id 100174, drive index 0, return code 7 (NDMP_IO_ERR) (bptm.c.8757)
11/08/2012 15:09:50 - Info bptm (pid=31869) EXITING with status 84 <----------
11/08/2012 15:09:50 - Info ndmpagent (pid=0) done. status: 84: media write error
11/08/2012 15:09:50 - end writing; write time: 0:05:25
11/08/2012 15:19:50 - Info bpbrm (pid=516) sl-sun7410-1a is the host to backup data from
11/08/2012 15:19:50 - Info bpbrm (pid=516) reading file list from client
11/08/2012 15:19:50 - Info bpbrm (pid=516) starting ndmpagent on client
11/08/2012 15:19:50 - Info ndmpagent (pid=518) Backup started
11/08/2012 15:19:50 - Info bpbrm (pid=516) bptm pid: 519
11/08/2012 15:19:51 - Info bptm (pid=519) start
11/08/2012 15:19:51 - Info bptm (pid=519) using 32 data buffers
11/08/2012 15:19:51 - Info bptm (pid=519) using 262144 data buffer size
11/08/2012 15:19:51 - Info bptm (pid=519) start backup
11/08/2012 15:19:51 - Info bptm (pid=519) Waiting for mount of media id 100174 (copy 1) on server slprdsan6019.lhr.stor.s.nokia.com.
media write error  (84)
 

Celerra errors:

11/08/2012 14:32:15 - Info nbjm (pid=15579) starting backup job (jobid=4319) for client sl-emc-c1-dm2, policy emc-ndmp-mapr-access-prod, schedule Weekly_Full
11/08/2012 14:32:16 - Info bpbrm (pid=29762) sl-emc-c1-dm2 is the host to backup data from
11/08/2012 14:32:16 - Info bpbrm (pid=29762) reading file list from client
11/08/2012 14:32:16 - Info bpbrm (pid=29762) starting ndmpagent on client
11/08/2012 14:32:16 - Info ndmpagent (pid=29764) Backup started
11/08/2012 14:32:16 - Info bpbrm (pid=29762) bptm pid: 29765
11/08/2012 14:32:16 - Info bptm (pid=29765) start
11/08/2012 14:32:16 - estimated 0 kbytes needed
11/08/2012 14:32:16 - Info nbjm (pid=15579) started backup (backupid=sl-emc-c1-dm2_1352385136) job for client sl-emc-c1-dm2, policy emc-ndmp-mapr-access-prod, schedule Weekly_Full on storage unit slprdsan0021-hcart-robot-tld-0-emc-c1-dm2
11/08/2012 14:32:16 - started process bpbrm (pid=29762)
11/08/2012 14:32:16 - connecting
11/08/2012 14:32:16 - connected; connect time: 0:00:00
11/08/2012 14:32:17 - Info bptm (pid=29765) using 32 data buffers
11/08/2012 14:32:17 - Info bptm (pid=29765) using 1048576 data buffer size
11/08/2012 14:32:17 - Info bptm (pid=29765) start backup
11/08/2012 14:32:17 - Info bptm (pid=29765) Waiting for mount of media id 100118 (copy 1) on server slprdsan6019.lhr.stor.s.nokia.com.
11/08/2012 14:32:17 - mounting 100118
11/08/2012 14:47:24 - Error bptm (pid=29765) error requesting media, TpErrno = Robot operation failed
11/08/2012 14:47:24 - Warning bptm (pid=29765) media id 100118 load operation reported an error
11/08/2012 14:47:24 - current media 100118 complete, requesting next media Any
11/08/2012 14:47:59 - granted resource  100118
11/08/2012 14:47:59 - granted resource  HP.ULTRIUM4-SCSI.001
11/08/2012 14:47:59 - granted resource  slprdsan0021-hcart-robot-tld-0-emc-c1-dm2
11/08/2012 14:48:00 - Info bptm (pid=29765) Waiting for mount of media id 100118 (copy 1) on server slprdsan6019.lhr.stor.s.nokia.com.
11/08/2012 14:48:00 - mounting 100118
11/08/2012 14:53:06 - Info ndmpagent (pid=0) done
11/08/2012 14:53:06 - Error bptm (pid=29765) media manager terminated during mount of media id 100118, possible media mount timeout
11/08/2012 14:53:06 - Error bptm (pid=29765) media manager terminated by parent process
11/08/2012 14:53:06 - Error ndmpagent (pid=29764) NDMP backup failed, path = UNKNOWN
11/08/2012 14:53:06 - Info ndmpagent (pid=0) done. status: 150: termination requested by administrator
11/08/2012 14:53:06 - Error ndmpagent (pid=29764) connection 0x12f6fc0 ndmp_message_process_one failed, status = 18 (NDMP_XDR_DECODE_ERR)
11/08/2012 14:53:06 - Error ndmpagent (pid=29764) connection 0x12f9790 ndmp_message_process_one failed, status = 18 (NDMP_XDR_DECODE_ERR)
11/08/2012 14:53:06 - end writing
termination requested by administrator  (150)
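
To make job details like the two logs above easier to scan, the Error and Warning lines can be pulled out with a short script. This is only a sketch: the regular expression is an assumption based on the line layout in the output pasted here, and `extract_problems` is a hypothetical helper name, not part of NetBackup.

```python
import re

# Assumed layout, taken from the job details in this post:
# "11/08/2012 15:09:43 - Error ndmpagent (pid=31868) message..."
LINE_RE = re.compile(
    r"^(?P<date>\d{1,2}/\d{2}/\d{4}) (?P<time>\d{2}:\d{2}:\d{2}) - "
    r"(?P<severity>Error|Warning) (?P<process>\w+) "
    r"\(pid=(?P<pid>\d+)\) (?P<message>.*)$"
)


def extract_problems(log_text):
    """Return (time, severity, process, message) for each Error/Warning line."""
    problems = []
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            problems.append(
                (match["time"], match["severity"], match["process"], match["message"])
            )
    return problems


# A few lines copied from the logs above.
sample = """\
11/08/2012 15:04:25 - begin writing
11/08/2012 15:09:43 - Error ndmpagent (pid=31868) MOVER_HALTED media write error - reason = 5 (NDMP_MOVER_HALT_MEDIA_ERROR)
11/08/2012 15:09:45 - Error bptm (pid=31869) cannot write image to media id 100174, drive index 0
11/08/2012 14:47:24 - Warning bptm (pid=29765) media id 100118 load operation reported an error
"""

for time, severity, process, message in extract_problems(sample):
    print(f"{time} {severity:<7} {process:<9} {message}")
```

Lines without an Error or Warning severity (mount, positioning, "granted resource" and so on) are skipped, so what is left is the sequence of failures in the order they were logged.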

