Note: This was originally posted by an inactive account. Content was preserved by moving under an admin account.
Originally posted by: stonysmith
You are welcome to bring the question up here, but it might be best for you to have a chat with your Technical Account Manager to explore what solutions best fit your needs.
What level of Disaster Recovery are you looking for?
1) continuation of a running node on a second server
2) continuation of a running graph (controller) on a second server
3) partial failure of a FARM
4) restart of a failed graph from the middle
5) complete rerun of a failed graph from the start
Situation 1 is usually only available on machines with special hardware and operating systems.
LAE does not currently have a facility to handle situations 2 or 3.
Situations 4 and 5 can be implemented, but both are dependent upon Control-M, not LAE.
An important part of any DR consideration is whether a second server has access to the temporary disk files.
Common hardware configurations involve SAN/NAS storage, and that alone represents a possible point of failure.
The connection to the disk space must be available on the 'recovery' server or you won't be able to implement a restart facility.
====
When you execute the LAE Controller, you can produce an output BRS by using the -dump filename.brs command-line option.
That BRS can then be used to re-start a failed graph, assuming that the referenced temp files are available to the second server.
Note: the temp files must be available under exactly the same filenames, so any disk mounts must use the same names on both machines.
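
Before attempting a restart on the second server, it may be worth checking that the temp files really are visible there under the same paths. The following is only a rough sketch, not part of LAE: the function name missing_temp_files is made up for illustration, and it assumes you already have the list of temp file paths from somewhere (how to obtain them from the BRS is not covered here).

```python
import os
from typing import Iterable, List


def missing_temp_files(paths: Iterable[str]) -> List[str]:
    """Return the paths that are NOT readable regular files on this machine.

    Hypothetical helper: 'paths' is assumed to be the list of temp file
    names the failed graph refers to, supplied by the caller.
    """
    missing = []
    for path in paths:
        # The path must match exactly, so no normalisation is attempted here.
        if not (os.path.isfile(path) and os.access(path, os.R_OK)):
            missing.append(path)
    return missing
```

Run on the recovery server, an empty result suggests the mounts line up with the primary machine. Any path returned points to a mount that is missing, named differently, or not readable by the account running the check.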