As Great East Japan Earthquake occurred at 11 March 2011 suffered a heavy loss, Japan is one of the most risky country stroke by an earthquake. Therefore, it is a social mission for enterprises in Japan, especially for telecommunication careers like us, to establish disaster recovery plan against a great earthquake. One way to achieve disaster recovery is locating a backup system geographically away from primary system. Authors have invented D3A (Distributed Data Driven Architecture) that makes thousands of small IA servers cooperated and derives high-performance. We have introduced the technology in commercial large-scale OSS (Operation Support System) to maintain tens of thousands of NE (Network Element) which constitute our telecommunication network. We have already possessed a backup system of the OSS for disaster recovery purpose. However, the backup system leads to great rise in TCO (Total Cost of Ownership) since it also consists of more than 1,000 machines. In this paper, we discuss about an autonomously continuous distributed disaster recovery system that is made up by several locations and every location is considered as primary system equivalently, and outage of one location does not impact on overall service of the system.
Published in:
Network Operations and Management Symposium (APNOMS), 2011 13th Asia-Pacific
Date of Conference: 21-23 Sept. 2011