Quinn Pham, Danila Seliayeu, et al.
CASCON 2024
In response to the strong desire of customers to be provided with advance notice of unplanned outages, techniques were developed that detect the occurrence of software aging due to resource exhaustion, estimate the time remaining until the exhaustion reaches a critical level, and automatically perform proactive software rejuvination of an application, process group, or entire operating system. The resulting techniques are very general and can capture a multitude of cluster system characteristics, failure behavior, and performability measures.
Quinn Pham, Danila Seliayeu, et al.
CASCON 2024
David S. Kung
DAC 1998
Raymond Wu, Jie Lu
ITA Conference 2007
Bowen Zhou, Bing Xiang, et al.
SSST 2008