US 7,581,003 B2
System and method for automatic recovery from fault conditions in networked computer services
Randall J. Macbeth, West Edmonds, Wash. (US); and Marcelo Aizenstein Furman Calbucci, Redmond, Wash. (US)
Assigned to Microsoft Corporation, Redmond, Wash. (US)
Filed on Dec. 31, 2003, as Appl. No. 10/748,675.
Prior Publication US 2005/0193225 A1, Sep. 01, 2005
Int. Cl. G06F 15/173 (2006.01)
U.S. Cl. 709—225  [709/217; 707/6] 31 Claims
OG exemplary drawing
 
1. A system for monitoring a networked computer service for fault recovery, the networked computer service comprising a set of features, the system comprising:
an input interface to receive network status data from a network monitor monitoring a computer services network, wherein network status data comprises at least one of page latency data, processor utilization data, connection data and storage data;
a control engine in a server device, the control engine communicating with the input interface to receive the network status data and automatically generate control commands to dynamically adjust the set of features based on a fault condition detected in the network status data, the fault condition associated with one or more features within the set of features, wherein the set of features normally provide a plurality of panels of information for presentation on one or more web pages provided by the networked computer service to one or more users, each feature corresponding to one or more of the plurality of panels of information, and wherein the fault condition comprises undesired performance degradation of the one or more features; and
an output interface, the output interface communicating with the control engine and the computer services network, to transmit the control commands to the computer services network to dynamically adjust the set of features in response to the detection of the fault condition by deactivating the one or more features associated with the fault condition while maintaining active features in the set of features to continue to provide the one or more users with a portion of the networked computer service, the portion of the networked computer service comprising the active features while each of the one or more panels associated with the one or more features associated with the fault condition are at least temporarily removed from the networked computer service, such that the one or more web pages only include panels of information associated with the active features.