<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
  <channel>
    <title>Recent Posts in ''Distributed' circuit breakers: co-ordinate or not?' | Pragmatic Forums</title>
    <link>http://forums.pragprog.com/forums/46/topics/262</link>
    <language>en-us</language>
    <ttl>60</ttl>
    <description></description>
    <item>
      <title>'Distributed' circuit breakers: co-ordinate or not? posted by Oliver Kamps @ Thu, 14 Feb 2008 08:55:53 -0000</title>
      <description>&lt;p&gt;Hi Mike,&lt;/p&gt;


	&lt;p&gt;Thanks for sharing your thoughts so quickly on this.&lt;/p&gt;


	&lt;p&gt;Your reasoning makes a lot of sense to me; I was leaning in the same direction but hadn&amp;#8217;t thought things through that far.&lt;/p&gt;


	&lt;p&gt;Cheers,&lt;br /&gt;Oliver&lt;/p&gt;</description>
      <pubDate>Thu, 14 Feb 2008 08:55:53 -0000</pubDate>
      <guid isPermaLink="false">forums.pragprog.com:46:262:2292</guid>
      <author>Oliver Kamps</author>
      <link>http://forums.pragprog.com/forums/46/topics/262</link>
    </item>
    <item>
      <title>'Distributed' circuit breakers: co-ordinate or not? posted by Michael Nygard @ Thu, 14 Feb 2008 05:40:54 -0000</title>
      <description>&lt;p&gt;Oliver,&lt;/p&gt;


	&lt;p&gt;The prospect of coordinating circuit breakers makes e very nervous, for a couple of reasons. Coordinating across the circuit breakers would either be done via point-to-point connections, which suggests a scalability problem, or via broadcast messages. (Broadcasts would not present a scalability problem.)&lt;/p&gt;


	&lt;p&gt;I see a second, slightly more subtle problem. Suppose a farm of 10 servers is calling a back end system. Due to an external event, maybe load-related, maybe human-error, whatever, disrupts the back end.  If the first circuit breaker causes all 10 to stop making calls, then when the time limit passes, all 10 will start making calls at the same time.  This can cause a surge in load on the newly restored back end system, exactly the same as when the power company restores service after a summertime blackout, only to have thousands of air conditioners turn on immediately. The surge in demand can destabilize the back end.&lt;/p&gt;


	&lt;p&gt;Finally, coordinating actions across the entire tier will tend to make the whole cluster react.  If you assume that the circuit breaker trips only because of a problem with the back end, then communication between the circuit breakers would save a little bit of time.  On the other hand, I have seen problems affect one host, disrupting its connection to the back end, thus causing circuit breakers to trip.  If the problem is confined to that one host, you would not want the other circuit breakers to also flip open.  (Imagine, for example, a failed &lt;span class="caps"&gt;NIC&lt;/span&gt; on that host.  It still makes sense to protect the application by aborting the calls, even though it&amp;#8217;s not strictly a problem with the back end system.)&lt;/p&gt;


	&lt;p&gt;So, in summary, I would choose not to have the circuit breakers communicate their status with each other.&lt;/p&gt;


	&lt;p&gt;Cheers,&lt;br /&gt;-Mike Nygard&lt;/p&gt;</description>
      <pubDate>Thu, 14 Feb 2008 05:40:54 -0000</pubDate>
      <guid isPermaLink="false">forums.pragprog.com:46:262:2290</guid>
      <author>Michael Nygard</author>
      <link>http://forums.pragprog.com/forums/46/topics/262</link>
    </item>
    <item>
      <title>'Distributed' circuit breakers: co-ordinate or not? posted by Oliver Kamps @ Wed, 13 Feb 2008 16:14:48 -0000</title>
      <description>&lt;p&gt;Hello everybody,&lt;/p&gt;


	&lt;p&gt;Assume a redundantly deployed component (e.g. the same &lt;span class="caps"&gt;EJB&lt;/span&gt; deployed to a number of Java EE server instances) has an integration point to an external system and I&amp;#8217;d want to make this more robust using a Circuit Breaker. Would you have the different instances of the Circuit Breaker communicate with each other (i.e. if one flips open, they all do)? Or would this be overkill as they&amp;#8217;d probably all flip open within a short period of time anyway if there&amp;#8217;s a systemic problem?&lt;/p&gt;


	&lt;p&gt;Thanks a lot,&lt;/p&gt;


	&lt;p&gt;Oliver&lt;/p&gt;</description>
      <pubDate>Wed, 13 Feb 2008 16:14:48 -0000</pubDate>
      <guid isPermaLink="false">forums.pragprog.com:46:262:2287</guid>
      <author>Oliver Kamps</author>
      <link>http://forums.pragprog.com/forums/46/topics/262</link>
    </item>
  </channel>
</rss>
