Bamboo Data Center / BAM-3264

System (server) will shut down EC2 instance when agent heartbeat stops.

      • Configurable grace period between when the server detects that the heartbeat has stopped and when the instance is shut down.
      • Backend functionality to terminate an arbitrary instance already exists.

            sladey added a comment -

            moving to M5 since there is no doubt review work required here.

            MarkC added a comment -

            No, there's no functionality to restart, but there is a use case for debugging when things go wrong.
            That's why we give them a timeout.

            Krystian Brazulewicz added a comment -

            What is the reason for delaying instance shutdown if an agent is killed and unregistered? After that it cannot get back online, and I have not seen any functionality to start an elastic agent on a running instance.

            MarkC added a comment -

            To clarify:

            • When the heartbeat period expires, the agent is killed immediately and the wait period for killing the instance begins.
            • The instance is killed after the wait period.
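
The two stages above can be sketched as a small state machine. This is only an illustration under stated assumptions: the class `ElasticInstanceReaper`, its `State` enum, and the methods `onHeartbeat` and `tick` are hypothetical names, not Bamboo code, and the two durations stand in for the heartbeat timeout and the configurable instance wait period discussed in this issue. Time is passed in explicitly so the behavior can be checked without sleeping.

```java
import java.time.Duration;
import java.time.Instant;

/** Hypothetical sketch of the two-stage shutdown; none of these names come from Bamboo. */
public class ElasticInstanceReaper {
    public enum State { AGENT_ALIVE, AGENT_KILLED_WAITING, INSTANCE_TERMINATED }

    private final Duration heartbeatTimeout;     // assumed stand-in for the heartbeat timeout setting
    private final Duration instanceWaitPeriod;   // assumed stand-in for the configurable grace period
    private Instant lastHeartbeat;
    private Instant agentKilledAt;               // null until the agent has been killed
    private State state = State.AGENT_ALIVE;

    public ElasticInstanceReaper(Duration heartbeatTimeout, Duration instanceWaitPeriod, Instant now) {
        this.heartbeatTimeout = heartbeatTimeout;
        this.instanceWaitPeriod = instanceWaitPeriod;
        this.lastHeartbeat = now;
    }

    /** Records a heartbeat. Ignored once the agent is killed, since it cannot come back online. */
    public void onHeartbeat(Instant now) {
        if (state == State.AGENT_ALIVE) {
            lastHeartbeat = now;
        }
    }

    /** Periodic check: kills the agent on heartbeat expiry, then the instance after the wait period. */
    public State tick(Instant now) {
        if (state == State.AGENT_ALIVE
                && Duration.between(lastHeartbeat, now).compareTo(heartbeatTimeout) >= 0) {
            // Stage 1: heartbeat period expired, agent is killed and the wait period starts now.
            agentKilledAt = now;
            state = State.AGENT_KILLED_WAITING;
        }
        if (state == State.AGENT_KILLED_WAITING
                && Duration.between(agentKilledAt, now).compareTo(instanceWaitPeriod) >= 0) {
            // Stage 2: wait period elapsed, the instance would be terminated here.
            state = State.INSTANCE_TERMINATED;
        }
        return state;
    }
}
```

In this sketch the wait period is measured from the moment the server detects the missed heartbeat, which leaves the instance reachable for debugging until the period elapses.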

            MarkC added a comment -

            The shutdown should be triggered when an agent has stopped. The trigger should wait for the timeout in ElasticConfiguration before it actually shuts down the agent.

            Krystian Brazulewicz added a comment -

            The current behavior is that when a remote (or elastic) agent does not send a heartbeat for bamboo.agent.heartbeatTimeoutSeconds, LocalAgentManager.stopAgent is called. For elastic agents, this method deregisters the agent.

            Should this behavior be retained, or should I change it so that the agent is not unregistered immediately after the heartbeat delay?

            sladey added a comment -

            updated from M4 planning

            Estimated: 4h
            Remaining: 0h
            Logged: 12h