HipChat / HCPUB-1087

Tomcat Crash Brought Down Crowd


Details

    • Severity 3 - Minor

    Description

      Problem

      The HipChat Server instance was partially unavailable for about 20 minutes. Investigation showed that HipChat's embedded Atlassian Crowd instance had crashed, with the following error in catalina.out:

      Jun 28, 2016 7:21:55 PM org.apache.catalina.core.StandardServer await
      SEVERE: StandardServer.await: accept:
      java.net.SocketTimeoutException: Accept timed out
      at java.net.PlainSocketImpl.socketAccept(Native Method)
      at java.net.AbstractPlainSocketImpl.accept(AbstractPlainSocketImpl.java:398)
      at java.net.ServerSocket.implAccept(ServerSocket.java:530)
      at java.net.ServerSocket.accept(ServerSocket.java:498)
      at org.apache.catalina.core.StandardServer.await(StandardServer.java:451)
      at org.apache.catalina.startup.Catalina.await(Catalina.java:777)
      at org.apache.catalina.startup.Catalina.start(Catalina.java:723)
      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      at java.lang.reflect.Method.invoke(Method.java:606)
      at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:321)
      at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:455)
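
      To check whether earlier outages carried the same signature, catalina.out can be scanned for the SEVERE line followed directly by the SocketTimeoutException line. The following is a minimal sketch in Python, assuming the log format shown in the trace above; the function name `count_accept_timeouts` and the `sample` string are hypothetical and exist only for illustration.

      ```python
      import re

      # Matches the SEVERE line logged by StandardServer.await when it is
      # immediately followed by the SocketTimeoutException line, as in the trace.
      PATTERN = re.compile(
          r"^\s*SEVERE: StandardServer\.await: accept:\s*\n"
          r"\s*java\.net\.SocketTimeoutException: Accept timed out",
          re.MULTILINE,
      )


      def count_accept_timeouts(log_text):
          """Return how many times the accept-timeout signature occurs in log_text."""
          return len(PATTERN.findall(log_text))


      # Hypothetical excerpt built from the first lines of the trace above.
      sample = (
          "Jun 28, 2016 7:21:55 PM org.apache.catalina.core.StandardServer await\n"
          "SEVERE: StandardServer.await: accept:\n"
          "java.net.SocketTimeoutException: Accept timed out\n"
          "at java.net.PlainSocketImpl.socketAccept(Native Method)\n"
      )
      print(count_accept_timeouts(sample))
      ```

      In practice the text would come from reading the real catalina.out; its location depends on the installation and is not stated in this report.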
      

      Crowd inside HipChat Server appears to run under Tomcat 7.0.54, so the crash may be related to this Tomcat bug: https://bz.apache.org/bugzilla/show_bug.cgi?id=56684

      The concern is that this could happen again. It was requested that the Tomcat bundled with HipChat's built-in Crowd be updated to 7.0.55 or higher in a future server release, since patching Tomcat in an existing release may not be trivial.
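
      Whether a given instance still runs a Tomcat older than the requested 7.0.55 can be checked with a simple version comparison. This is a rough sketch in Python; the helper names `parse_version` and `needs_upgrade` are hypothetical, and the version string is assumed to be supplied by the caller, since this report does not say how to read it from a HipChat Server install.

      ```python
      def parse_version(version):
          """Turn a dotted version string such as '7.0.54' into a tuple of ints."""
          return tuple(int(part) for part in version.split("."))


      # Taken from the request above: 7.0.55 is the lowest acceptable release.
      MINIMUM_VERSION = parse_version("7.0.55")


      def needs_upgrade(running_version):
          """Return True if running_version is older than MINIMUM_VERSION."""
          return parse_version(running_version) < MINIMUM_VERSION


      print(needs_upgrade("7.0.54"))  # True
      print(needs_upgrade("7.0.55"))  # False
      ```

      Comparing integer tuples rather than raw strings avoids misordering releases such as 7.0.100, which sorts before 7.0.55 as text.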

      Steps to Reproduce

      None. The server appeared to be operating normally when Crowd suddenly stopped responding and shut down.


          People

            Assignee: Unassigned
            Reporter: Kent Baxley (kbaxley)
            Archiver: Michael Andreacchio (mandreacchio)
