History | Log In     View a printable version of the current page.  
HQ 4.0 EE Release is Now Available | HQ 3.2.5-EE Maintenance Release is Now Available
Issue Details (XML | Word | Printable)

Key: HHQ-871
Type: Bug Bug
Status: Closed Closed
Resolution: Fixed
Priority: Blocker Blocker
Assignee: Jon Travis
Reporter: Ryan Morgan
Votes: 0
Watchers: 0
Operations

If you were logged in you would be able to see more operations.
Hyperic HQ

High load and transaction timeouts on server restart when lather connections setting is high

Created: 03/May/07 09:24 AM   Updated: 09/May/07 12:09 PM
Component/s: Server
Affects Version/s: 3.0.4
Fix Version/s: 3.1.0, 3.0.5

File Attachments: 1. Text File server.out.txt (2.31 Mb)


Last comment: 80 weeks, 2 days ago
Resolution Date: 04/May/07 08:41 AM


 Description  « Hide

It has been observed in large environments where the lather settings are set higher than the default (25) that the server can get locked up processing backlogged data from agents. The amount of data that is backlogged does not seem to matter. I have seen this with the server only being down for about 2 minutes.

A full thread dump shows contention in the sending of metric events after the data has been persisted.

I'll attach the full thread dump.


 All   Comments   Change History      Sort Order: Ascending order - Click to sort in descending order
Ryan Morgan - 03/May/07 09:26 AM

See attached thread dump. There are 2 separate thread dumps each taken about 5 minutes apart. The server also had verbose gc logging turned on, so you can ignore those entries.

There are dozens of entries stuck on:

org.hyperic.hq.measurement.server.session.DataManagerEJBImpl.sendMetricEvents



Ryan Morgan - 09/May/07 12:09 PM

Verified fixed with 3.0.5 #400.