Check the attached pictures for schedule and phone numbers.

WLCG Data Transfer Dashboards

If plots seems to be up to date (data in the last hours) then everything is OK and no further checks are required. Else, phone/email Cris or Luca! Please check the dashboards below:

WLCG Dashboard

Link to check:

FAX Dashboard

Link to check:

AAA Dashboard

Link to check:

FTS Dashboard

Link to check:

Guides

For the record, the support guide is:

DDM Dashboard:

Go to the following link:

If "Transfer Volume" and "Transfer Successes" plots are up-to-date (i.e. no more than 3 empty bins on the right) then everything is OK, else, phone or email Sergey and/or Luca

In general, we no longer see any failures that are resolved by a simple restart.

For the record, the support guide is:

SSB:

Go to the following 2 links:

Check in the colum headers if there is a clock icon next to EVERY datatable column, meaning that these metrics haven't been recently updated If this is the case, sms or phone Pablo.

SAM3:

Go to the following link:

select Last 12 hours time range and hit "Show results" button. If data for the last more than 3-4 hours is missing (last 1-2 hours is ok as the results were probably not calculated yet from SAM), sms or phone Pablo

Job Monitoring for ATLAS and CMS:

There is no need for active monitoring. If you get an alarm from the collectors, check the following use cases:

ATLAS

If you get an alarm like this: 'URGENT: PANDA collector is an hour or more behind!!!'. Please first check the delay at: http://dashb-atlas-stats.cern.ch/daily.html and then check the guide at https://twiki.cern.ch/twiki/bin/view/ArdaGrid/AtlasPandaGangaCollectors

If this doesn't help and you continue to get emails, then call Eddie.

CMS

If you get an alarm from the CMS collectors similar to this: 'Information from ML server on dashb-ai-583 is not updated in the dashboard for more than half an hour'. Please check the guide here: https://twiki.cern.ch/twiki/bin/view/ArdaGrid/JobMonitoringApplications

HammerCloud:

If the page is unavailable or there aren't any test in 'running' state, please send sms or email to Valentina and Jarka. Check the following links:

HammerCloud is providing service availability metrics to lemon, a notification has been configured to send alarms in case availability = 'available'. In case of a hc_service_degraded notification, please send sms or email to Valentina and Jarka. Contacts can be found here: https://twiki.cern.ch/twiki/bin/view/ArdaGrid/HammerCloud#Contacts

Topic attachments
I Attachment History Action Size Date Who Comment
PNGpng dec.png r1 manage 44.2 K 2013-12-20 - 12:10 PabloSaiz  
PNGpng jan.png r1 manage 23.5 K 2013-12-20 - 12:10 PabloSaiz  
Edit | Attach | Watch | Print version | History: r19 < r18 < r17 < r16 < r15 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r19 - 2015-12-28 - LucaMagnoni
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    ArdaGrid All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback