Check the attached pictures for schedule and phone numbers.
WLCG Data Transfer Dashboards
If plots seems to be up to date (data in the last hours) then everything is OK and no further checks are required.
Else, phone/email Cris or Luca! Please check the dashboards below:
WLCG Dashboard
Link to check:
FAX Dashboard
Link to check:
AAA Dashboard
Link to check:
FTS Dashboard
Link to check:
Guides
For the record, the support guide is:
DDM Dashboard:
Go to the following link:
If "Transfer Volume" and "Transfer Successes" plots are up-to-date (i.e. no more than 3 empty bins on the right)
then everything is OK,
else, phone or email Sergey and/or Luca
In general, we no longer see any failures that are resolved by a simple restart.
For the record, the support guide is:
Go to the following 2 links:
Check in the colum headers if there is a clock icon next to
EVERY datatable column, meaning that these metrics haven't been recently updated
If this is the case, sms or phone Pablo.
SAM3:
Go to the following link:
select Last 12 hours time range and hit "Show results" button. If data for the last more than 3-4 hours is missing (last 1-2 hours is ok as the results were probably not calculated yet from SAM), sms or phone Pablo
Job Monitoring for ATLAS and CMS:
There is no need for active monitoring. If you get an alarm from the collectors, check the following use cases:
ATLAS
If you get an alarm like this: 'URGENT: PANDA collector is an hour or more behind!!!'. Please first check the delay at:
http://dashb-atlas-stats.cern.ch/daily.html and then check the guide at
https://twiki.cern.ch/twiki/bin/view/ArdaGrid/AtlasPandaGangaCollectors
If this doesn't help and you continue to get emails, then call Eddie.
CMS
If you get an alarm from the CMS collectors similar to this: 'Information from ML server on dashb-ai-583 is not updated in the dashboard for more than half an hour'.
Please check the guide here:
https://twiki.cern.ch/twiki/bin/view/ArdaGrid/JobMonitoringApplications
HammerCloud:
If the page is unavailable or there aren't any test in 'running' state, please send sms or email to Valentina and Jarka.
Check the following links:
HammerCloud is providing service availability metrics to lemon, a notification has been configured to send alarms in case availability = 'available'. In case of a hc_service_degraded notification, please send sms or email to Valentina and Jarka.
Contacts can be found here:
https://twiki.cern.ch/twiki/bin/view/ArdaGrid/HammerCloud#Contacts