SquidMonitoringTaskForce (2019-11-20, DaveDykstra)
---+ Squid Monitoring Task Force

Contents of this page:

%TOC%

---++ Objectives

   1. Make a plan for moving squid monitoring from CMS Frontier management to WLCG management
   1. Decide how to integrate squid monitoring better with WLCG common operations
   1. Produce an architecture for a common squid monitoring system that is configured the same way for all VOs

We should avoid defining how applications discover what proxies to use -- that will be a separate task force.

---++ Members

   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE372511][Dario Barberis]] (ATLAS)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE710305][Alexandre Beche]] (CERN-IT)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE541401][Doug Benjamin]] (ATLAS)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE375888][Barry Blumenfeld]] (CMS)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE531497][Simone Campana]] (CERN-IT & ATLAS)
   * [[https://phonebook.cern.ch/phonebook/#personDetails/?id=485759][David Crooks]] (NGI_UK, added August 2014)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE654209][Alastair Dewhurst]] (ATLAS)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE614260][Alessandro Di Girolamo]] (CERN-IT & ATLAS)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE662978][Dave Dykstra]] (CMS, OSG), Chair
   * [[https://phonebook.cern.ch/phonebook/#personDetails/?id=650724][Costin Grigoras]] (ALICE, added July 2014)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE742828][Luis Linares]] (CMS)
   * [[https://phonebook.cern.ch/phonebook/?id=PE576600][Stefan Roiser]] (CERN-IT & LHCb)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE731053][Scott Teige]] (OSG, left August 2014)
   * [[https://phonebook.cern.ch/phonebook/?from=xwho#id=PE437551][Andrea Valassi]] (CERN-IT)

All members can be contacted at wlcg-ops-coord-tf-squidmon@cern.ch

---++ Task Overview

| *#* | *Task* | *Deadline* | *Progress* | *Affected VO* | *Affected Sites* | *Comment* |
| 1 | Move MRTG to wlcg-squid-monitor | May 2013 | 100% | All | All | Done by Luis Linares |
| 2 | Move awstats to wlcg-squid-monitor | August 2014 | 100% | All | All | Done by Luis |
| 3 | Change frontier-awstats rpm to send to wlcg-squid-monitor | December 2014 | 100% | All | Launchpad | Done by Dave Dykstra |
| 4 | Update frontier-awstats on launchpads and backup proxies | October 2015 | 100% | All | Launchpad, backup proxies, and stratum 1s | Done by administrators. |
| 5 | Implement squid monitoring based on GOCDB/OIM | November 2014 | 100% | All | All | Done by Alastair Dewhurst |
| 5b | Implement page for accessing removed MRTG plots | October 2019 | 100% | All | All | Done by Edita [[https://its.cern.ch/jira/browse/FTOPSDEVEL-65][#65]] |
| 6 | Implement exceptions list for generating MRTG plots | February 2016 | 100% | All | some | Done by Alastair [[https://its.cern.ch/jira/browse/FTOPSDEVEL-56][#56]] [[https://its.cern.ch/jira/browse/FTOPSDEVEL-57][#57]] |
| 7 | Implement mapping to CMS site names | October 2014 | 100% | CMS | All CMS | Done by Luis |
| 8 | Add generating CMS MRTG page | November 2019 | 100% | CMS | All CMS | To be done by Edita [[https://its.cern.ch/jira/browse/FTOPSDEVEL-58][#58]] |
| 9 | Add generating ATLAS MRTG page | November 2017 | 100% | ATLAS | All ATLAS | Done by Michal Svatos |
| 10 | Update ATLAS SSB to be based on new MRTG page | November 2017 | 100% | ATLAS | All ATLAS | Done by Michal Svatos [[https://its.cern.ch/jira/browse/FTOPSDEVEL-111][#111]] |
| 11 | Implement generalized failover monitor | July 2014 | 100% | CMS | All CMS | By Luis, using CMS squid list |
| 12 | Convert failover monitor to use GOCDB/OIM squid organizations list | November 2019 | 100% | All | All | By Michal Svatos, add ATLAS & CVMFS [[https://its.cern.ch/jira/browse/FTOPSDEVEL-102][#102]] |
| 13 | Implement SAM/SUM test based on failovers | September 2019 | 100% | All | All | Done by Edita [[https://its.cern.ch/jira/browse/FTOPSDEVEL-236][#236]], ATLAS might not want |
| 14 | Integrate !MonALISA-based squid monitor | November 2014 | 100% | All | All | Done by Costin Grigoras with help from Luis & Dave |
| 15 | Implement exception list to convert monitor view to worker node view | September 2016 | 100% | All | All | Done by Dave |
| 16 | Disable CMS audit comparing MRTG config to SITECONF | November 2019 | 100% | CMS | All CMS | To be done by Edita |

---++ Meetings

   * 2012-10-04 [[SquidMonitoringTF20121004MeetingNotes][Notes]]
   * 2012-11-30 [[SquidMonitoringTF20121130MeetingNotes][Notes]]
   * 2012-12-14 [[SquidMonitoringTF20121214MeetingNotes][Notes]]
   * 2013-01-17 [[SquidMonitoringTF20130117MeetingNotes][Notes]]
   * 2013-02-01 [[SquidMonitoringTF20130201MeetingNotes][Notes]]
   * 2014-08-28 [[SquidMonitoringTF20140828MeetingNotes][Notes]]

---++ Presentations

   * [[https://indico.cern.ch/getFile.py/access?subContId=1&contribId=1&resId=0&materialId=slides&confId=215003][Update for 2012-11-01 WLCG Operations Coordination meeting]]
   * [[https://indico.cern.ch/getFile.py/access?subContId=1&contribId=1&resId=0&materialId=slides&confId=219493][Update for 2012-12-06 WLCG Operations Coordination meeting]]
   * [[https://indico.cern.ch/getFile.py/access?subContId=1&contribId=0&resId=0&materialId=slides&confId=231008][Update for 2013-01-24 WLCG Operations Coordination meeting]]
   * [[https://indico.cern.ch/getFile.py/access?subContId=0&contribId=1&resId=0&materialId=slides&confId=233962][Update for 2013-02-07 WLCG Operations Coordination meeting]]

---++ Agreements

   * Move MRTG and awstats squid monitoring from [[http://frontier.cern.ch][frontier.cern.ch]] to a new pair of virtual machines more closely associated with WLCG
      * The machines will be a contribution from CMS and hosted in the vocms cluster
      * The public alias will be wlcg-squid-monitor.cern.ch
   * Squid services will be registered in GOCDB & OIM, as publicly available round-robin DNS aliases if there is more than one squid implementing the same service. Sites may have multiple independent squid services registered. GOCDB & OIM will not distinguish between different purposes/applications for squid services.
   * Additional information needed for squid monitoring beyond that stored in GOCDB & OIM will be maintained by per-VO operations personnel in per-VO files on the squid monitoring servers. These will be combined with the GOCDB & OIM information into another file that serves as the input to an MRTG configurator, so that ATLAS could instead generate that file from AGIS in the future.
   * A SAM/SUM test of Squid services will be set up, based on non-response to MRTG monitoring probes a few periods in a row. Those Squids for which SAM tests are required must be declared in GOCDB or OIM.
   * An additional SAM/SUM test of Squid services will be set up based on hits to Frontier/CVMFS reverse-proxy Squids *not* going through the site's Squid services (also known as "failovers"), based on awstats on the monitoring server
      * ATLAS operations is not currently planning to use this
   * Integrate the ALICE !MonALISA-based squid monitor including an optional host-based data collection rpm (see [[SquidMonitoringTF20140828MeetingNotes][2014-08-28 meeting notes]])

---++ Questions

   * Questions we asked and their answers are on the SquidMonitoringTaskForceQuestions page

---++ Proposals

   * [[SquidMonitoringTFInfoSystem][Squid configuration information proposal]]

---++ Production documentation

   * [[SquidMonitoringMachines][Layout and Configuration of the Squid Monitoring Machines]]
   * [[WLCGSquidRegistration][Instructions for registering Squids in GOCDB or OIM]]
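
---++ Illustrative sketch of the two SAM/SUM test criteria

The Agreements section describes two SAM/SUM test criteria: a squid that does not respond to monitoring probes "a few periods in a row", and "failovers", meaning hits on reverse-proxy squids that did not go through the site's own squids. The following is only a minimal sketch of that logic and is not the code running on the monitoring servers. The threshold of 3 periods, the function names, and the input data layout (lists of probe results, site network ranges, squid addresses) are all assumptions made for illustration; the production failover monitor works from awstats data, whose format is not described on this page.

```python
from ipaddress import ip_address, ip_network

# Assumed value: the page only says "a few periods in a row".
FAILED_PERIODS_THRESHOLD = 3


def is_unresponsive(probe_results, threshold=FAILED_PERIODS_THRESHOLD):
    """Return True if the most recent `threshold` probes all failed.

    probe_results is a list of booleans, oldest first; True means the
    squid answered the monitoring probe in that period.
    """
    if len(probe_results) < threshold:
        return False  # not enough history to declare a failure
    return not any(probe_results[-threshold:])


def count_failovers(hits, site_networks, site_squids):
    """Count hits per site that bypassed the site's registered squids.

    hits: iterable of client IP strings seen by a reverse-proxy squid.
    site_networks: dict of site name -> list of CIDR strings (hypothetical layout).
    site_squids: dict of site name -> set of squid IP strings (hypothetical layout).
    """
    counts = {}
    for ip_text in hits:
        ip = ip_address(ip_text)
        for site, networks in site_networks.items():
            if ip_text in site_squids.get(site, set()):
                continue  # request came from a registered squid: not a failover
            if any(ip in ip_network(net) for net in networks):
                counts[site] = counts.get(site, 0) + 1
    return counts
```

For example, with one hypothetical site whose worker nodes live in =192.0.2.0/24= and whose squid is =192.0.2.1=, hits from =192.0.2.10= and =192.0.2.11= would be counted as two failovers, while a hit from =192.0.2.1= itself would not be counted.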
Topic revision: r56 - 2019-11-20 - DaveDykstra
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.