Grid
Deployment Board (GDB)
Web - Wiki - Agendas - Minutes
9 September 2009
GDB Meeting Agenda
SL5
Deployment at Sites J.Gordon summarized the
situation of the SL5 deployment at several WLCG Sites.
Middleware
Updates - A.Unterkircher presented
the status of the migration of the Worker Node to SL5. He also summarized the upcoming released
of several gLite and storageware packages.
CREAM
Deployment at Sites A.Retico presented the
status of the deployment of the CREAM CE.
Experiments. In particular ALICE has asked that CREAM is deployed at
all their Tier-1 and Tier-2 Sites.
LHC-OPN
Update W.Salter presented the
status and progress of the LCG Optical Private Network which is connecting
the Tier-0 and Tier-1 Sites of the WLCG.
He also presented the recent modifications that were agreed and
executed on the OPN.
Experiments
Tests in SAM J.Shade presented the
strategy for moving SAM from a centralised infrastructure at CERN to a distributed framework based on Nagios.
He also described how Experiments tests and results will be integrated in
this new scenario.
ALICE
Operations P.Mendez-Lorenzo presented
the priorities of the ALICE requirements. In particular ALICE needs SL5 and
CREAM installed at all ALICE Sites.
Virtualization
for WLCG T.Cass proposed a strategy
for introducing virtualization in the WLCG infrastructure in order to
enable experiments/users to choose the environment for their jobs execution
and ensure sites have control and traceability over resource usage.
WLCG
Technical Forum M.Litmaath presented the
newly launched working group in charge of clarifying the areas for
improvement among the WLCG stakeholders (Sites, Experiments, Software providers)
in order to form a common WLCG position w.r.t. EGEE, EGI, OSG and other
software and hardware providers. In the longer term will represent also the
WLCG needs w.r.t. services and middleware, in term of sustainability and
evolution of the existing middleware in the light of changing technologies
and the experience gained.
8 July 2009 GDB
Meeting Agenda
Security
Policies D.Kelsey presented the
status and progress of the Joint Security Policy Group and of four document
under final approval call regarding: VO Registration, VO Management,
User-level accounting, VO Portals and Security Incident Response.
SCAS/glExec
and Pilots Jobs M.Litmaath presented the
open issues with SCAS, glExec on the Worker Nodes and the Pilot Jobs
frameworks from the Experiments.
Discussion
on Chimera Migration M.Jouvin presented the
reaction of the Sites to the advice to upgrade dCache from PNFS to Chimera.
The Tier-1 Sites running dCache replied that they cannot migrate to Chimera
before data-taking in 2009, and maybe 2010.
SL5
Migration and Middleware Updates A.Unterkircher presented the status of the migration of the Worker
Node to SL5. He also summarized the
upcoming released of several gLite and storageware packages.
EGEE
Operational Tools The tools used in EGEE
Operations were presented, as well as coming improvements. In particular
there were presentations about SAM regional monitoring, Distributed GOCDB,
GSTAT 2.0 and HEPSPEC Benchmarking for CPU Accounting.
Installed
Capacity S.Traylen presented the
status and progress of the activity collecting installed capacity at the
Sites and also at the migration to the use of the new HEPSPEC-06 CPU
benchmark.
VOM(R)S
Working Group Wrap-up M.Dimou presented mandate
and the progress of the working group and on its activity in order to
foster VOM(R)S/VOMS-admin convergence.
10 June 2009 GDB Meeting Agenda
Middleware and CREAM Update A.Unterkircher presented the
status and progress of the updated expected for end of June and N.Thackray
presented the update of the CREAM CE testing and deployment.
STEP09 Progress J.Shiers presented a summary of the Scale
Test for the Experiment Program (STEP09). The goal was to test the
readiness of the WLCG Sites under realistic use conditions from the LHC
Experiments. The presentation shows
the main achievements reached and the issues still to address. A Post Mortem Workshop, to discuss the
lessons learned, took place on the 9-10 July 2009 at CERN. Link
HEPiX Summary M.Jouvin provided a summary of the HEPiX
meeting in Umea focusing on virtualization, benchmarking, file system and
data centers in High Energy Physics.
SL5 for the Experiments and the LCG Meta-package P.Mato
summarized the requirements of the Experiments in order to migrate their
Physics Applications to SL5. O.Keeble presented the idea of a deployment
meta-package that would groups all needed packages under a set of package
dependencies for the WLCG software stack.
User Analysis Support M.Lamanna
chaired a discussion on support for user analysis and presented the tools
currently used in the Experiments for Sites availability, robots for
regular job submission on selected datasets and load generators.
9 June 2009 Pre-GDB Meeting Agenda
Tier-2 Storage Support The meeting focused on storage at the
Tier-2 Sites. The roadmap and support models of dCache, DPM and StoRM were
presented and discussed. The Experiments and the US Tier-2 Sites also
provided their input.
16 September 2009 Minutes
Move to WMS.3.1.21-0 - Call to encourage sites to move to the most recent version of WMS
available in gLite 3.1, which corresponds to gLite-WMS 3.1.21-0 in the
gLite repository.
Security Patches - Last month several alerts were sent to all Sites about critical
security vulnerabilities in the Linux kernel. It is essential to actively
and urgently ensure all the Sites have applied these security patches.
7 September 2009 Minutes
DPM Move to SL4 - Last reminder that the default DPM used for SAM tests were be
upgraded to SL4 on the 7th of September and that sites with obsolete client
software will start failing tests.
31 August 2009 Minutes
SAM
MPI Tests - Despite the fact that
there are MPI deployment problems with the gLite SL5 Worker Nodes, the SAM
MPI tests can be enabled because the sites published information system is
interrogated first to check for MPI support at each given site.
24 August 2009 Minutes
Default
DPM on SL4 - Last reminder that the
default DPM used for SAM tests will be upgraded to SL4 and that sites with
obsolete client S/W will start failing tests. There are still 11 CEs
failing.
SAM
MPI Tests Validation - At the TMB on 05/08/2009
it was agreed that SAM MPI tests should be re-instated and results followed
up. They will go through validation, for a shorter period, as they had gone
through validation before. All Sites are requested to check the validation
results and follow up.
Tier-2
Sites Asked to Move to SL5 - WLCG
MB agreed on 4th of August to ask for the SL5 migration at all Sites, including
the Tier-2 Sites. A web page is available to provide pointers to the
relevant information to support the migration, including links to the
necessary packages. (Link). It is understood that
the Experiments are all ready and able to use SL5 resources.
3 August 2009 Minutes
DPM
on SL3 Tests Will Fail -
Because of the imminent upgrade to SL4 of one of the services (DPM) used to
submit SAM tests sites still running on SL3 (no longer supported) would
start failing the tests. This effect can already be seen from the SAM
validation instance. All sites filing the tests in the validation instance
should upgrade their WNs to the last version ASAP and ROCs are invited to
monitor this process. The switch of the SAM DPM production is scheduled for
the 10th of August, less than 2 weeks from now.
27 July 2009 Minutes
BDII Patch Needed
for SL5 Sites - Sites running top-level BDIIs on SL5 are required to
install the latest patch included in gLite 3.2 update 0.4.
20 July 2009 Minutes
Incompatibility
between VOMS and mod_gridsite - The UK/Ireland region highlighted an
incompatibility between VOMS when configured in a particular style and with
mod_gridsite as part of the WMS. This is being investigated now by the EGEE
EMT.
|
|
Management Board (MB)
Web - Wiki - Members - Agendas - Minutes
8 September 2009 Agenda, Minutes
WLCG
Operations Weekly Report M.Girone
presented the summary of status and progress of the LCG Operations since
last MB meeting. Link
Update on the SRM MoU Addendum Features A.Sciabα
presented the prioritized requirements, by the Experiments, on the short
term development of the SRM implementations. All information is now
collected in a wiki page with all details on the features and the ranking
of each Experiment. Link
Update on Data Access - M.Litmaath
presented the latest news on data access, the issues encountered during
STEP09 and the ongoing work.
1 September 2009 Agenda, Minutes
WLCG
Operations Weekly Report
O.Barring presented the summary of status and progress of the LCG
Operations since last MB meeting. The main events were the
BNL site name change: BNL-LCG2 to BNL-ATLAS and a kernel patch for
published local exploits.
2009/2010 Resources
- I.Bird summarized the situation with the 2009 Experiments requirements
and Sites procurement.
CASTOR Versions
- A.Pace reported about
the current and coming version of CASTOR. Version 2.1.7 is in use at the
Tier-1 Sites while CERN has already moved to 2.1.8.
High Level
Milestones Update The MB discussed the status and progress of the
High Level Milestones. In particular the updates to SL5, to the CREAM CE
and the use of the new HEPSPEC-06 benchmark.
18 August 2009 Agenda, Minutes
ALICE
on SL5 and CREAM CE - ALICE officially requested all
ALICE WLCG Sites to provide as soon as possible SL5 Worker Nodes and the
CREAM CE. Those sites which have not
completed the SL5 migration will not be able to participate in the
production until the full migration.
WLCG
OPS Weekly Report H.Renshall presented the
summary of status and progress of the LCG Operations since last MB meeting.
Three alarm tickets issued in the two weeks period at CERN, DE-KIT and RAL.
The daily meetings summaries are always available: Link
Update
on the EU Proposals J.Shiers presented the status of
the EU proposals in terms of scope the projects and manpower resources
needed. He also asked for input from the four LHC Experiments.
4 August 2009 Agenda, Minutes
Changing
T1 and T2 Sites Names - I.Bird noted that Sites
intending to change name should consider all the side effect of such
decisions. This item is mentioned because BNL would like to change their
Tier-1 name in.
WLCG
OPS Weekly Report H.Renshall presented the
summary of status and progress of the LCG Operations since last MB meeting.
There was a mixture of problems, mostly site related, and no alarm tickets.
The incidents leading to service incident reports were a power cut at ASGC
and storage degradation at NL-T1. Link
SL5
Deployment - Some Sites have asked for
clarification about the SL5 deployment. J.Gordon asked that the MB approve
and requests the move to SL5 to the Sites. Experiments want to have
initially a separate CE in order to test SL5 nodes at a given Site. The
Experiments will then ask for the change in proportion of SL4/SL5 nodes at
each Site.
SRM
Status - A.Sciabα summarized the current
status and issues of the different SRM systems.
DCache
Migration to Chimera and New Release Policy - P.Fuhrmann
presented the status of dCache and the migration to Chimera. There are also
papers explaining more in technical details the motivations and the
procedures to follow. DCache.org is moving towards time-based releases.
Time bases release a common practice in large software projects.
21 July 2009 Agenda, Minutes
WLCG
OPS Weekly Report M.Girone presented the summary
of status and progress of the LCG Operations since last MB meeting. No
alarm tickets for the two weeks. The main topics of the week were RAL move
to new machine room successfully completed and a NIKHEF cooling problem and
30% capacity off until move to new CC.
WLCG
Technical Forum - M.Litmaath presented the
goals and his ideas on the WLCG Technical Forum that he will chair. The TF
scope and mandate include all topics for improvement between WLCG
stakeholders and provide input on a single WLCG position to EGEE, EGI, OSG
and Experiments.
Follow-up
to STEP09 Post Mortem Workshop - I.Bird presented a summary of the discussion at the STEP09
Workshop and of the actions for Tier-0/1/2 Sites and the four Experiments.
.
7 July 2008 Agenda, Minutes
Approval of Security
Policy Documents - D.Kelsey summarized the process followed and
reported about the EGEE procedures for storing user logging information.
The final agreed wording now says one year of data storage and custody
The Security Policy documents presented were approved by the WLCG MB.
GGUS Notification to
OSG Sites M.Dimou presented the notification process of GGUS ticket to
OSG Sites. She also provided links to documentation on the whole set of
definitions and background information.
Update on the HEP
SSC Preparation - J.Shiers reported on the workshop in Paris about the
preparation of the HEP SSC proposal.
CMS
Quarterly Report - M.Kasemann presented the 2009Q2 quarterly report for CMS.
Architects Forum (AF)
Minutes
- Web
20 August 2009
- Minutes
SELinux Enabled - Confirmation
that the additional bit to be enabled in SELinux (i.e. allow_execheap) will
be required as long ATLAS needs to run old software and the problem with
the Oracle client library is not fixed. No clear time scale yet.
LCG_56c Released
- The release of the LCG_56c configuration was done successfully and well
in time for the ATLAS releases.
LCG_56d Preparation
- Agreed to prepare a new configuration LCG_56d consisting of only LCGCMT
modifications but no other package changes.
Migration to Oracle
11.1.0.17 - The proposal to migrate the Oracle client version to
11.1.0.7 presented by A.Valassi was agreed.
23 July 2009 -
Minutes
SELinux Needed on
SL5 - Inconclusive tests on what SELinux bits are required for Oracle
library to work on SL5
LCG_56c Preparation
- Details of the contents for the new configuration (LCG_56c) have been
discussed and agreed.
11 June 2009 - Minutes
SLC5 Deployment at
CERN - Discussion on the SLC5 deployment at CERN and follow-up of the
proposal of disabling SELinux. Additional SLC5 issues and feedback from
latest GDB.
LHC Applications
Heap Tool - Proposal for a LHC-wide tool to analyze the content of our
applications heap. There was general agreement that the kind of information
collected by the different tools is similar. Agreed to setup a group of
experts and discuss the details.
Multi-core
Applications Workshop - The workshop on adapting Physics applications
and computing services to multi-core and virtualization was discussed and
defined. .
General News and Main Events
LCG Meetings - Calendar
|
21-25
September 2009
|
EGEE
2009 Conference Barcelona
Web Site
|
14
October 2009
|
Grid
Deployment Board (GDB) - CERN.
GDB
Agenda
|
11
November 2009
|
Grid
Deployment Board (GDB) - CERN.
GDB
Agenda
|
8
December 2009
|
Grid
Deployment Board (GDB) - CERN.
|
|