Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
LCG Accounting John Gordon Grid Deployment Board 13th January 2004 [email protected] Overview  Aim is to gather accounting information at the job level from each site and store in a central database so that arbitrary queries can be done against it.  Four parts     Schema Gathering Sending Querying [email protected] Schema  Defined our own  Then discovered GGF Usage Schema  http://www.psc.edu/~lfm/Grid/UR-WG/ [email protected] Accounting Schema                    LocalGroup - the group that the job ran as on the system LocalJobID - the ID of the job on the system on which it ran ** LCGJobID - the job ID assigned by the RB (null for jobs submitted locally or via globus) ** ExecutingSite - the site at which the job ran ** ExecutingCE - the queue from which the job ran SubmittingRB - the RB used to submit the job LocalUserID - the ID of the user on the system on which the job ran ** LCGUserDN - the DN of the user within LCG ** LCGUserVO - the virtual organisation of this user for this job ** LocalStartTime - the local wallclock date/time at which the job commenced execution LocalStopTime - the local wallclock date/time at which the job ceased execution UTCStartTime - the wallclock date/time at which the job commenced execution in UTC (GMT) ** UTCStopTime - the wallclock date/time at which the job ceased execution in UTC ** ElapsedTime - StopTime-StartTime (calculated seconds) BaseCpuTime - the actual CPU time recorded by the system on which the job ran (seconds) BaseCpuPower - the power of the system on which the job ran in SpecInts (needs more precise definition) SpecIntSecs - the computing power consumed by the job in SpecInt-Seconds (calculated from previous two fields) ** DiskSpace - the maximum aggregated amount of local disk space used by the job (Mb) DiskIO - the aggregated volume of disk data read/written by the job (Mb) [email protected] Gathering  PBS doesn’t produce job accounts, only logs events  Four solutions     RAL processes event logs and populkates db NIKHEF does similar pbsacct in SourceForge Job info available in pbs prolog. Do it direct.  Still considering  Assume unix group = vo name  Several solutions to mapping unix username to DN  Not yet chosen  Initial version probably VO only [email protected]  LSF will be easier as it has job accounts  Site ultimately responsible.  They can filter out certain info (eg non-LHC VOs)  Or remap groups to Vos  Or replace the whole thing by calls to their own internal database. [email protected] Sending  Send schema info over grid to central database  Considered  Spitfire  Develop our own web service  R-GMA • R-GMA Archiver is a web-service which takes a row of information and inserts it in a persistent database  Chose R-GMA  Stream producer and general archiver  Retain records in local Mon to provide resilience against network failure.  Sites could also run archivers to keep a copy of their data local. [email protected] Querying  Simple SQL query to start  Canned queries with plotting  General queries later [email protected] Progress  Evaluated different components separately  Now integrating     Our Schema PBS prolog Calling R-GMA stream producer With general archive consuming  Prototype by 23rd. [email protected]