Processing Throughput Plots

I plotted the things processed/hour for a number of stages, for the month of September 2010, to look for patterns in the processing. These plots include any kind of processing (stdscience, update, pstamp, etc). I couldn't really find any patterns, other than that we did more processing on 24-25 (photfest), and there was a pile of data that went through chip/cam/warp on 8 & 9 (no idea what this is).

Grabbing the data

Grabbing the data to plot the throughput was a bit complicated. Not all the stages have times in them, and some of them that do have inconsistent time zones, or are not used at all.

stages with no columns relating to times

  • chipRun
  • camRun
  • magicDSRun

stages with incomplete time information

  • warpRun (all values since 2008 are NULL)

stages with epoch (timestamp) time information

  • summitExp
  • newExp
  • rawExp
  • fakeRun
  • flatcorrRun

stages with registered (with time set by a script) time information

  • stackRun
  • diffRun
  • magicRun
  • chipBackgroundRun
  • detRun
  • diffPhotRun
  • dqstatsRun
  • staticskyRun

stages with time_stamp (timestamp)

  • distRun

I did not use all of these stages/tables. For the stages that I used, I discovered that epoch uses HST, and registered uses UTC.

chip, cam, warp were stages with missing information. To get this information, I grabbed a list of chips with chip_id > some_number, and did neb-stat on the log file to find out when it was created. I did similar queries for cam and warp.

For stack, diff, magic, registered refers to when they were added to the db to be processed, not when they were processed. This is why there are spikes for stacks, they are all queued up at the same time each day.

The plots

Color code

  • red - epoch (in gpc1 db, it is a TIMESTAMP)
  • cyan - file creation times were grabbed from the log files (slow).
  • purple - registered (in gpc1 db)
  • black - time_stamp (in gpc1 db, it is a TIMESTAMP)

Black vertical lines are for 6am HST for each day, gray vertical lines are for midnight and noon HST. The numbers on the bottom refer to the date.

  • I haven't finished grabbing all chip and warp dates for September...

Attachments