FAS Research Computing - seas_gpu partition occasional pre-emption issue – Incident details

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
https://docs.rc.fas.harvard.edu | https://portal.rc.fas.harvard.edu | Email: rchelp@rc.fas.harvard.edu


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

seas_gpu partition occasional pre-emption issue

Resolved
Operational
Started about 2 months agoLasted about 2 months

Affected

Cannon Cluster

Degraded performance from 2:01 PM to 2:13 PM

seas_compute

Degraded performance from 2:01 PM to 2:13 PM

Updates
  • Resolved
    Resolved

    This issue has not reoccurred and appears to be due to unique cluster state. We are resolving this but will continue to monitor for recurrence.

  • Identified
    Identified
    We are investigating an issue with the seas_gpu queue where backfill jobs sometimes are not pre-empted. We have filed a bug with SchedMD and are monitoring.